Daniel Madalitso Phiri
Vision for Websites: Training Your Frontend to See
#1about 1 minute
Defining vision as the ability to deduce and understand
The concept of vision for websites is redefined from simply seeing to the ability to deduce, understand, and act on information.
#2about 4 minutes
Demo of a multimodal e-commerce search application
A live demonstration showcases an e-commerce store where users can search for products using both text queries and by uploading images.
#3about 2 minutes
What is multimodality in artificial intelligence?
Multimodality enables search queries to use multiple media types like text, images, and audio to capture more context and improve user interaction.
#4about 2 minutes
Why multimodal AI creates richer user experiences
Multimodal interfaces provide more natural and context-aware interactions, moving beyond simple keyword searches to a more intuitive experience.
#5about 4 minutes
Differentiating generative AI from embedding models
Embedding models encapsulate information into numerical representations (vectors), unlike generative models which create new data.
#6about 4 minutes
How vector search works by measuring distance
Vector search operates by converting a query into an embedding and finding the closest, most semantically similar items in a multidimensional space.
#7about 2 minutes
Creating a unified space for multimodal search
Different data types like text, images, and audio are processed by specific encoders and plotted into a single, unified vector space for cross-modal queries.
#8about 9 minutes
Implementing text-based image search with Weaviate
A code walkthrough demonstrates how to build a text-to-image search feature using a Next.js frontend and a Weaviate backend with a `nearText` query.
#9about 4 minutes
Implementing visual search with an image query
The code for an image-to-image search is explained, showing how a base64 image is sent to the backend to perform a `nearImage` vector search.
#10about 2 minutes
Expanding vision to other creative applications
Beyond e-commerce, multimodal vision can be applied to creative use cases like movie recommenders, educational tools, and map navigation.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
1:04:01 MIN
Exploring modern tools for web interaction and analysis
WeAreDevelopers LIVE - the weekly developer show with Chris Heilmann and Daniel Cranney
17:41 MIN
Presenting live web scraping demos at a developer conference
Tech with Tim at WeAreDevelopers World Congress 2024
48:13 MIN
The future of web development is faster and simpler
The Eternal Sunshine of the Zero Build Pipeline
29:30 MIN
An overview of the vector database market
What comes after ChatGPT? Vector Databases - the Simple and powerful future of ML?
06:28 MIN
A full-stack architecture for streaming AI responses
Streaming AI Responses in Real-Time with SSE in Next.js & NestJS
07:14 MIN
Will AI agents make traditional web design obsolete
WAD Live 22/01/2025: Exploring AI, Web Development, and Accessibility in Tech with Stefan Judis
08:31 MIN
Learning front-end development by recreating existing websites
WeAreDevelopers LIVE - Gaps in CSS, EU Accessibility Act and more!
00:05 MIN
Demonstrating the future of software with natural language
Best practices: Building Enterprise Applications that leverage GenAI
Featured Partners
Related Videos
WeAreDevelopers LIVE - the weekly developer show with Chris Heilmann and Daniel Cranney
WAD Live 22/01/2025: Exploring AI, Web Development, and Accessibility in Tech with Stefan Judis
Build UIs that learn - Discover the powerful combination of UI and AI
Eliran Natan
Virtual Reality – The path to create your world
Drishti Jain
Modern Web Development with Nuxt3
Alexander Lichter
Web APIs you might not know about
Sasha Shynkevich
Explore new web features before everyone else
Nikita Dubko
Vuejs and TypeScript- Working Together like Peanut Butter and Jelly
Rob Richardson
From learning to earning
Jobs that call for the skills explored in this talk.


Software developer (AI/Computer Vision)
Fastview360 Ltd
Stapeley, United Kingdom
Remote
€40K
C++
.NET
Azure
+5






Front-End Engineer Vue.Js - IA et Machine Learning - Fullremote H/F
OCTOPUS IT
Paris, France
Remote
€50-68K
C++
Python
Vue.js
+2












