Daniel Madalitso Phiri

Vision for Websites: Training Your Frontend to See

Build web apps that see. Learn how to implement powerful visual search with vector embeddings in just a few lines of code.

#1 · about 1 minute

Defining vision as the ability to deduce and understand

The concept of vision for websites is redefined from simply seeing to the ability to deduce, understand, and act on information.

#2 · about 4 minutes

Demo of a multimodal e-commerce search application

A live demonstration showcases an e-commerce store where users can search for products using both text queries and by uploading images.

#3 · about 2 minutes

What is multimodality in artificial intelligence?

Multimodality enables search queries to use multiple media types like text, images, and audio to capture more context and improve user interaction.

#4 · about 2 minutes

Why multimodal AI creates richer user experiences

Multimodal interfaces provide more natural and context-aware interactions, moving beyond simple keyword searches to a more intuitive experience.

#5 · about 4 minutes

Differentiating generative AI from embedding models

Embedding models encapsulate information into numerical representations (vectors), unlike generative models which create new data.

#6 · about 4 minutes

How vector search works by measuring distance

Vector search operates by converting a query into an embedding and finding the closest, most semantically similar items in a multidimensional space.
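The ranking step can be sketched in a few lines. This is a toy illustration, not the talk's implementation: the three-dimensional vectors are hand-picked stand-ins for real embeddings, and items are ranked by cosine similarity to the query vector.

```typescript
type Item = { name: string; vector: number[] };

// Cosine similarity: 1 means the vectors point the same way, 0 means unrelated.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Toy "catalog": these vectors are made up for illustration.
const items: Item[] = [
  { name: "red sneakers", vector: [0.9, 0.1, 0.0] },
  { name: "blue sandals", vector: [0.2, 0.8, 0.1] },
  { name: "running shoes", vector: [0.8, 0.2, 0.1] },
];

// Rank every item by how close its vector is to the query's embedding.
function search(queryVector: number[], catalog: Item[]): Item[] {
  return [...catalog].sort(
    (a, b) =>
      cosineSimilarity(queryVector, b.vector) -
      cosineSimilarity(queryVector, a.vector)
  );
}

console.log(search([1, 0, 0], items).map((r) => r.name));
```

Production systems replace the brute-force sort with an approximate-nearest-neighbour index so search stays fast across millions of items.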

#7 · about 2 minutes

Creating a unified space for multimodal search

Different data types like text, images, and audio are processed by specific encoders and plotted into a single, unified vector space for cross-modal queries.

#8 · about 9 minutes

Implementing text-based image search with Weaviate

A code walkthrough demonstrates how to build a text-to-image search feature using a Next.js frontend and a Weaviate backend with a `nearText` query.
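A minimal sketch of what such a backend handler might look like, assuming a local Weaviate instance and a collection named `Products` configured with a multimodal vectorizer (the collection name and returned properties are assumptions, not taken from the talk):

```typescript
import weaviate from "weaviate-client";

// Hypothetical sketch: nearText embeds the user's query string and
// returns the objects whose vectors sit closest to it.
async function searchByText(query: string) {
  const client = await weaviate.connectToLocal();
  const products = client.collections.get("Products");

  const result = await products.query.nearText(query, {
    limit: 5,
    returnProperties: ["name", "image"],
  });

  await client.close();
  return result.objects;
}
```

In a Next.js app this would typically live in an API route or server action, with the frontend sending the search box's text and rendering the returned product objects.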

#9 · about 4 minutes

Implementing visual search with an image query

The code for an image-to-image search is explained, showing how a base64 image is sent to the backend to perform a `nearImage` vector search.
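The image path follows the same shape. In this hedged sketch (same assumed `Products` collection as above), the frontend encodes the uploaded file as base64 and the backend passes it to a `nearImage` query:

```typescript
import weaviate from "weaviate-client";

// Hypothetical sketch: nearImage embeds the uploaded image and finds
// the most visually similar items in the same vector space.
async function searchByImage(base64Image: string) {
  const client = await weaviate.connectToLocal();
  const products = client.collections.get("Products");

  const result = await products.query.nearImage(base64Image, {
    limit: 5,
    returnProperties: ["name", "image"],
  });

  await client.close();
  return result.objects;
}
```

Because text and images are embedded into one unified vector space, both queries search the same collection; only the encoder that produces the query vector changes.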

#10 · about 2 minutes

Expanding vision to other creative applications

Beyond e-commerce, multimodal vision can be applied to creative use cases like movie recommenders, educational tools, and map navigation.
