Kateryna Hrytsaienko
Multilingual NLP pipeline up and running from scratch
#1about 3 minutes
The challenge of building end-to-end NLP pipelines
There is a lack of comprehensive guides for integrating multilingual NLP models into applications with proper CI/CD practices, especially for non-English languages.
#2about 5 minutes
Understanding the core components of an NLP pipeline
A typical NLP pipeline consists of three key stages: pre-processing, feature extraction, and modeling, with pre-processing being critical for handling unstructured data.
#3about 8 minutes
Why simply translating everything to English is not enough
Translating all text to English for NLP analysis can decrease accuracy by up to 20% due to lost semantic nuance and dialectical differences.
#4about 10 minutes
Generalizing languages with stemming and bag-of-words
Handle similar languages by using stemming to find common root words and a bag-of-words model with a similarity index to treat them as a single language.
#5about 5 minutes
Achieving high accuracy with a unified language model
By training classifiers on stemmed and normalized vectors from multiple similar languages, it's possible to achieve high accuracy of around 90% in tasks like topic classification.
#6about 8 minutes
Choosing the right deployment strategy for your model
Decide between embedding your model or exposing it as an API, considering options like serverless for simple cases or Kubernetes for scalable, cloud-agnostic deployments.
#7about 7 minutes
Implementing a CI/CD pipeline for your NLP model
Establish an MLOps workflow with continuous training, integration, and delivery by containerizing your model with Docker and automating builds with tools like GitHub Actions.
#8about 6 minutes
Q&A on slang processing, debugging, and transformers
The Q&A covers practical advice on handling slang with dictionaries, debugging with robust logging, and understanding the complexity gap between traditional methods and transformers like BERT.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
22:33 MIN
Automating the data pipeline with multi-cloud services
Leverage Cloud Computing Benefits with Serverless Multi-Cloud ML
03:32 MIN
Building an AI-powered app with Rust and Docker
Coffee with Developers - Francesco Ciulla
21:06 MIN
Practical applications for real-time Python data pipelines
Python-Based Data Streaming Pipelines Within Minutes
25:15 MIN
Creating a scalable translation mapping solution
Unleashing the power of AI to prevent financial crime
02:38 MIN
Common challenges in developing machine learning applications
Data Fabric in Action - How to enhance a Stock Trading App with ML and Data Virtualization
30:09 MIN
Deploying the machine learning model with Docker
Leverage Cloud Computing Benefits with Serverless Multi-Cloud ML
27:46 MIN
Key takeaways for modern data processing
Convert batch code into streaming with Python
14:34 MIN
AI engineering does not always require Python
WeAreDevelopers LIVE: What's happening to React?, All-in-one editors, Fireships and Firebases & more
Featured Partners
Related Videos
DevOps for AI: running LLMs in production with Kubernetes and KubeFlow
Aarno Aukia
Creating Industry ready solutions with LLM Models
Vijay Krishan Gupta & Gauravdeep Singh Lotey
What do language models really learn
Tanmay Bakshi
Multimodal Generative AI Demystified
Ekaterina Sirazitdinova
Overview of Machine Learning in Python
Adrian Schmitt
From Traction to Production: Maturing your LLMOps step by step
Maxim Salnikov
The state of MLOps - machine learning in production at enterprise scale
Bas Geerdink
The best of both worlds: Combining Python and Kotlin for Machine Learning
Nils Kasseckert
From learning to earning
Jobs that call for the skills explored in this talk.




Senior Systems/DevOps Developer (f/m/d)
Bonial International GmbH
Berlin, Germany
Senior
Python
Terraform
Kubernetes
Elasticsearch
Amazon Web Services (AWS)


Cloud Engineer (m/w/d)
fulfillmenttools
Köln, Germany
€50-65K
Intermediate
TypeScript
Google Cloud Platform
Continuous Integration


DevOps Engineer – Kubernetes & Cloud (m/w/d)
epostbox epb GmbH
Berlin, Germany
Intermediate
Senior
DevOps
Kubernetes
Cloud (AWS/Google/Azure)


Senior Backend Engineer – AI Integration (m/w/x)
chatlyn GmbH
Vienna, Austria
Senior
JavaScript
AI-assisted coding tools




Security-by-Design for Trustworthy Machine Learning Pipelines
Association Bernard Gregory
Machine Learning
Continuous Delivery
Data Scientist- Python/MLflow-NLP/MLOps/Generative AI
ITech Consult AG
Zürich, Switzerland
Azure
Python
PyTorch
TensorFlow
Machine Learning
+1


