Talk
Registration required!
October 6, 2020
2:30 pm
3:30 pm
(CET)

PySpark: Combining Machine Learning and Big Data

Powered by
No items found.

About the session

With the ever-increasing flow of data, comes the industry focus on how to use those data for driving business & insights; but what about the size of the data these days, we have to deal with? The more cleaner data you have, its good for training your ML (Machine Learning) models, but sadly neither the world feeds you clean data nor the huge amount of data is capable of fast processing using common libraries like Pandas etc. How about using the potential of big data libraries with support in Python to deal with this huge amount of data for deriving business insights using ML techniques? But how can we amalgamate the two? Usually people in the ML domain prefer using Python; so combining the potential of Big Data technologies like Spark to supplement ML is a matter of ease with PySpark (a Python package to use the Spark’s capabilities).

About the speaker

Ayon Roy
Ayon Roy
LuLu International Exchange

Watch recording

Registration required!

Save your spot

6 Oct
,
2:30 pm
3:30 pm
(CET)
Save my spotSave my spotSave my spotSave my spot
Code of Conduct
WeAreDevelopers welcomes everyone and is dedicated to defending anybody from harassment, regardless of gender, gender identity, and expression, sexual orientation, disability, physical appearance, body size, race, age or religion.
Read more
Diversity & Inclusion
At the WeAreDevelopers Events we empower underrepresented groups by giving them the stage to share their knowledge and experiences. It is crucial for our international events to bring together the perspectives of people with different backgrounds.
Read more