Data Science Engineer
Interested in building a better product by using data? Whatever we do, whether it’s decision making, or dealing with machine learning problems, we need data.
In Kurio, we’re partnering with hundreds of publishers. We collect bunch of data from our mobile application. These make us a fun place for doing data science projects, whether it’s a simple classification problem, or complex recommender system.
Oh, and mostly, we use Python.
The Data Science Team
In this team, you will get to work on both engineering-related tasks and data science-related tasks.
The data-science-related tasks might include:
- Maintain, and if possible, improve existing models (retrain) used in production environment
- Opportunities for researching state-of-the-art machine learning models (i.e. Named-Entity-Recognition)
- Working with data analysis projects along with our data analysts
The engineering-related tasks are developing services that use the model in the production environment. Some examples of engineering-related tasks include:
- Building simple APIs that utilize machine learning model.
- Build services that further enhance the newsfeed algorithm (i.e. deduplication, personalization).
- Some ETL jobs
We write 2 kinds of code:
-
Scripting code
A script to get the job done. Just remember that eventually, somebody else will take a look at your code, and that someone could be your future self. Try to be mindful and write a readable code.
-
Production-level code
No slow-performing codes are allowed. Fully comply with PEP8.
What it’s like working at PT Kurio
- We value people that keeps learning
- We appreciate the extra miles given
- All engineers have access to all of our repositories
What it’s like working with this team
- Building machine learning models used in production environment
- Opportunities for researching state-of-the-art models (i.e. named-entity-recognition)
- Maintain, and if possible, improve existing models (retrain)
- Scripting
- web scraping, for gathering data
- modeling for training machine learning models
- data exploration for data analysis
What you will do
- Data exploration and analysis
- Bunch of classifiers (gathering data and training models)
- Clustering
- Newsfeed (incld. recommender engine)
- Building services that utilize the machine learning model
Qualifications
- Kurio’s values (Get sh*t done, Be Bold, Stay Curious, Excellence, Collaboration)
- Basic knowledge in computer-science is a must
- Have a foundation on machine learning concepts and algorithms (text-related is preferred)
- Python
- Able to scrape website for gathering training data (and cleaning)
- Familiar with one or more databases (SQL and/or NoSQL, i.e. MySQL and/or MongoDB)
- Familiar with Git workflow and its principles
Preferred qualifications
- Knowledge on containerized-solutions (docker, kubernetes) is a bonus
- Experienced on software architecture
- Experienced on production-level machine learning systems
- Knowledge on dev-ops practices