Internship - Data Science Engineer
Interested in building a better product by using data? Whatever we do, whether it’s decision making, or dealing with machine learning problems, we need data.
In Kurio, we’re partnering with hundreds of publishers. We collect bunch of data from our mobile application. These make us a fun place for doing data science projects, whether it’s a simple classification problem, or complex recommender system.
In this team, you will get to work on both engineering-related tasks and data science-related tasks.
The data-science-related tasks might include:
- Maintain, and if possible, improve existing models (retrain) used in production environment
- Opportunities for researching state-of-the-art machine learning models (i.e. Named-Entity-Recognition)
- Working with data analysis projects along with our data analysts
The engineering-related tasks are developing services that use the model in the production environment. Some examples of engineering-related tasks include:
- Building simple APIs that utilize machine learning model.
- Build services that further enhance the newsfeed algorithm (i.e. deduplication, personalization).
- Some ETL jobs
We write 2 kinds of code:
-
Scripting code
A script to get the job done. Just remember that eventually, somebody else will take a look at your code, and that someone could be your future self. Try to be mindful and write a readable code.
-
Production-level code
No slow-performing codes are allowed. Fully comply with PEP8.
What it’s like working at PT Kurio
- We value people that keeps learning
- We appreciate the extra miles given
- All engineers have access to all of our repositories
What you will do
- Gathering and Cleaning data
- Modeling machine learning models
- Build API that uses the machine learning model
Qualifications
- Kurio’s values (Get sh*t done, Be Bold, Stay Curious, Excellence, Collaboration)
- Basic knowledge in computer-science is a must (things like data structure, OOP)
- Have strong passion on Machine Learning, both theory and implementation
- Have a foundation on machine learning concepts and algorithms (text-related is preferred)
- Constantly learning, exploring, and implementing new technologies
- Python for data scientist
- SQL vs NoSQL Database (i.e. MySQL, MongoDB)
- Familiar with Git
Preferred qualifications
- Able to scrape website for gathering training data
- Knowledge on several data-science-related tools
Suitable for students from:
- Computer Science
- Computer Science - Mathematics
- Computer Science - Statistics
Extra suitable for stream:
- Artificial Intelligence, Natural Language Processing, Computer Vision