Samu Syrjänen
I’m a Data Science Master’s student at the University of Helsinki. I have 1.5 years of paid work experience as a data engineer and ML engineer from University of Helsinki and Aalto University. My work experience mainly consists of data science and data engineering related tasks such as developing ML algorithms, analyzing data, and making automated pipelines for various data processing tasks. I have a Computer Science Bachelor's Degree and I have also studied some Physics (33 cr ECTS) and Mathematics (65 cr ECTS) on the side. I also know a thing or two about sailboats and Leopard 2A6 battle tanks, and how to lead their crews.
I'm looking for long-term work opportunities to gain experience and develop more specialized skills. Future career interests include working with data pipelines, analytics, cloud platforms, and machine learning to provide solutions for product development, marketing, and business intelligence problems. Besides the technical roles, I'm also able to work in business administration or as a product manager, where a more tech-heavy background might sometimes be beneficial.
- Strengths
- - Enjoy keeping all aspects of data and data pipelines clean and organized to ensure fault tolerance while minimizing human error
- - Strong interest in data engineering skills and tools
- - Experienced with coordinating tasks and requirements between international teams
- - Education and work experience revolved around data and software since 2019
- - Practical project and work experience with databases, algorithms, and data pipelines
- - Degree structure has a strong emphasis on software engineering, machine learning, mathematics, and statistics
- - Practical experience and training with leading sailboat and battle tank crews
- Weaknesses
- - Only ~1.5 year's worth of paid work experience
- - No expert-level talent in any specific niche yet
Looking for work!
Career
Research Assistant, Data Engineer
Aalto University
As part of Jaan Praks research group at Aalto University and ESA's Hera space mission, I'm responsible for creating a pipeline to clean, calibrate, and analyze hyperspectral image data, and turn it into final data products. The data is received from the ASPECT Hyperspectral Imager on the Hera/Milani space probe. The mission's objective is to gather data about the Didymos binary asteroid system, which was previously in 2022 shot with a DART spacecraft in hopes to succesfully redirect the Didymos's moon, Dimorphos, and to study it's effects. The larger aim is to research this kind of asteroid redirection as a means for planetary defence.
My work entails plenty of data analysis and cross-national coordination between teams working on this project. After the data calibration, my pipeline analyzes the hyperspectral images with a convolutional neural network, which will allow us to gain insights into the mineral composition of the target asteroids Didymos and Dimorphos binary asteroid system. The data will be further analyzed with various methods, and documented in scientific literature. The images and derived information will additionally be used to make a reconstructed 3D model containing all related information.
Related to this position, I attended a PDS4 workshop held in ESAC, Madrid, that gave a deep dive into the Planetary Data System (PDS4) archiving format. PDS4 is the latest standard used in NASA's and ESA's scientific data archives.
Aspect Hyperspectral Imager Job CertificateResearch Assistant, ML Engineer
University of Helsinki
As part of this space weathering research project, I created a Convolutional Neural Network enhanced Gaussian Process algorithm for estimating asteroid surface age. The algorithm uses asteroid hyperspectral reflectance spectra to give an age estimate. It stands out as a surprisingly flexible algorithm to predict outcomes even with a sparse training set, as in our case.
More information in the journal article.
Journal Article Job CertificateBachelor's Degree in Computer Science
University of Helsinki
Products
Skills
- Python
- Scrum
- Github
- Data Cleaning and Processing
- Databricks
- [ML] Gaussian Process
- SQL
- ML training/Hyperparameter Optimization
- [ML] Local Binary Pattern
- Agile Development Methodologies
- [ML] K-Means Clustering
- HTML & CSS
- Lakehouse, Medallion Architecture
- ETL/ELT Pipelines
- Spark
- [ML] Convolutional Neural Networks (CNN)
- PyTorch
- Databases
- Software Architectures
- Automatic Testing
- Data Encryption
- Exploratory Data Analysis (EDA)
- AWS
- [ML] Feature Selection Methods
- [ML] Linear Regression
- [ML] Decision Trees
- [ML] Random Forests
- [ML] Gradient Boosting
- OpenCV (Computer Vision)
- JavaScript/TypeScript
Languages
- English CEFR C1
- Finnish Native
- Japanese Beginner
Contact Me
samu.syrjanen@gmail.com