Well-versed in digging through data to find key insights and curating a compelling story from complex analyses, passionate about delving into data from different systems, at different timescales, and in complex formats to uncover hidden relationships.
Machine Learning knowledge acquired from personal experimentation with Spark: Linear / Logistic Regression, Decision Trees, NaiveBayes, Alternating Least Squares (Recommender Systems), TF-IDF
Professional Background (formerly): ETL Developer / Traditional DWHs / Kimball's Methodology
Experienced Data Scientist.
Keywords: Apache Spark, scaling algorithms.
Well-versed in digging through data to find key insights and curating a compelling story from complex analyses, passionate about delving into data from different systems, at different timescales, and in complex formats to uncover hidden relationships.
Machine Learning knowledge acquired from personal experimentation with Spark: Linear / Logistic Regression, Decision Trees, NaiveBayes, Alternating Least Squares (Recommender Systems), TF-IDF
Professional Background (formerly): ETL Developer / Traditional DWHs / Kimball's Methodology
Computer Science Skills / Core: Data Structures, Algorithms, Functional Programming Paradigm, Relational Databases
Big Data Framework / Core: Spark
Big Data / Other: Apache Kafka => Spark Streaming from Kafka topics
Source Control: GitHub
Source Control / Other: BitBucket
DevOps / Other: Docker / DockerHub
Programming Languages / Core: Python, Scala
Programming Language / Other: Haskell
Keen interest in experimenting with open-source Big Data technologies.
E-mail address in the profile.