Turning Data into Actionable Insights
Python | R | SQL | Git | AWS | Hadoop | PyTorch | LSTM | SNN | Tableau
I’m Anastasiya — a data scientist and Python developer with a strong foundation in business and digital marketing, I now focus on delivering AI-powered solutions that drive measurable outcomes through data analysis, automation, and customer insight. My experience spans behavioral analytics, campaign optimization, and reporting—translating complex data into actionable strategies that align with business goals and user engagement.
This portfolio highlights hands-on projects in predictive modeling, Hadoop MapReduce, neuromorphic computing (SNNs using PyTorch & Norse), and time-series forecasting with LSTM.
Built a machine learning model to classify text as human-written or AI-generated using a custom dataset I created and published on Kaggle (2,500 human-written vs 2,500 AI texts).
Tools: TF-IDF, Joblib, Logistic Regression, SVC, Naive Bayes, Random Forest
View on GitHub | View on KaggleSimulated portfolio rebalancing using historical stock price data for a $5 million equity fund. Evaluated daily vs. periodic rebalancing performance.
Tools: R, Portfolio Theory, Time Series
View on GitHubDeployed Hadoop on AWS EC2 and executed a MapReduce job to analyze 1M+ ratings from MovieLens dataset using custom Java classes.
Tools: Java, AWS EC2, Hadoop, MapReduce
View on GitHubLed implementation of SNNs using PyTorch and Norse, benchmarking performance against traditional ANNs using SHD/N-MNIST datasets.
Tools: Python, Norse, Deep Learning, SNN
View on GitHubForecasted COVID-19 regional case trends using LSTM deep learning models to detect surges and support health planning.
Tools: Python, Pandas, LSTM, ML
View on GitHubBuilt a regression model using R to predict housing prices based on historical housing data. Focused on preprocessing, model tuning, and evaluation using performance metrics like RMSE and R².
Tools: R, caret, ggplot2, dplyr
View on GitHubRebuilt and evaluated GRU and LSTM models for predicting solar wind CME events using time series data (12–60 hr windows). Assessed model accuracy using confusion matrices and ROC curves.
Tools: Python, TensorFlow, Keras, GRU, LSTM, Time Series, ROC-AUC, Confusion Matrix
View on GitHubWith a foundation in business, logistics, healthcare, IT, and marketing, I approach data science through a deeply interdisciplinary lens. My work blends real-world industry insight with technical skills in machine learning and data analysis to transform complex problems into clear, actionable strategies.
I'm currently working at NJIT as an IT Technician, where I support technology operations and streamline workflows while pursuing my MS in Data Science. This role allows me to stay hands-on with campus systems and user experience, adding practical IT infrastructure knowledge to my data-focused training.
I'm passionate about using data to drive ethical AI, biotech innovation, and operational efficiency. Whether it’s streamlining processes or uncovering key insights, I aim to create smart, impactful solutions that empower both businesses and communities.
📜 Graduate Certificate in Big Data Essentials – NJIT, May 2025
Completed 4-course graduate-level program focused on large-scale data processing, distributed computing, and real-world applications using Hadoop, MapReduce, and Apache Spark.
View Certificate (PDF) |
Verify via Parchment
If you'd like to connect or collaborate, feel free to reach out:
AnastasiyaKotelnikova21@gmail.com