Text Classification 5: Fine Tuning BERT With HuggingFace
Data: arxiv.org
Techniques: Fine Tuning, BERT, Hugging Face
Date: August, 2025
Retrievial Augmented Generation On JFK Speeches: Part 2
Data: JFK Library Website
Techniques: Retrievial Augmented Generation (RAG), LangChain, Pinecone
Date: April, 2025
Retrievial Augmented Generation On JFK Speeches: Part 1
Data: JFK Library Website
Techniques: Aysncio, LangChain, Embeddings, Pinecone, Vector Databases
Date: March, 2025
Building & Deploying A Serverless Multimodal ChatBot: Part 2
Data: N/A
Techniques: LLMs, Docker, DockerHub, GitHub Actions, Google Cloud Run
Date: Jan, 2025
Building & Deploying A Serverless Multimodal ChatBot: Part 1
Data: N/A
Techniques: LangChain, Llama3, Groq, Google Cloud API, Streamlit
Date: Dec, 2024
Creating An AI-Powered JFK Speech Writer: Part 2
Data: JFK Library Website
Techniques: Tensorflow, Keras, Gate Reccurent Units (GRUs), Recurrent Neural Networks
Date: April, 2023
Creating An AI-Powered JFK Speech Writer: Part 1
Data: JFK Library Website
Techniques: Web Scraping, BeautifulSoup, Google Cloud
Date: Dec, 2022
Text Classification 4: Deep Learning With Tensorflow & Optuna
Data: arxiv.org
Techniques: Deep Learning, CNN, NLP, Tensorflow, Keras, Optuna
Date: Nov, 2022
Writing A Scikit-Learn Compatible Clustering Algorithm
Data: Synthetic
Techniques: K-Means clustering, NumPy, Scikit-Learn
Date: May, 2022
Frequentist & Bayesian Statistics With Py4J & PyMC3
Data: Synthetic
Techniques: Scala, Py4J, PyMC3, Maximum Likelihood & Bayesian Methods
Date: May, 2021
Text Classification 3: A Machine Learning Powered Web App
Data:
arxiv.org
Techniques: FastAPI, Bootstrap, Docker, Google Cloud Run
Date: Oct, 2020
Text Classification 2: Natural Language Toolkit
Data:
arxiv.org
Techniques: Natural Language Processing, Hyperparameter tunning
Date: Jan, 2020
Text Classification 1: Imbalanced Data
Data:
arxiv.org
Techniques: Naive Bayes, Imbalanced-Learn, Weighted Support Vector Machines
Date: Jan, 2020
Numerical Linear Algebra In Machine Learning
Data:
NYC Mayor's Office Of Sustainability
Techniques: Regression, Alternating Least Squares, Cholesky & Singular Value Decomp.
Date: Dec, 2019
Sentiment Analysis 2: Machine Learning with Spark
Data:
Twitter
Techniques: Sentiment Analysis, Apache Spark, ML Pipelines, Google Cloud
Date: May, 2019
Sentiment Analysis 1: ETL With Spark and MongoDB
Data:
Twitter
Techniques: Extract-Transform-Load, Apache Spark, MongoDB, PyMongo
Date: April, 2019
GreenBuildings1: Exploratory Analysis & Outlier Removal
Data:
NYC Mayor's Office Of Sustainability
Techniques: Exploratory data analysis, outlier removal, isolation forests
Date: May, 2018
Unix Tools For Data Science
Data: N/A
Techniques: Unix commands, vi, sftp, screen, grep, git
Date: Oct, 2017
ETL Pipelines With Apache Airflow
Data:
OpenWeatherMap API
Techniques: ETL, Apache Airflow, PostgreSQL
Date: Aug, 2017
Recommender Systems
Data:
Amazon Product Data
Techniques: Collaborative fitlering
Date: April, 2017
Intro To Relational Databases
Data: Created in post
Techniques: SQLite, PostgreSQL, SQLAlchemy
Date: April, 2017
Forecasting Crime Rates In New York City
Data:
NYC Open Data
Techniques: Time series, ARIMA, StatsModels, exploratory analysis
Date: Mar, 2017
Interactive Visualizations With Bokeh
Data:
Chicago Data Portal
Techniques: Bokeh, data visualization, GeoPandas
Date: Mar, 2017
Analyzing Wikimedia's Click-Through Rates
Data:
Wikimedia
Techniques: Pandas, analytics, click-through rate analysis
Date: Feb, 2017