Mohammad Sayem Chowdhury
Data Scientist & Machine Learning Engineer
About Me
Passionate data scientist with expertise in machine learning, natural language processing, and advanced analytics. I specialize in transforming complex data into actionable insights through innovative analytical approaches and cutting-edge technology solutions.
My work spans across various domains including predictive modeling, NLP applications, interactive data visualization, and end-to-end machine learning pipeline development. I'm particularly interested in solving real-world problems through data-driven decision making and advanced statistical modeling.
"Data is the new oil, but analytics is the refinery." - This philosophy drives my approach to extracting meaningful value from data and creating solutions that make a real impact.
🎓 Expertise
📊 Portfolio Stats
Portfolio Overview
Comprehensive collection of data science projects demonstrating expertise across multiple domains
📈 Portfolio Overview
Explore a comprehensive collection of data science projects demonstrating proficiency in machine learning algorithms, statistical analysis, data visualization, and real-world problem solving. Each project includes detailed analysis, code implementation, and actionable insights.
🌟 Featured Projects
Handpicked highlights showcasing advanced data science techniques and real-world applications
SpaceX Launch Prediction
End-to-end machine learning pipeline predicting Falcon 9 first stage landing success with 85%+ accuracy using advanced feature engineering and ensemble methods
Car Price Prediction Model
Comprehensive automobile price prediction using regression analysis, feature engineering, and model evaluation with statistical significance testing
Sentiment Analysis NLP
Advanced sentiment classification using transformer models, achieving 92% accuracy with custom preprocessing and deep learning architectures
Interactive Data Visualization
Dynamic interactive dashboards with real-time data visualization, custom plot types, and responsive design using Plotly and Dash frameworks
📁 Complete Project Collection
Comprehensive portfolio organized by domain expertise and technical focus areas
SpaceX Capstone
End-to-end data science project
This notebook is part of my personal data science project. All content and analysis are original and tailored for my own exploration of SpaceX launch ...
Welcome to my personal data science project! This notebook is the first step in a progressive, end-to-end workflow where I explore, collect, clean, an...
This notebook is a key chapter in my end-to-end SpaceX Falcon 9 data science project. After collecting and cleaning launch data, I now dive into SQL-b...
This notebook is a visual chapter in my end-to-end SpaceX Falcon 9 data science project. After collecting, cleaning, and analyzing launch data, I now ...
Machine learning analysis and model development
Capstone_StackOverflow_Survey\data_analysis
Project collection
This notebook is part of my personal data analytics project. Here, I perform exploratory data analysis on survey and job data for my own learning and ...
Capstone_StackOverflow_Survey\data_collection
Project collection
This notebook is part of my personal data analytics project. Here, I collect and analyze job data using APIs for my own learning and portfolio.
In this notebook, I explore and analyze the Stack Overflow survey dataset as part of my data analysis project.
This notebook demonstrates how to use HTTP and the Python Requests library to access and interact with web APIs and resources as part of my data analy...
This notebook implements a simple jobs API using Flask and Python for data access and demonstration purposes.
This notebook demonstrates web scraping and data extraction using Python.
This notebook reviews and practices web scraping techniques using Python.
Capstone_StackOverflow_Survey\data_visualization
Project collection
This notebook focuses on visualizing survey and job data using Python data visualization libraries.
Capstone_StackOverflow_Survey\data_wrangling
Project collection
This notebook covers data wrangling tasks on survey and job data, including cleaning, deduplication, and handling missing values.
Data Analysis
Core data analysis projects
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
ML\Classification
Project collection
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
Machine learning analysis and model development
In this notebook, I explore how K-Nearest Neighbors (KNN) can be used to segment customers based on real-world data. My aim is to understand how diffe...
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
ML\Clustering
Project collection
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
ML\Recommender Systems
Project collection
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
ML\Regression
Project collection
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation and projec...
ML\The Best Classifier
Project collection
Welcome! This notebook is organized by key machine learning topics, progressing from basic to advanced, and is designed for hands-on experimentation a...
NLP\01_Text_Preprocessing
Project collection
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
NLP\02_Basic_NLP_Concepts
Project collection
In this lab, we will focus on the `build_freqs()` helper function and visualizing a dataset fed into it. In our goal of tweet sentiment analysis, this...
In this lecture notebook you will create a vocabulary from a tagged dataset and learn how to deal with words that are not present in this vocabulary w...
In this lecture notebook you will create a matrix using some tag information and then modify it using different approaches.
NLP\03_Classification_Models
Project collection
*By Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
**Objectives:** Visualize and interpret the logistic regression model
NLP\04_Word_Embeddings
Project collection
*By Mohammad Sayem Chowdhury*
In this series of ungraded notebooks, you'll try out all the individual techniques that you learned about in the lectures. Practicing on small example...
In previous lecture notebooks you saw all the steps needed to train the CBOW model. This notebook will walk you through how to extract the word embedd...
In this lecture notebook you will be given an introduction to the continuous bag-of-words model, its activation functions and some considerations when...
In previous lecture notebooks you saw how to prepare data before feeding it to a continuous bag-of-words model, the model itself, its architecture and...
In this ungraded notebook, you'll try out all the individual techniques that you learned about in the lecture. Practicing on small examples will prepa...
NLP\05_Language_Models
Project collection
Estimated Time: 10 minutes
Estimated Time: 20 minutes
NLP\06_Advanced_Techniques
Project collection
In this lab, we are going to practice the most important concepts related to the hash functions explained in the videos. You will be using these in th...
*By Mohammad Sayem Chowdhury*
In this lab, you will have the opportunity to practice once again with the NumPy library. This time, we will explore some advanced operations with arr...
NLP\07_Neural_Networks
Project collection
In this notebook you will learn how to create a siamese model in TensorFlow.
Welcome to the first lab in this course. Here you will see and try out some basics of TensorFlow and get familiar with some of the useful functions th...
In this notebook you'll see how to calculate the full triplet loss, step by step, including the mean negative and the closest negative. You'll also ca...
NLP\08_Sequence_Models
Project collection
In this notebook you will learn how to calculate perplexity. You will calculate it from scratch using `numpy` library. First you can import it and set...
In this notebook you'll take another look at the hidden state activation function. It can be written in two different ways.
NLP\09_Advanced_Projects
Project collection
Welcome to the first assignment of course 3. **This is a practice assignment**, which means that the grade you receive won't count towards your final ...
*By Mohammad Sayem Chowdhury*
Welcome to the second programming assignment of Course 3. In this assignment, you will learn to build more complicated models with Tensorflow. By comp...
*By Mohammad Sayem Chowdhury*
Welcome to the second assignment of Course 2 in the Natural Language Processing specialization. This assignment will develop skills in part-of-speech ...
Welcome to the third assignment of course 3. In this assignment you will explore Siamese networks applied to natural language processing. You will fur...
*By Mohammad Sayem Chowdhury*
You will now implement your first machine translation system and then you
Welcome to the fourth (and last) programming assignment of Course 2!
Natural language processing and text analysis
NLP\Personal_Experiments
Project collection
- **Experiment Name:** [Your experiment name here]
NLP\Utils_and_Resources\Assignment1
Project collection
As you've learned, attention allows a seq2seq decoder to use information from each encoder step instead of just the final encoder hidden state. In the...
In this ungraded lab, you will implement a popular metric for evaluating the quality of machine-translated text: the BLEU score proposed by Kishore Pa...
The 2017 paper [Attention Is All You Need](https://arxiv.org/abs/1706.03762) introduced the Transformer model and scaled dot-product attention, someti...
NLP\Utils_and_Resources\Assignment1\Files\tf
Project collection
Welcome to the first assignment of Course 4. Here, you will build an English-to-Portuguese neural machine translation (NMT) model using Long Short-Ter...
NLP\Utils_and_Resources\Assignment\Files\tf
Project collection
Welcome to the third assignment of course 4. In this assignment you will explore question answering. You will implement the "Text to Text Transfer fro...
NLP\Utils_and_Resources\Lab2\Files\home\jovyan\work
Project collection
In order to process text in neural network models it is first required to **encode** text as numbers with ids, since the tensor operations act on numb...
NLP\Utils_and_Resources\Lab\Files\home\jovyan\work
Project collection
In this lab, you will learn how to implement the positional encoding of words in the transformer.
In this lab, you will implement the masking, that is one of the essential building blocks of the transformer. You will see how to define the masks and...
In this notebook you'll explore the three ways of attention (encoder-decoder attention, causal attention, and bi-directional self attention) and how t...
NLP\Utils_and_Resources\Lab\Files\tf
Project collection
You've seen how to use BERT and other transformer models for a wide range of natural language tasks, including machine translation, summarization, and...
In the previous Hugging Face ungraded lab, you saw how to use the pipeline objects to use transformer models for NLP tasks. In that lab, the model did...
Python Fundamentals
Core Python programming concepts
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
**Author:** Mohammad Sayem Chowdhury
**Author:** Mohammad Sayem Chowdhury
**Author:** Mohammad Sayem Chowdhury
*Author: Mohammad Sayem Chowdhury*
**Author:** Mohammad Sayem Chowdhury
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
Python\Web Scrapping
Project collection
**Author: Mohammad Sayem Chowdhury**
**Author: Mohammad Sayem Chowdhury**
**Author: Mohammad Sayem Chowdhury**
**Author: Mohammad Sayem Chowdhury**
General Projects
Miscellaneous notebooks
SQL & Databases
Database analysis and queries
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*By Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
*Author: Mohammad Sayem Chowdhury*
Data Visualization
Charts, dashboards, and interactive plots
1. Getting to Know the Data
In this notebook, I document my first steps with Dash for building interactive web applications in Python. My goal is to understand the basics and see...
In this notebook, I experiment with building interactive dashboards using Dash. My goal is to learn how to make data visualizations more engaging and ...
In this notebook, I build and analyze a dashboard for flight delay statistics. My goal is to understand the patterns behind flight delays and create v...
Mapping data is one of my favorite ways to make information come alive. Here, I step away from Matplotlib and dive into Folium—a library built for int...
My goal is to become well-rounded with Python visualization libraries and concepts, so I can choose the best technique and tool for any data problem o...
1. Exploring Data with pandas
