Hello, I'm Shreeram Venkatesh 🖐️

A Data Visionary fueled by Passion, Driven by Innovation, and Dedicated to Unraveling the Complexities of Data Science with Genuine Enthusiasm 🏆

About

With a relentless passion for data analysis and an unyielding drive for innovation, I, Shreeram Venkatesh, am committed to unraveling the complexities of data science. Armed with a Master's in Data Science from the esteemed Illinois Institute of Technology and a Bachelor's in Electronics and Communication Engineering from SRM IST, Chennai, I bring a blend of academic prowess and industry experience to the table. My journey spans from spearheading teams to achieve remarkable accuracy in machine learning systems to transforming data analytics processes, resulting in substantial revenue boosts and operational efficiencies.

As a dedicated data professional, I have mastered a diverse array of tools and technologies, including SQL, Python, AWS, Azure, Power BI, and Tableau. My portfolio showcases a rich tapestry of projects, from E-Commerce Market Analysis to Telecom Churn Rate Data Warehousing, each demonstrating my unwavering commitment to excellence and innovation. With a genuine enthusiasm for uncovering insights and a penchant for pushing the boundaries of what's possible, I am poised to make a meaningful impact in the dynamic field of data science. Let's embark on a journey of discovery together, where the possibilities are as limitless as the data itself.

  • Programming Languages: SQL, Python, R, C/C++, Embedded C, DAX, and Java
  • Data Visualization: Power BI, Tableau
  • Machine Learning, Deep Learning
  • Cloud Computing: Microsoft Azure, Data Engineering, Data Lake, Databricks, AWS SageMaker, AWS Lambda, AWS S3, ETL, Hadoop, AWS, Big Data
  • Microsoft Excel, MS Word, Data Wrangling, FileMaker, Google Classroom
  • Software Platforms: MySQL, PostgreSQL, SAP, SaaS, MATLAB, Jupyter Notebook, Eclipse, GitHub, VS Code, Streamlit

Seeking a dynamic role that merges my expertise in Data Analysis, Data Science, and Data Engineering, where I can contribute to challenging projects while fostering professional development and personal growth. Eager to leverage my skills in a stimulating environment that offers diverse experiences and opportunities for advancement.

Experience

Data Analyst Intern
  • Spearheaded the development of an innovative real-time Machine Learning price guidance system at Labelmaster, leveraging over 10 million client records to drive exceptional revenue optimization.
  • Employed advanced Machine Learning models and statistical hypothesis techniques to craft personalized price guidance strategies, resulting in a remarkable 95% accuracy rate and a substantial revenue increase.
  • Orchestrated seamless cross-functional collaboration, validating models and integrating the system with precision. This meticulous approach led to an X% surge in revenue.
  • Communicated actionable insights effectively through a Power BI dashboard, enhancing operational efficiency by 25%.
  • Engineered a user-friendly application with Streamlit, enabling stakeholders to make informed decisions swiftly. These initiatives collectively propelled organizational growth and success.
  • Tools: SQL, Microsoft Excel, Power BI, Python, Machine Learning, Web Development, Microsoft Word, Data Cleaning, Unsupervised Learning, Statistical Hypothesis Testing, Data Analysis, Project Management, Business Development
Jan 2024 - May 2024 | Chicago, IL, U.S.A
Graduate Teaching Assistant
  • As a Graduate Teaching Assistant for the CS-425 Database Organization course at Illinois Institute of Technology, I collaborated closely with professors to enhance classroom experiences and led engaging discussion sections, deepening the understanding of over 140 students in complex database concepts and technologies such as SQL, MySQL, and PostgreSQL.
  • Provided personalized support during office hours, meticulously evaluated and graded homework and projects, and offered constructive feedback to promote academic growth.
  • Guided more than 25 teams in building and optimizing database systems, resulting in significantly improved data management practices.
  • Actively contributed to curriculum development by incorporating insights from student interactions, mentored students on crafting efficient, scalable queries, and implemented best practices for database modeling and performance tuning, all while ensuring meticulous attention to detail during exam proctoring.
  • Tools: SQL, MySQL, Database Management System (DBMS), Relational Databases, PostgreSQL, Microsoft Word, Microsoft Excel, OLAP, Data Analysis, Project Management
August 2023 - May 2024 | Chicago, IL, U.S.A
Graduate Student Assistant
  • Administered standardized assessments to determine admission into the CPS selective enrollment elementary program, and performed data synchronization, data entry, and data validation in Microsoft Excel to ensure data accuracy and completeness.
  • Skills Used: CRM, FileMaker, Data Entry, Microsoft Excel
October 2023 - May 2024 | Chicago, IL, U.S.A
Data Analyst
  • Spearheaded data transformation initiatives at KPIT Technology Limited, optimizing ETL processes in the AUTOSAR BSW and RTE layers using cloud-based solutions, Git pipelines, and SQL. This resulted in an 80% enhancement in data synchronization and system performance.
  • Utilized advanced ML models to process AUTOSAR diagnostic data, achieving a remarkable X% boost in diagnostic accuracy.
  • Crafted intricate Python scripts to transform AUTOSAR Ethernet Time Synchronization, leading to a 60% increase in project automation and orderly delivery of deliverables.
  • Consolidated data from various automotive teams (CAN, LIN, and Ethernet teams), ensuring quality and consistency, significantly reducing integration time.
  • Leveraged data visualization tools to build dashboards that surfaced actionable insights, empowering stakeholders and accelerating project delivery.
  • Engaged in Agile and Scrum methodologies, optimizing project management efficiency through adept use of project management tools.
  • Skills Used: R, Microsoft Excel, Tableau, Python, Data Visualization, AWS (S3, EMR, Glue, RedShift), SQL, Apache Spark, Teradata, Adobe Analytics
February 2020 - May 2022 | Bangalore, Karnataka, India
Data Analyst Intern
  • Developed an end-to-end pipeline for ADAS insights at KPIT using Azure Data Lake, SQL, Python, Databricks, and Tableau, increasing customer satisfaction by 90%.
  • Coordinated telemetry data projects with XGBoost and clustering, boosting predictive accuracy by 60%.
  • Defined KPIs and created interactive Tableau dashboards, improving data visualization effectiveness and decision-making by 25%.
  • Skills Used: Python, scikit-learn, Azure, Azure Data Lake, SQL, Databricks, Tableau, Teradata, Machine Learning
March 2019 - January 2020 | Bangalore, Karnataka, India

Projects

E-Commerce-Market Analysis and Insights

Led E-commerce analysis in R, employing EDA, Random Forest, PCA, and hypothesis testing to unveil user behaviour, identify popular products, and predict trends. Developed predictive models for actionable insights.

Accomplishments
    Skills Used:
  • R
  • EDA
  • PCA
  • Random Forest
  • Feature Engineering
  • Feature Selection
  • RFE
  • Cross-validation
  • Hyperparameter tuning
Cloud-Powered Mobile Price Prediction: Using AWS SageMaker and Lambda for Seamless ML Pipeline

This project involves building a sophisticated machine learning pipeline with Amazon SageMaker and AWS services. Starting with infrastructure setup, the process initializes the Boto3 SDK to create an S3 bucket and upload data to SageMaker local storage. Data exploration follows, leading to partitioning into train/test CSV files and re-uploading to the S3 bucket. A training script executes within a SageMaker container, producing model artifacts stored in the S3 bucket. Deployment focuses on operationalizing the trained model via a SageMaker Endpoint. This entails developing a Lambda function, establishing a REST API in API Gateway, deploying the API, and comprehensive testing using tools like Postman and Python scripts. This project showcases proficiency in AWS services, machine learning pipeline development, and deploying ML models as RESTful APIs for practical applications.
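
As a rough illustration of the Lambda step described above, the sketch below shows one way a handler might forward an API Gateway request to a SageMaker endpoint. The request shape (a `features` list), the `text/csv` content type, and the `build_handler` helper are assumptions made for illustration, not details taken from the project itself.

```python
import json


def build_handler(runtime_client, endpoint_name):
    """Return a Lambda-style handler bound to a SageMaker runtime client.

    `build_handler` is a hypothetical helper; the client is passed in so the
    handler can be exercised locally with a stand-in object.
    """

    def handler(event, context=None):
        # API Gateway proxy integration delivers the request body as a JSON string.
        payload = json.loads(event["body"])

        # Assumed request shape: {"features": [numeric values]}.
        csv_row = ",".join(str(value) for value in payload["features"])

        # Forward the row to the deployed model endpoint.
        response = runtime_client.invoke_endpoint(
            EndpointName=endpoint_name,
            ContentType="text/csv",
            Body=csv_row,
        )

        # The response body is a stream; read and decode it.
        prediction = response["Body"].read().decode("utf-8")

        return {
            "statusCode": 200,
            "body": json.dumps({"prediction": prediction}),
        }

    return handler
```

In a deployed function the client would typically come from `boto3.client("sagemaker-runtime")`; passing it in as an argument keeps the handler testable without AWS credentials.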

Accomplishments
    Skills Used:
  • Python
  • AWS
  • S3
  • AWS EC2
  • AWS SageMaker
  • Random Forest
  • AWS Lambda
  • REST API
  • Machine Learning
  • Data preprocessing
  • Hyperparameter tuning
Movie Recommendation with Content Based, Collaborative Filtering, and Matrix Factorization Methods

Developed an advanced movie recommendation system employing content-based and collaborative filtering, along with matrix factorization. Analysed movie features and user interactions to offer personalized suggestions, boosting user engagement. Skills: Machine Learning, Python, Recommendation Systems

Accomplishments
    Skills Used:
  • Python
  • EDA
  • PCA
  • Random Forest
  • Feature Engineering
  • Feature Selection
  • T-SNE
  • Machine Learning
  • Data preprocessing
  • Web Development
  • Unsupervised Learning
  • Streamlit
  • Hyperparameter tuning
Data Warehousing on Telecom Churn Rate Data Using AWS Redshift

Used an open-source telecom customer churn dataset from Kaggle, warehoused it in AWS Redshift, and applied machine learning algorithms to derive insights, visualize the data, and predict telecom churn with an accuracy score of 85%.

Accomplishments
    Skills Used:
  • AWS Redshift
  • Machine Learning
  • SQL
  • EDA
  • XGBoost
  • S3
  • EMR
  • Data Visualization
Power BI Dashboard for ATLIQ Software Company Ltd for business insights

Deployed a database provided in a Codebasics course and designed a Power BI dashboard to surface business insights for Atliq Technologies, improving the company's sales and performance efficiency by 60%.

Accomplishments
  • Proficient in data modeling, KPI indicators, date table creation (using the M language), and data validation techniques for robust data representation in Power BI.
  • Competent in measure creation (using DAX), calculated columns, page navigation (with buttons and bookmarks), and preventing errors with functions like DIVIDE. Experienced in publishing reports, creating Power BI apps, and managing collaboration and access permissions in the Power BI service.
  • Skills: Power BI, SQL, Data Cleaning, ETL, Excel, Data Modeling, Data Optimization, Python, and DAX

Skills

Primary Skills

  • Programming Languages: SQL, Python, R, C/C++, HTML, Embedded C, and Java
  • Data Visualization: Power BI, Tableau, DAX
  • Machine Learning (Supervised and Unsupervised Learning): Feature Engineering, Feature Selection, Model Validation, Model Development, Model Evaluation
  • Cloud Computing: Microsoft Azure, AWS, S3, EMR, Redshift, SageMaker, PySpark, Spark SQL, Big Data, Data Engineering, Data Lake, Databricks, Azure Data Factory
  • Microsoft Excel, Microsoft Word, Microsoft PowerPoint

Libraries

NumPy
Pandas
OpenCV
scikit-learn
matplotlib
seaborn
Apache Spark

Secondary Skills

  • Operating Systems: Windows, Linux, Ubuntu
  • FileMaker, Google Classroom
  • Software Platforms: MySQL, PostgreSQL, MATLAB, Jupyter Notebook, Eclipse, GitHub, VS Code

Other

AWS
Microsoft Azure
Git

Education

Illinois Institute of Technology

Chicago, IL, USA

Degree: Master of Science (M.A.S) in Data Science
Fall 2022 - Spring 2024

    Relevant Coursework:

    • Database Organization
    • Project Management
    • Statistical Learning
    • Introduction to Algorithms
    • Big Data Technologies
    • Data Preparation and Analysis
    • Monte Carlo Methods in Finance
    • Applied Statistics
    • Data Science Practicum
    • Machine Learning

SRM Institute of Science and Technology, Chennai

Tamil Nadu, India

Degree: Bachelor of Technology in Electronics and Communication Engineering
GPA: 3.49/4
June 2015 - May 2019

Contact