ksirohi-blog
Kshitiz Sirohi
Passionate and Driven by Technology LinkedIn
ksirohi-blog 6 years ago
About me
Who am I? I am an engineer with a passion for working with technology that drives business value. I like working with programming, databases, cloud, and other IT technologies.
👨🏻‍💻 GitHub
Badge - AWS Certified Solutions Architect - Associate (SAA-C02)
Badge - Google Cloud Associate Cloud Engineer (ACE)
Education
Northeastern University
Master's in Data Analytics (GPA 3.8), August 2020
Coursework: Advanced Cloud Computing, Big Data Technologies, Data Warehouse & SQL, Predictive Modeling

GB Pant Engineering College
Bachelor's in Computer Engineering, July 2017
Coursework: Data Structures, Database Management Systems, Object Oriented Programming, Software Engineering, Artificial Intelligence
Work Experience
Truepill, Inc. (February 2022 - Present) DATA ENGINEER
Zartico (September 2020 - February 2022) DATA ENGINEER
Northeastern University (January 2020 - September 2020) TEACHING ASSISTANT
Projects
MirrorText (Python, Docker, AWS, Bash, Linux) May (2020) Built a web application that measures the text similarity between two documents using TF-IDF and cosine similarity. Used Docker for containerization and hosted the service on a Linux AWS EC2 instance. Useful for detecting plagiarism; the service can also be used through its API.
Spark Application on Uber Rides Data (Spark, AWS, Bash) May (2020) Developed a Spark application on an AWS EC2 instance to process 14 million rows of Uber rides data. Also used Bash to automate data collection and to run my scripts. I learned how fast and useful Spark is when working with large amounts of data.
PriceComm (AWS, Database, Data-Pipeline, Web-Scraping, JSON) March (2020) Developed a comparison platform that shows real-time prices, deals, and discounts on multiple products from a number of e-commerce websites. It is like TripAdvisor, but only for shoes and clothes.
ETL (Wearable Devices) (Spark, AWS, S3, Postgres, Python, OOP) April (2020) Followed data warehousing best practices to perform ETL and designed a star schema. I learned how to combine cutting-edge tools and techniques to implement the fundamentals of data modeling.
Quantitative Database Analysis (SQL, MySQL, Power BI, Python) December (2019) Designed a relational database and then calculated customer life cycle value, cross-segment analysis, and sales forecasts. Learned how to work with OLTP transactional data and perform joins on it.
Stock Exchange with PyMongo (NoSQL, Python) April (2020) Collected real-time stock exchange data by making API calls with Python, then configured a MongoDB database to store the results as documents.
Fraud Detection Using Customer Transactions (Machine Learning, Python) September (2019) Trained 3 different machine learning models to detect fraudulent credit card transactions. The biggest challenges I faced were class imbalance and parameter tuning.
Lead Score Prediction and Cost Benefit Analysis (GLM, Regression) December (2019) (Blog Post) Implemented logistic regression to classify leads and estimate the probability that a lead becomes a customer. Also calculated the cost and benefit of acquiring new customers.
Cancer Classification Using Gene Expressions (Predictive Mod., Python)
Property Assessment System (R-Shiny, Tableau)
Analysis of Boston Utility Consumption Patterns (Python)
Fake News Classification (R, TF-IDF, RandomForest)
Visualization (Tableau)
Freight Cost Analysis for ConMed Corporation
Life-Expectancy Data Analysis
TripAdvisor Reviews Data Analysis (R)
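
For readers unfamiliar with the technique named in the MirrorText entry, here is a minimal sketch of scoring two documents with TF-IDF and cosine similarity. It is not the project's actual code: the function names (`tokenize`, `tfidf_vectors`, `cosine_similarity`, `text_similarity`), the simple regex tokenizer, and the smoothed IDF formula are all assumptions chosen for illustration, and only the Python standard library is used.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    """Return one sparse {term: weight} TF-IDF vector per input document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: how many documents contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF (an assumed variant) so terms found in every document
    # still get a nonzero weight.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1 for t, c in df.items()}
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid dividing by zero for empty documents
        vectors.append({t: (c / total) * idf[t] for t, c in counts.items()})
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def text_similarity(doc_a, doc_b):
    """Score two documents between 0.0 (no shared terms) and 1.0 (same term profile)."""
    vec_a, vec_b = tfidf_vectors([doc_a, doc_b])
    return cosine_similarity(vec_a, vec_b)
```

Calling `text_similarity(first_text, second_text)` returns a score where values near 1.0 suggest heavily overlapping wording. A real service would likely compute IDF over a larger corpus rather than just the two documents being compared.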