About me

I am passionate about crafting elegant solutions that make a real impact on end users. With a knack for turning complex problems into sustainable solutions through data science, I transform raw data into strategic, actionable insights.

Beyond the data realm, my entrepreneurial spirit drives me to envision a future where daily tasks are redefined, enabling people to focus on what truly matters. My ultimate goal is to integrate AI, Cognitive Science, and Robotics to create transformative solutions.

I am committed to harnessing data and technology for an innovative and meaningful tomorrow.

What I'm doing

Professionally

  • Anomaly Detection

    Uncovering hidden patterns and ensuring operational reliability


    Specialized in fraud detection, applying models such as GBT, XGBoost, TabNet, DevNet and ConNet to identify abusive data points. Alongside the models, built multiple rule-based pipelines to catch fixed fraud patterns identified through EDA.

    Also worked on identifying group fraud using graph theory: modelled the entities as a graph, connected them using various KPIs, and applied community detection algorithms and models such as the Louvain algorithm, the Leiden algorithm, PSO, Graph Convolution Network and Deep Modularity Network.

  • Propensity Modelling

    Predicting customer behavior and driving strategic decisions


    Dove deep into predicting the value of a new user on a platform, experimenting with custom-built neural network architectures such as MLP, SIMO and MIMO with modified loss functions. Also applied tree-based models built in TensorFlow using the TF-DF module.

    Built both classification and regression models, predicting risk score and lifetime value respectively.

  • Location Intelligence

    Transforming geospatial data into actionable insights


    Leveraged GPS ping data to generate suggestions for, and rectify, incorrectly logged locations in the system. Designed an automated pipeline that regularly looks for incorrect locations, generates suggestions and rectifies them. Explored multiple methods to generate suggestions, such as Geometric Median, Clustering and Outlier Filtering.

    Also worked on Facility Location Identification and Demand Estimation: after predicting the demand heatmap for a city, determined the optimal facility locations to serve the maximum demand while ensuring profitability. Applied constraints and optimizations from Linear Programming using tools such as Gurobi and PuLP.

  • Generative AI

    Revolutionizing content creation and transforming ideas into reality


    Worked on an image inpainting pipeline that beautifies dish images using Stable Diffusion and also adds text and gradient overlays. Used the ControlNet and CLIP Segmentation models from the diffusers and transformers packages for inpainting.

    Contributed to an in-house Text2SQL pipeline, a multi-stage RAG pipeline that builds context at each stage as it is fed more metadata and narrows down to the specific columns and tables required to generate the query.

  • Operational Excellence

    Driving optimizations and efficiency through data-driven strategies


    Implemented storage and compute optimizations, saving thousands of dollars per month in tech costs. Studied the internals of Redis, DynamoDB, Kafka and Yack to identify optimization opportunities.

    Also designed and implemented automations to reduce human dependency and add self-serve capability to the pipelines. Leveraged the APIs of Databricks, GSuite, GCP, AWS and Slack.

  • Data Analysis

    Unlocking the potential of data with precise and insightful analysis


    Performed analyses ranging from Exploratory Data Analysis (EDA) to Total Addressable Market (TAM) sizing. These help gauge the insights and actions the data supports, and their impact.

    Given my experience in anomaly detection, Anecdotal Analysis and Root Cause Analysis are also among my prominent skills. They help explain why a model or rule behaved as it did in a particular case, and reach an unbiased conclusion about what should have happened.

Personally

  • SaaS Applications

    Building innovative and robust SaaS platforms


    I keep looking for pain points in day-to-day life that can be easily solved with a tech platform, and try my best to make the solution a reality. A few of the platforms I have built in the past are:

    Bolt - an AI-infused swing trading platform that takes charge of the entire trading process, from strategically selecting stocks to optimal investment allocation and timely executions on the exchange.

    inDex - an open data community platform where researchers, organizations, contributors, consumers, etc. can come together and help each other build datasets that do not yet exist, for crafting new revolutionary products.

    PixelRides - a cab booking platform with a bargaining feature that lets customers and drivers agree on a satisfactory price before booking the ride.

  • Trading & Investment

    Crafting efficient strategies for dynamic market conditions


    Always on the hunt for an even more accurate strategy that can shortlist potential stocks and provide precise entry and exit points.

    I play around with a hierarchical structure of strategies, from a longer time frame down to the shortest, like a waterfall model that filters for only the strong choices at the end.

  • Reading

    Constantly expanding my knowledge from articles, blogs, and books


    I like to keep myself updated with the latest things happening around me, so I get my quick bites from articles and blogs. When I want to learn something new or bigger, I take up books or courses.

  • Retro Gaming & Shows

    Exploring the charm and nostalgia to spark creativity and relaxation


    Tekken 3, Mario, Sonic, Contra and Adventure Island are some of my favourite retro games.

Testimonials

  • Pradeep Janardhanan, CEO @Vsualthree60

    Masihullah is reliable, dedicated and eternally upbeat. Masihullah multitasks effectively and is able to handle a high-volume workload. He consistently met and surpassed all our expectations on delivering gold standard technology solutions. Masihullah has a team player mind-set, an enthusiastic embrace of change, the ability to work with minimal supervision and an unwavering commitment to exceed expectations.


    Organized and diligent, Masihullah quickly learned technology systems and software that were unfamiliar to him when he first started with Vsualthree60. Masihullah is a hardworking, top-performing professional and has my highest recommendation.

  • Meghana Negi, Senior Manager @Swiggy

  • Akash Deep, MLE @Nykaa

Worked at

Resume

Education

  1. Indian Institute of Information Technology Sri City

    2018 — 2022

    BTech with Honors in Computer Science and Engineering
    CGPA - 9.41

    Honors Research (2020-2022) Title: Exploring Collaborative Strategies for PTZ Cameras Network

    • Created real-world traffic simulations using Agent-based Modelling
    • Studied various collaborative strategies to build an intelligent vehicle tracking system
    • Publication: Masihullah S, and Subu K (2022, June). "A Decentralized Collaborative Strategy for PTZ Camera Network Tracking System using Graph Learning" Proceedings of the 2022 5th International Conference on Mathematics and Statistics, Paris, France.

Experience [WIP]

  1. Data Scientist II, Swiggy

    Nov 2023 — Present
    • Took full ownership of the Trust & Safety (TnS) and Supply Vendor charters. In TnS, manage various fraud detection and propensity models. In Supply, manage NLP models for menu validation checks, item description generation and item category recommendations.
    • Facility Location & Demand Estimation Modeling
    • Picked up Gen AI use cases: menu image beautification and Text2SQL
  2. Data Scientist I, Swiggy

    Jun 2022 — Oct 2023
    • Menu Items Validations NLP Models
    • Location Intelligence Models to generate suggestions
    • New User Propensity Model
    • Refunds Fraud Detection Models
    • Publication : Piyush N, Rutvik V, Masihullah S, Meghana N, & Jose M (2024, January). "Utilizing DevNet with Variational Loss for Fraud Detection in Hyperlocal Food Delivery." In Proceedings of the 7th Joint International Conference on Data Science & Management of Data, Bangalore, India.
  3. Data Science Intern, Swiggy

    Jun 2021 — Jan 2022
    • Worked on Fraudulent Graph Community Detection and Graph Convolution Networks.
    • Built an end-to-end graph-based fraud detection framework covering various fraud types using community detection. This is a real-time non-GNN framework that identifies groups of fraudsters.
    • Our framework outperformed existing SOTA approaches in fraud detection, both in terms of performance and computation time.
    • Publication : Masihullah S, Meghana N, Jose M, & Jairaj S (2022, August). "Identifying Fraud Rings Using Domain Aware Weighted Community Detection." In International Cross-Domain Conference for Machine Learning and Knowledge Extraction, Vienna, Austria.
  4. Research Intern, IBM

    Jan 2020 — Jun 2020
    • Researched road segmentation and pothole detection with the aim of competing with the current state-of-the-art model, evaluated both qualitatively and quantitatively.
    • Developed a novel end-to-end neural network architecture with inspiration from DeepLab V3 and applied few-shot learning.
    • Publication : Masihullah S, Ritu G, Prerana M, Anupama R (2021, January), "Attention Based Coupled Framework for Road and Pothole Segmentation", in International Conference on Pattern Recognition (ICPR2020), Milan, Italy.
  5. Machine Learning Intern, Vsualthree60

    Jul 2019 — Jan 2021
    • Designed an efficient AI-driven crowd surveillance model to recognize a person's nationality, age, gender and emotion. Created a database to store facial features so the person can be recognized in future footage.
    • Built an OCR-based Arabic license plate reader and object detection for traffic monitoring.
    • Built an AI-based document information extractor for automated screening and data extraction.
    • Built a business-oriented NLP chatbot that provides company information and handles appointments.
    • Built a virtual clothing overlay with automated body measurements using distributed computing.

Skills

  • Languages - Python, SQL, Bash, JavaScript, HTML, CSS, Java, C, Dart, Julia
  • DS Tools - TensorFlow, Keras, PyTorch, OpenCV, scikit-learn, NumPy, pandas, PySpark
  • Ops Tools - Databricks, Snowflake, Kafka, AWS, GCP, Git
  • Others - Django, Flask, Flutter, Robot Operating System

Portfolio

Publications