Manabendra Rout

I'm a

About

Hello! I’m Manabendra Rout, a seasoned Machine Learning Research Engineer with a passion for pushing the boundaries of AI technology. With over 8 years of experience in data science and machine learning, I specialize in developing cutting-edge solutions in Generative AI, natural language processing (NLP), and large language model (LLM) fine-tuning. My expertise lies in building scalable AI systems, fine-tuning models for domain-specific applications, and optimizing inference for real-world use cases.

Beyond technical implementation, I take pride in my role as a mentor and collaborator, guiding junior team members, leading design discussions, and ensuring high-quality deliverables through thorough code reviews. I thrive at the intersection of research and practical application, transforming complex ideas into impactful solutions.

Feel free to explore my portfolio to learn more about my projects and contributions to the field of AI.

Machine Learning Researcher

I am a self-taught programmer and Senior Applied AI Engineer currently working with sirion.ai on a Contract Lifecycle Management (CLM) product. I have worked on multiple Generative AI, NLP and Computer Vision microservices for incorporating various features to the product.

  • Birthday: 1 August 1994
  • Website: mrout94.github.io
  • Phone: +91-7757088502
  • Email: manabendrarout@gmail.com
  • City: Bengaluru, India
  • Age: 30
  • Degree: Bachelors
  • Alma Mater: NIT Rourkela
  • Open to Roles: Applied AI Engineer, Data Scientist

I specialize in developing Generative AI applications, Natural Language Processing (NLP), and fine-tuning Large Language Models (LLMs), with expertise in Python and frameworks like PyTorch. My work spans scalable model deployment, optimization, and monitoring to ensure robust real-world applications. I believe a strong foundation in mathematics, programming, and problem-solving is essential for success in the rapidly evolving AI landscape.

Resume

A methodical and experienced AI Engineer with expertise in Deep Learning, Generative AI, and NLP. Passionate about creating innovative solutions and advancing the field of artificial intelligence.

Summary

Manabendra Rout

Innovative and detail-oriented AI Engineer with 8+ years of experience in developing, fine-tuning, and deploying Generative AI and NLP solutions. Proficient in managing the complete lifecycle of AI projects, including data generation, model training, optimization, inferencing, and scalable deployment for real-world applications.

  • Bengaluru, Karnataka, India
  • (+91) 7757088502
  • manabendrarout@gmail.com

Education

B.Tech in Mechanical Engineering

2012 - 2016

National Institute of Technology (NIT), Rourkela, India

Internship

Academic Intern

May 2015 - July 2015

Tata Motors, Jamshedpur, India

Achieved a significant reduction in torque variation of truck engines by developing an anomaly detection algorithm to preemptively identify defective engines and trace problematic parameters to their sources. Utilized statistical techniques and Isolation Forest.

Professional Experience

Applied AI Engineer

Feb, 2022 - Present

Sirion, Bengaluru, India

  • Created a RAG-based chatbot utilizing open-source LLMs to support advanced contract Q&A.
  • Fine-tuned multiple open-source LLMs (e.g., Llama3, Llama2, Qwen, MPT) for legal domain tasks such as instruction-following and conversational Q&A using distributed training methods (FSDP, DDP).
  • Fine-tuned multiple text embedding models using contrastive learning techniques and designed novel data generation approaches for domain-specific embeddings, improving downstream task performance.
  • Contributed to development of a Auto Redlining Application using GPT-4o for automating contract modification/negotiations in alignment with organizational guidelines.

Data Scientist

Sep, 2020 - Feb, 2022

United Health Group, Noida, India

  • Applied NLP techniques for topic modeling of customer feedback, enabling actionable insights to reduce cancellations by 17%.
  • Delivered a predictive modeling system achieving an F1 score of 0.82 for forecasting order outcomes.

Assistant Manager

Jul 2016 - Sep 2020

Bajaj Auto Limited, Pune, India

  • Developed a GBM-based model to predict bearing friction loss, reducing analysis time by 12%.
  • Designed a web-based framework for automating test data handling, achieving 90% time savings.
  • Applied ML techniques to address material strength predictions and dynamic component behavior.

Projects

Actions speak louder than words. As I am not at liberty to share the projects I have done professionally due to organizational non-disclosure agreements, however I can share some of the hobby/hackathon projects I have undertaken in my own time.

  • All
  • Structural Data
  • Computer Vision
  • NLP

Leaf Disease Classification

Identify the type of leaf disease from a given image of a cassava plant.

Tools Used:- Tensorflow and Keras
CNN Backbone:- InceptionV3

Health Insurance Lead Prediction

Build a model to predict whether the person will be interested in their proposed Health plan/policy given some information like demographics, transaction history, etc.

Material Strength Prediction

Predicting UTS (strength) of Ferrous materials based on their chemical composition and manufacturing method.

Certifications

I have completed several courses on Deep Learning, Computer Vision and NLP on my quest to better understand the technology. Below are the credentials for some MOOCs I have completed successfully.

Neural Networks and Deep Learning - Coursera

Completed Coursera's "Neural Networks and Deep Learning" course whilst completing all the coding assignments with 100% grade.
This course covers majority of the foundational concept of neural networks and deep learning including intuition and basic mathematics.

View Credentials

Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization

Completed Coursera's "Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization" course whilst completing all the coding assignments with 99.2% grade.
This course tries to open the deep learning black box to understand the processes that drive performance and generate good results systematically.

View Credentials

Structuring Machine Learning Projects

Completed Coursera's "Structuring Machine Learning Projects" course whilst completing all the coding assignments with 100% grade.
This course teaches how to diagnose errors in a machine learning system; prioritize strategies for reducing errors; understand complex ML settings, such as mismatched training/test sets, and comparing to and/or surpassing human-level performance; and apply end-to-end learning, transfer learning, and multi-task learning.

View Credentials

Convolutional Neural Networks

Completed Coursera's "Convolutional Neural Networks" course whilst completing all the coding assignments with 99.45% grade.
This course teaches how to build a convolutional neural network, including recent variations such as residual networks; apply convolutional networks to visual detection and recognition tasks; and use neural style transfer to generate art and apply these algorithms to a variety of image, video, and other 2D or 3D data.

View Credentials

Sequence Models

Completed Coursera's "Sequence Models" course whilst completing all the coding assignments with 100% grade.
This course teaches how to build and train Recurrent Neural Networks (RNNs) and commonly-used variants such as GRUs and LSTMs; apply RNNs to Character-level Language Modeling; gain experience with natural language processing and Word Embeddings; and use HuggingFace tokenizers and transformer models to solve different NLP tasks such as NER and Question Answering.

View Credentials

Skills