About
            Hello! I’m Manabendra Rout, a seasoned Machine Learning Research Engineer with a passion for pushing the
            boundaries of AI technology. With over 8 years of experience in data science and machine learning, I
            specialize in developing cutting-edge solutions in Generative AI, natural language processing (NLP), and
            large language model (LLM) fine-tuning. My expertise lies in building scalable AI systems, fine-tuning
            models for domain-specific applications, and optimizing inference for real-world use cases.
Beyond
            technical implementation, I take pride in my role as a mentor and collaborator, guiding junior team members,
            leading design discussions, and ensuring high-quality deliverables through thorough code reviews. I thrive
            at the intersection of research and practical application, transforming complex ideas into impactful
            solutions.
Feel free to explore my portfolio to learn more about my projects and contributions to the
            field of AI.
          
Machine Learning Researcher
I am a self-taught programmer and Senior Applied AI Engineer currently working with sirion.ai on a Contract Lifecycle Management (CLM) product. I have worked on multiple Generative AI, NLP and Computer Vision microservices for incorporating various features to the product.
- Birthday: 1 August 1994
 - Website: mrout94.github.io
 - Phone: +91-7757088502
 - Email: manabendrarout@gmail.com
 - City: Bengaluru, India
 
- Age: 30
 - Degree: Bachelors
 - Alma Mater: NIT Rourkela
 - Open to Roles: Applied AI Engineer, Data Scientist
 
I specialize in developing Generative AI applications, Natural Language Processing (NLP), and fine-tuning Large Language Models (LLMs), with expertise in Python and frameworks like PyTorch. My work spans scalable model deployment, optimization, and monitoring to ensure robust real-world applications. I believe a strong foundation in mathematics, programming, and problem-solving is essential for success in the rapidly evolving AI landscape.
Resume
A methodical and experienced AI Engineer with expertise in Deep Learning, Generative AI, and NLP. Passionate about creating innovative solutions and advancing the field of artificial intelligence.
Summary
Manabendra Rout
Innovative and detail-oriented AI Engineer with 8+ years of experience in developing, fine-tuning, and deploying Generative AI and NLP solutions. Proficient in managing the complete lifecycle of AI projects, including data generation, model training, optimization, inferencing, and scalable deployment for real-world applications.
- Bengaluru, Karnataka, India
 - (+91) 7757088502
 - manabendrarout@gmail.com
 
Education
B.Tech in Mechanical Engineering
2012 - 2016
National Institute of Technology (NIT), Rourkela, India
Internship
Academic Intern
May 2015 - July 2015
Tata Motors, Jamshedpur, India
Achieved a significant reduction in torque variation of truck engines by developing an anomaly detection algorithm to preemptively identify defective engines and trace problematic parameters to their sources. Utilized statistical techniques and Isolation Forest.
Professional Experience
Applied AI Engineer
Feb, 2022 - Present
Sirion, Bengaluru, India
- Created a RAG-based chatbot utilizing open-source LLMs to support advanced contract Q&A.
 - Fine-tuned multiple open-source LLMs (e.g., Llama3, Llama2, Qwen, MPT) for legal domain tasks such as instruction-following and conversational Q&A using distributed training methods (FSDP, DDP).
 - Fine-tuned multiple text embedding models using contrastive learning techniques and designed novel data generation approaches for domain-specific embeddings, improving downstream task performance.
 - Contributed to development of a Auto Redlining Application using GPT-4o for automating contract modification/negotiations in alignment with organizational guidelines.
 
Data Scientist
Sep, 2020 - Feb, 2022
United Health Group, Noida, India
- Applied NLP techniques for topic modeling of customer feedback, enabling actionable insights to reduce cancellations by 17%.
 - Delivered a predictive modeling system achieving an F1 score of 0.82 for forecasting order outcomes.
 
Assistant Manager
Jul 2016 - Sep 2020
Bajaj Auto Limited, Pune, India
- Developed a GBM-based model to predict bearing friction loss, reducing analysis time by 12%.
 - Designed a web-based framework for automating test data handling, achieving 90% time savings.
 - Applied ML techniques to address material strength predictions and dynamic component behavior.
 
Projects
Actions speak louder than words. As I am not at liberty to share the projects I have done professionally due to organizational non-disclosure agreements, however I can share some of the hobby/hackathon projects I have undertaken in my own time.
- All
 - Structural Data
 - Computer Vision
 - NLP
 
Certifications
I have completed several courses on Deep Learning, Computer Vision and NLP on my quest to better understand the technology. Below are the credentials for some MOOCs I have completed successfully.
            Neural Networks and Deep Learning - Coursera
                Completed Coursera's "Neural Networks and Deep Learning" course whilst completing all the coding
                assignments with 100% grade.
                This course covers majority of the foundational concept of neural
                networks and deep learning including intuition and basic mathematics.
              
            Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
                Completed Coursera's "Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and
                Optimization" course whilst completing all the coding assignments with 99.2% grade.
                This course tries to open the deep learning black box to understand the processes that drive performance
                and generate good results systematically.
              
            Structuring Machine Learning Projects
                Completed Coursera's "Structuring Machine Learning Projects" course whilst completing all the coding
                assignments with 100% grade.
                This course teaches how to diagnose errors in a machine learning system; prioritize strategies for
                reducing errors; understand complex ML settings, such as mismatched training/test sets, and comparing to
                and/or surpassing human-level performance; and apply end-to-end learning, transfer learning, and
                multi-task learning.
              
            Convolutional Neural Networks
                Completed Coursera's "Convolutional Neural Networks" course whilst completing all the coding assignments
                with 99.45% grade.
                This course teaches how to build a convolutional neural network, including recent variations such as
                residual networks; apply convolutional networks to visual detection and recognition tasks; and use
                neural style transfer to generate art and apply these algorithms to a variety of image, video, and other
                2D or 3D data.
              
            Sequence Models
                Completed Coursera's "Sequence Models" course whilst completing all the coding assignments with
                100% grade.
                This course teaches how to build and train Recurrent Neural Networks (RNNs) and commonly-used variants
                such as GRUs and LSTMs; apply RNNs to Character-level Language Modeling; gain experience with natural
                language processing and Word Embeddings; and use HuggingFace tokenizers and transformer models to solve
                different NLP tasks such as NER and Question Answering.