Navigation

Data Scientist | Software Engineer - AI

I bring advanced analytics experience in diverse domain (Supply Chain, Insurtech, Fintech, Edtech). Proficient in transforming complex data into meaningful insights. Specializes in streamlining data pipelines, pattern mining, predictive modeling, and crafting innovative solutions for business success. I excel in intelligent system development through my R&D skills in AI/ML/NLP.

Manoj Adhikari Resume

Professional Experience

R&D Engineer-AI & Analytics

East Tennessee State University

Sep 2023 – Present
Tennessee, USA

  • Analyzed 50K+ user queries from a university Q&A platform to identify recurring campus infrastructure issues, leading to an 18% improvement in satisfaction post-survey.
  • Build incremental and interactive dashboard and reports to communicate insights with university administration.
  • Developed ETL for unstructured web text data, enabling incremental scraping, cleaning, and transformation with reliable scheduling and fault tolerance.
  • Perform sentiment analysis on course feedback to segment and monitor sentiment trajectories of student.
  • Developed interactive dashboards to support university administration, improving decision-making processes.
  • Ensured data governance and security by implementing role-based access controls and maintaining compliance with university data privacy standards.
  • Created detailed documentation for data collection, transformation, and analysis workflows and dashboard usage, ensuring clarity for non-technical users.

  • Technologies: Machine Learning RAG Transfomers ChromaDB Semantic Search Prompt Engineering Authentication & Authorization REST APIs Rate Limiting Beautifulsoup Docker React Flask

Data Scientist

Renegade Insurance

Feb 2022 – July 2023
Oregon, USA

  • Design, build and maintain Spark Streaming pipelines to integrate and warehouse 16+ insurance companies data ensuring high data integrity for real-time analytics, reporting and product support.
  • Performed EDA and in-depth analysis on historical data to identify trends in claims, premiums, and customer behavior over time.
  • Define and analyze key performance indicators (KPIs) such as policy uptake, retention rates, claims ratios, sales and customer lifetime that impacts on business success.
  • Developed dashboard and reports to track KPIs in PowerBI & Tableau to assess product success.
  • Ensured data governance by implementing data quality checks, encryption protocols, and access controls in compliance with industry standards.
  • Conducted A/B tests for newly released product features, provide insights, and documented results to improve engagement and product performance.
  • Collaborated with cross-functional teams in Agile/Scrum environments to deliver data solutions on time and within scope.
  • Performed data versioning and documentation of data model, ETL processes and workflow.

  • Technologies: Machine Learning Python PyMongo MongoDB Apache Airflow PySpark PowerBI Software Design

Data Scientist

BitsKraft Pvt. Ltd.

Jan 2020 – Jan 2022
Kathmandu, Nepal

  • Forecasted short and long-term demand at SKU, category, and regional levels using time-series models (ARIMA).
  • Developed pipeline for data extraction, transformation, validation, data integrity and quality checks, versioning and warehousing data documents in NoSQL (MongoDB cluster) using python.
  • Performed descriptive analysis on client’s product sales performance, conversion rates, cart abandonment, customer lifetime value (CLV), and average order value (AOV).
  • Conducted customer segmentation analysis using RFM (Recency, Frequency, Monetary) modeling to identify high-value customers and tailor marketing strategies, resulting in improved retention and targeted campaign.
  • Implemented clustering (K-Means) to cluster customer, association rule mining (Apriori) for market basket analysis, and classification models (Random Forest, XGBoost) to classify product performance.
  • Worked directly with clients and vendors to identify analytics requirements, document and communicate performance metrics with actionable insights with non-technical stakeholders.

  • Technologies: Machine Learning Python PowerBI Pandas Numpy

Technical Skills

I've developed a diverse skill set across multiple domains, technologies, and frameworks:

Programming

Python R JavaScript Java C C++ C#

Database

SQL (MySQL, PostgreSQL) NoSQL (MongoDB, Elasticsearch) Amazon Redshift ChromaDB PineCone

BI & Orchestration tools

PowerBI Tableau Amazon QuickSight Looker Excel Streamlit Apache Airflow PySpark Spark Streaming

Cloud & Tools

AWS GCP Docker Kubernetes CI/CD (GitHub Actions) S3 SageMaker Lambda Azure AI Studio Git Flask Django REST Framework REST APIs FastAPI

AI/ML & NLP

GenAI LLMs TensorFlow PyTorch Scikit-learn Transformers RAG SpaCy Neural Network Model Optimization Neural Evaluation Traditional Models Keras

Soft Skills

Leadership Stakeholder Engagement Collaboration Problem-Solving Teamwork Research Documentation

Featured Projects

A showcase of data science projects demonstrating practical solutions to real-world problems

BucBuddy - University Assistance System

Advanced RAG architecture based conversational & context aware intelligent Q&A system designed to address complex queries for current & prospective ETSU scholars, featuring advanced accessibility capabilities. (Funded- Small Grants in Support).

React Flask Langchain Llama 2 ChromaDB Hugging Face Sentence Transformers AWS Docker

Transcripta – Audio insight platform

Transcripta is a web and mobile application that allows users to record audio, get real-time transcriptions, and receive AI-generated summaries and actionable insights. It’s designed for meetings, lectures, interviews, or brainstorming sessions.

Python React FastAPI Pandas

Phishing Awareness and Simulation System (REST APIs)

Phishing Simulation System with multi-factor authentication with feature of interactive analytical dashboards & reports on breach data, template creation, role, & permission management.

Python Django REST framework SQL

Supply Chain Analytics

Using a real-world dataset, the project aims to identify and address key challenges in shipment and inventory management. It analyzes supply chain inefficiencies and delivers interactive dashboards to inform stakeholders and recommend structural improvements.

Python Pandas NumPy PowerBI

Customer Segmentation Using Clustering

Applied K-Means and Hierarchical clustering on retail data computing RFM metrics to identify distinct customer groups. Conducted data cleaning, visualization, normalization, and cluster evaluation (silhouette score) to uncover high-value and low-value customer segments.

Python Pandas NumPy Matplotlib Seaborn Scikit-learn

Sentiment360 - Customer Sentiment & Product Insights

An interactive NLP analytics system that analyzes Amazon product reviews. Performs sentiment classification and LDA-based topic modeling for diagnostic analysis. Identified key pain points across product lines, helping product quality improvements.

Python Pandas NLP Streamlit BERT PowerBI

Education & Publications

Education

Master's, Computer Science

GPA: 4.0

East Tennessee State University

Aug 2023 - May 2025 Johnson City, Tennessee

Relevant Courses:

Machine Learning NLP Data Analytics & Visualization Software Verification & Validation Software Production Software Design Software System Engineering Software Project Management

Bachelor's, Computer Science

GPA: 3.68

Tribhuvan University

Aug 2016 - May 2020 Kathmandu, Nepal

Relevant Courses:

Data Warehouse & Data Mining Analysis of Algorithm Artificial Intelligence Web Technologies DBMS Theory of Computation Image Processing Calculas Linear Algebra Discrete Mathematics Simulation & Modeling Numerical Methods Computer Architecture

Publications

Awards

A Small Grant in Support of Capstone Projects to Support Scholarly Research Excellence (2025, ETSU )

For desigining and developing intelligent QnA platform of high accessibility features for ETSU Stakeholders to enhance the university resource exploration, resource utilization, and understand user resource need.

The Fr. Martin P. Coyne, S.J. Memorial Award for Outstanding Moral Uprightness (2020, St. Xavier's College)

For readiness to reach out to others with respect and for upholding the values of honesty and fairness at all times.

Frs. Locke & Stiller Research Awards (LSRA) Cycle 2 (2019, St. Xavier's College)

For research work on Assessment of Criminal Activites in kathmandu Valley using Risk Matrix & KNN Classifiers.

Google Crowdsource Certificate of Excellence (2018, Google)

In recognition of being top contributor during the Google Crowdsource Campaign for enriching the Nepali language through technology.

Contact Me

Feel free to reach out to me for collaborations, opportunities, or just to say hello!

Get In Touch

Location

California, USA

Send a Message

Manoj Adhikari

Data Scientist & AI Engineer

© 2025 Manoj Adhikari. All rights reserved.