ML Intern at MBUSA HQ

ML Intern at MBUSA HQ

Internship • Machine Learning/NLP

About The Project

Developed large-scale NLP pipelines and agentic LLM workflows to extract anomalies and failure symptoms from millions of service records, and trained predictive models for repeat and 90-day failure recurrences. Delivered components for a dealer-facing dashboard to inform proactive service and parts planning.

Achievements

  • Built Spark-based agentic framework using LLM prompting for anomaly extraction at scale

  • Fine-tuned domain-adapted LLaMA-2 for structured field alignment (missing/inconsistent data)

  • Engineered NLP stack (TF-IDF, KeyBERT, BERTopic) on 2M+ service records

  • Trained XGBoost models for repeat failures & 90-day recurrence; integrated into dealer forecasting

Links

sunny.sunho.park@gmail.com

unsplash.com/@reddfrancisco
unsplash.com/@reddfrancisco