M.S. Student · POSTECH ML Lab

Seoyeon
Lee.

graduate researcher in machine learning

I'm an M.S. student at the POSTECH Graduate School of AI, researching in the ML Lab. My interests center on LLM agents, machine learning for NLP, and generative AI.

M.S. in AI · POSTECH POSTECH ML Lab Pohang, Republic of Korea
scroll down
01 About

An M.S. student at POSTECH, working on LLM agents and machine learning for NLP.

I'm an M.S. student at the POSTECH Graduate School of AI, in the Machine Learning Lab, where I am supervised by Prof. Dongwoo Kim. My research interests are LLM agents, machine learning for NLP, and generative AI.

Before POSTECH I studied Computer Science at Ewha Womans University and interned on the AI team at Lotte Homeshopping, where I built embedding-based product-matching engines over hundreds of thousands of products and a conversational data-analysis tool. That applied-ML background is the foundation I now build research on.

Position
M.S. Student & Graduate Researcher
Lab
Machine Learning Lab, Graduate School of AI
Focus
LLM agents · ML for NLP · Generative AI
Prior
B.S. Computer Science, Ewha Womans University
02 Experience
Lotte Homeshopping AI Team · Research Intern
Seoul, Republic of Korea Jun – Sep 2025
A

Product-matching engine for broadcast scheduling

Built KR-SBERT embedding + DBSCAN clustering pipelines to identify identical and similar products across internal and competitor catalogs, improving the accuracy of sales forecasting. Similarity combined product name, category, brand and price with tuned weights.

~100KSKUs processed
85%duplicate-removal rate
2.67xmore matched pairs
0.84competitor-match correlation
1.79Msimilarity computations automated
KR-SBERTDBSCAN Cosine similarityPython
B

LLM document-QA & certificate-verification system

Automated the reading of QA documents with LLMs on AWS Bedrock. Designed a chunking strategy that fixed table-reading failures in a RAG QA chatbot, and built a Selenium / PyMuPDF pipeline that extracts reference numbers from PDFs, verifies test-report authenticity online, and assembles evidence screenshots — shipped as Lambda APIs.

AWS BedrockLambda RAGSelenium PyMuPDF
C

Conversational data-analysis (BI) tool

Prototyped a Dockerized BI assistant that unifies natural-language SQL queries with general LLM chat. A routing flow sends each query to a Text-to-SQL path or a chat path, runs it against MySQL, and post-processes the result into a user-friendly answer.

20% → 87%answer accuracy (4x+)
OpenWebUILlama 3-8B QdrantMySQL Text-to-SQLDocker
03 Selected Projects
P / 012025

Personalized Perfume Recommender

A hybrid recommender combining content-based filtering over perfume notes and descriptions with collaborative filtering over user preferences. S-BERT embeddings indexed in FAISS for fast similarity search; user "likes" stored as a Neo4j graph for similar-user recommendations. Deployed with Streamlit.

FAISS content-based + Neo4j collaborative — hybrid
S-BERTFAISS Neo4jStreamlit
Repository
P / 022025

Nolli — Logic Quiz Platform for Children

A platform that builds children's logical reasoning through quizzes. GPT-4o mini generates and grades quizzes; a rubric-based chain-of-thought prompt returns per-criterion scores with reasons. A Korean-specialized Llama model was fine-tuned with LoRA + 8-bit quantization for efficient training and memory use.

LoRA + 8-bit quantization · rubric-based CoT
Llama / LoRAGPT-4o mini FastAPIPyTorch
Repository
P / 032024

Scholli — Scholarship Recommender

A personalized scholarship-recommendation platform centered on AI and backend development. Generative AI drives the recommendations, and an optimized filtering algorithm cut processing time by ~97.3%. Built on Django, MySQL and Redis, containerized and deployed on AWS.

~97.3% faster filtering · 2 competition awards
DjangoRedis OpenAIAWS / Docker
Repository
P / 042024

Skin Cancer Detection (ISIC 2024)

A CNN classifier that distinguishes benign from malignant skin lesions on the Kaggle ISIC 2024 dataset. Data augmentation addressed severe class imbalance, and a fine-tuned ResNet-50 lifted classification performance.

ResNet-50 fine-tuning · augmentation for imbalance
PyTorchResNet-50 NumPy
Repository
P / 052023

Find Your Pokémon — Image Classification

A deep-learning classifier for 20 Pokémon species, covering the full pipeline from data preprocessing through systematic comparison of several architectures. MobileNet was selected as the most efficient model and fine-tuned, reaching 90.41% test accuracy.

90.41% test accuracy · multi-model comparison
TensorFlowMobileNet NumPy
Repository
04 Technical Skills

ML & Research

  • PyTorch
  • TensorFlow
  • scikit-learn
  • SBERT / embeddings
  • ResNet · MobileNet
  • LoRA · quantization

LLM & Retrieval

  • RAG pipelines
  • AWS Bedrock
  • Llama · OpenAI
  • FAISS · Qdrant
  • Neo4j
  • Text-to-SQL

Backend & Infra

  • FastAPI
  • Django
  • Flask
  • Docker
  • AWS · MS Azure
  • Selenium

Languages & Data

  • Python · C++ · Java
  • NumPy · Pandas
  • MySQL · SQLite
  • Redis · Firebase
05 Education & Recognition

Education

2026 –
POSTECH — Graduate School of AI
M.S. in Artificial Intelligence · ML Lab
2021 – 2025
Ewha Womans University
B.S. in Computer Science · GPA 4.08 / 4.5
2024
EPITA, Strasbourg — France
Summer exchange program

Awards

2024
Excellence Award — 4th AI Idea Competition
For innovative real-world use of AI in the personalized scholarship system · 54:1 competition ratio.
2024
Encouragement Award — Campus SW Startup Competition
For software development capability and startup potential.

Selected Program & Activities

2025
SKT FLY AI Challenger
Deep-learning study, Azure cloud deployment, AI team projects.
2024
AI Research Club
Paper reading on recent AI methods and project work.
2023
Algorithm Study
Core algorithms and problem solving in C++.
2023
Game Development Club
Unity and C#; built game projects with the team.
06 Get in Touch

Let's build something
that learns.

I'm always glad to talk about LLM agents, ML for NLP, or generative AI — research collaborations, internships, or just ideas. The fastest way to reach me is email.