l-yohai/README.md

Portfolio LinkedIn Email Location


🧠 About Me

const yohan = {
    role: "AI Research Engineer @ KakaoBank 🏦",
    focus: ["Reasoning Systems", "Memory & Cognition", "Post-Training"],
    scale: "26M+ Active Users 🚀",
    mission: "Making LLMs Think Like Humans 🧬",
    passion: [
        "🔮 Parallel & Continuous Reasoning",
        "⚡ Scalable LLM Optimization (200B+ params)",
        "🧬 Episodic Memory & Cognitive Modeling",
        "🛡️ Trustworthy & Aligned AI Systems"
    ]
};

💡 I don't just scale parameters — I reimagine how models think, remember, and align with human cognition.


🔬 Research Focus

🧠 Human-like Reasoning

# Current LLMs: Sequential 🐌
output = []
for _ in range(max_new_tokens):
    output.append(generate(context, output))  # each token conditions on all previous ones

# My Vision: Parallel & Continuous 🚀
thoughts = parallel_reasoning(context)
output = diffuse_and_refine(thoughts)

Inspired by: Soft Token Reasoning
Goal: Models that think before they speak

⚙️ Scalable Reasoning Systems

@ KakaoBank: 200B+ Parameter LLMs

🎯 Key Innovations:

  • 🔄 Interleaved reasoning (function + memory + tools)
  • 🌿 Multi-instruction branching
  • ⚡ Latency-optimized alignment
  • 💰 Real-world financial AI agents

🧬 Memory & Cognitive Modeling

Building PREMem → Next-Gen Memory Systems

graph LR
    A[Experience] --> B[Consolidate]
    B --> C[Structured Memory]
    C --> D[Contextualize]
    D --> E[Reasoning Loop]
    E --> F[Decision]

Research: Episodic memory that learns to forget

🛡️ Trustworthy & Human-Aligned AI

🎯 Core Principles:

  • ✅ Consistency & Transparency
  • 🎯 Calibrated Outputs
  • 🤔 Reflective Reasoning
  • 📝 Justifiable Processes

Vision: AI that reasons with ethics, not just logic


🏆 Publications

🌟 Top-Tier AI Conferences (EMNLP, ACL, NAACL 2025)

🔥 Featured Papers

📄 Finding Diamonds in Conversation Haystacks 💎 EMNLP arXiv

Yohan Lee, Yongwoo Song, Sangyeop Kim†


📄 Pre-Storage Reasoning for Episodic Memory 🧠 EMNLP arXiv

Sangyeop Kim*, Yohan Lee*, Sanghwa Kim, Hyunjong Kim, Sungzoon Cho†

PREMem: Shifting inference burden to memory for smarter dialogue


📄 What Really Matters in Many-Shot Attacks? 🛡️ ACL arXiv

Sangyeop Kim*, Yohan Lee*, Yongwoo Song*, Kimin Lee†


📄 HEISIR: Hierarchical Expansion of Inverted Semantic Indexing 🔍 NAACL arXiv

Sangyeop Kim†, Hangyeul Lee, Yohan Lee


📄 SAFARI: Sample-specific Assessment Framework 📊 NAACL

Yohan Lee*, Sungho Park*, Sangwoo Han*, Yunsung Lee*†, and team


🛠️ Tech Stack

Languages & Frameworks

Python PyTorch TensorFlow HuggingFace LangChain

Research & MLOps

Docker Kubernetes Git Linux CUDA

Specializations

🧠 LLM Post-Training | ⚡ Inference Optimization | 🔬 Reasoning Systems | 🧬 Memory Architectures | 🛡️ AI Safety



🎯 Current Focus

mindmap
  root((Yohan's<br/>Research))
    🧠 Reasoning
      Parallel Inference
      Soft Token Reasoning
      Diffusion-like Dynamics
    ⚙️ Production
      200B+ LLM Optimization
      Financial AI Agents
      Real-time Reasoning
    🧬 Memory
      PREMem Evolution
      Episodic Systems
      Contextual Learning
    🛡️ Safety
      Alignment Research
      Trustworthy AI
      Ethical Reasoning

🌟 Let's Connect!

I'm always open to collaborations on:

🔬 Novel reasoning architectures • 🧠 Cognitive AI systems • 🚀 LLM optimization • 🛡️ AI safety research


📫 Reach me at:

Portfolio LinkedIn Email


"Making AI think like humans, one reasoning step at a time"

Pinned

  1. deepspeedai/DeepSpeed-MII (Public)

    MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

    Python · 2.1k stars · 187 forks

  2. tunib-ai/oslo (Public archive)

    OSLO: Open Source framework for Large-scale model Optimization

    Python · 309 stars · 29 forks

  3. jskwak98/Bookathon3_Bookie_On_And_On (Public)

    Jupyter Notebook · 31 stars · 9 forks

  4. LostCow/KLUE (Public)

    KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC)

    Python · 25 stars · 4 forks

  5. daily_papers_ko (Public archive)

    Automatically translates and summarizes Hugging Face's daily papers into Korean using ChatGPT.

    Python · 52 stars · 7 forks

  6. CDR-Benchmark (Public)

    Finding Diamonds in Conversation Haystacks: A Benchmark for Conversational Data Retrieval (EMNLP 2025 Industry Track)

    Python · 3 stars · 1 fork