Skip to content
View NormXU's full-sized avatar
🎯
最後まで、絶対に諦めじゃだめ
🎯
最後まで、絶対に諦めじゃだめ

Block or report NormXU

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
NormXU/README.md

Hi there 👋🏻,

🔭 I am currently focused on developing cutting-edge multi-modality models capable of natively generating and understanding text and images.

My specific areas of interest include:

  • Path planning

PS: I recognize all these tasks as path-planning tasks. Check this blog for more details.


  • Auto-regressive Generation
  • Diffusion

PS: I recognize all these tasks as diffusion and next-token generation tasks.


  • Document Understanding & Layout Analysis
  • Optical Character Recognition
  • Object Detection

In addition to my current work, I have prior experience in Robotics Perception from my Master's studies. I hope my work can be helpful to you.

Feel free to reach out if you have any questions or if there's anything I can assist with!

Pinned Loading

  1. ERNIE-Layout-Pytorch ERNIE-Layout-Pytorch Public

    An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.

    Python 99 11

  2. Layout2Graph Layout2Graph Public

    An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"

    Python 74 11

  3. nougat-latex-ocr nougat-latex-ocr Public

    Codebase for fine-tuning / evaluating nougat-based image2latex generation models

    Python 123 13