I'm currently working on large language models at JD.
I love contributing to various open source projects. Here are some of my contributions:
I'm currently working on large language models at JD.
I love contributing to various open source projects. Here are some of my contributions:
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Generate categories automatically for each post in Hexo
BERT as language model, fork from https://github.com/google-research/bert