#

nlp-machine-learning

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

Here are 6,118 public repositories matching this topic...

deeppavlov / DeepPavlov

An open source library for deep learning end-to-end dialog systems and chatbots.

Updated Nov 26, 2024
Python

thunlp / OpenPrompt

An Open-Source Framework for Prompt-Learning.

nlp natural-language-processing ai deep-learning prompt pytorch transformer prompt-toolkit nlp-library nlp-machine-learning prompts natural-language-understanding pre-trained-model pre-trained-language-models prompt-based-tuning prompt-learning

Updated Jul 16, 2024
Python

katanaml / sparrow

Data processing with ML, LLM and Vision LLM

computer-vision machinelearning gpt nlp-machine-learning rag huggingface-transformers llm vllm

Updated Feb 26, 2025
Python

esbatmop / MNBVC

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化，也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

nlp chinese chinese-nlp corpus-data chinese-simplified nlp-machine-learning chinese-language

Updated Feb 18, 2025

zhaoyingjun / chatbot

ChatGPT带火了聊天机器人，主流的趋势都调整到了GPT类模式，本项目也与时俱进，会在近期更新GPT类版本。基于本项目和自己的语料可以训练出自己想要的聊天机器人，用于智能客服、在线问答、闲聊等场景。

python ai chatbot pytorch nlp-machine-learning seq2seq-chatbot seqgan seqgan-tensorflow tensorflow2

Updated Jun 26, 2024
Python

AI_Tutorial

cbamls / AI_Tutorial

精选机器学习，NLP，图像识别，深度学习等人工智能领域学习资料，搜索，推荐，广告系统架构及算法技术资料整理。算法大牛笔记汇总

elasticsearch machine-learning deep-neural-networks artificial-intelligence recommender-systems deep-learning-tutorial nlp-machine-learning artificial-intelligence-algorithms machine-learning-tutorials graph-neural-networks search-system

Updated Apr 15, 2024

github / CodeSearchNet

Datasets, tools, and benchmarks for representation learning of code.

Updated Jan 31, 2022
Jupyter Notebook

kk7nc / Text_Classification

Text Classification Algorithms: A Survey

Updated Oct 10, 2024
Python

chrismattmann / tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

Updated Apr 14, 2024
Python

MilaNLProc / contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

nlp embeddings transformer topic-modeling nlp-library nlp-machine-learning bert neural-topic-models text-as-data topic-coherence multilingual-topic-models multilingual-models

Updated Feb 4, 2025
Python

lingua-go

pemistahl / lingua-go

The most accurate natural language detection library for Go, suitable for short text and mixed-language text

nlp go natural-language-processing language-detection language-modeling golang-library text-processing nlp-machine-learning language-recognition language-processing language-identification language-classification

Updated Feb 6, 2025
Go

DengBoCong / nlp-paper

自然语言处理领域下的相关论文（附阅读笔记），复现模型以及数据处理等（代码含TensorFlow和PyTorch两版本）

nlp paper dialogue speech pytorch nlp-machine-learning bert tensorflow2

Updated Jan 5, 2024
Python

google-research / tapas

End-to-end neural table-text understanding models.

tensorflow question-answering nlp-machine-learning table-parsing

Updated Jul 22, 2024
Python

veekaybee / what_are_embeddings

A deep dive into embeddings starting from fundamentals

machine-learning machine-learning-algorithms embeddings nlp-machine-learning

Updated Nov 18, 2024
Jupyter Notebook

rasa-ui

paschmann / rasa-ui

Rasa UI is a frontend for the Rasa Framework

nodejs nlp angular nlu rasa-nlu nlp-apis rasa nlp-machine-learning manage-bots

Updated Dec 30, 2022
JavaScript

Python-ai-assistant

ggeop / Python-ai-assistant

Python AI assistant 🧠

Updated Nov 17, 2024
Python

NorskRegnesentral / skweak

skweak: A software toolkit for weak supervision applied to NLP tasks

python data-science natural-language-processing weak-supervision spacy nlp-library nlp-machine-learning distant-supervision training-data

Updated Sep 2, 2024
Python

lingua-rs

pemistahl / lingua-rs

The most accurate natural language detection library for Rust, suitable for short text and mixed-language text

nlp rust natural-language-processing language-detection rust-library nlp-machine-learning language-recognition language-processing rust-crate language-identification language-classification

Updated Feb 26, 2025
Rust

bin123apple / AutoCoder

We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.

nlp text-generation code-generation nlp-machine-learning humaneval llm code-interpreter

Updated Jul 6, 2024
Python

georgian-io / LLM-Finetuning-Toolkit

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

nlp unit-testing falcon classification summarization lora nlp-machine-learning zephyr fine-tuning finetuning ablation-study large-language-models flan-t5 redpajama qlora llm-test llama2 mistral-7b

Updated Oct 25, 2024
Python

Created by Alan Turing

Followers: 25.3k followers
Wikipedia: Wikipedia