Skip to content

Commit

Permalink
universe-package-quelquhui (#13514) [ci skip]
Browse files Browse the repository at this point in the history
Co-authored-by: Ines Montani <ines@ines.io>
  • Loading branch information
thjbdvlt and ines authored Sep 10, 2024
1 parent 54dc4ee commit 0190e66
Showing 1 changed file with 20 additions and 1 deletion.
21 changes: 20 additions & 1 deletion website/meta/universe.json
Original file line number Diff line number Diff line change
Expand Up @@ -4552,6 +4552,26 @@
},
"category": ["standalone"]
},
{
"id": "quelquhui",
"title": "quelquhui",
"slogan": "Tokenizer for contemporary French",
"description": "A tokenizer for French that handles inword parentheses like in _(b)rouille_, inclusive language (won't split _relecteur.rice.s_,but will split _mais.maintenant_), hyphens (split _peut-on_, or _pouvons-vous_ but not _tubulu-pimpant_), apostrophes (split _j'arrive_ or _j'arrivons_, but not _aujourd'hui_ or _r'garder_), emoticons, text-emoji (_:happy:_), urls, mails and more.",
"github": "thjbdvlt/quelquhui",
"code_example": [
"import spacy",
"import quelquhui",
"nlp = spacy.load('fr_core_news_lg')",
"nlp.tokenizer = quelquhui.Toquenizer(nlp.vocab)"
],
"code_language": "python",
"author": "thjbdvlt",
"author_links": {
"github": "thjbdvlt"
},
"category": ["pipeline"],
"tags": ["tokenizer", "french"]
},
{
"id": "gliner-spacy",
"title": "GLiNER spaCy Wrapper",
Expand Down Expand Up @@ -4579,7 +4599,6 @@
"category": ["pipeline"],
"tags": ["NER"]
}

],

"categories": [
Expand Down

0 comments on commit 0190e66

Please sign in to comment.