    Repositories list

    • Inforex

      Public
      Inforex is a web system for text corpora construction.
      JavaScript
      Other
      91113 Updated Nov 8, 2024
    • PUGG

      Public
      Python
      0000 Updated Aug 12, 2024
    • CLARIN-PL digital library based on DSpace
      Java
      Other
      1.3k001 Updated Jun 13, 2024
    • RetNet

      Public
      Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent, and chunkwise forward.
      Jupyter Notebook
      MIT License
      24000 Updated Apr 2, 2024
    • Jupyter Notebook
      MIT License
      2912 Updated Mar 28, 2024
    • Java
      GNU General Public License v3.0
      1007 Updated Feb 8, 2024
    • An advanced, extensible web front-end for the Manatee-open corpus search engine
      TypeScript
      GNU General Public License v2.0
      22000 Updated Dec 14, 2023
    • Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks. The initial version of the library focuses on the Polish language.
      Python
      MIT License
      336435 Updated Dec 3, 2023
    • klajster

      Public
      Python
      0001 Updated Nov 30, 2023
    • argilla

      Public
      ✨Argilla: the open-source data curation platform for LLMs
      Python
      Apache License 2.0
      377000 Updated Nov 26, 2023
    • LEPISZCZE

      Public
      This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
      Python
      MIT License
      21311 Updated Nov 24, 2023
    • doccano

      Public
      Open source annotation tool for machine learning practitioners.
      Python
      MIT License
      1.7k104 Updated Nov 9, 2023
    • 0000 Updated Oct 14, 2023
    • Source code for paper "From Big to Small Without Losing It All: Text Augmentation with ChatGPT for Efficient Sentiment Analysis" published at the 13th ICDM Workshop on Sentiment Elicitation from Natural Text for Information Retrieval and Extraction (SENTIRE) organized during the 23rd IEEE International Conference on Data Mining (ICDM 2023)
      Jupyter Notebook
      0300 Updated Oct 12, 2023
    • Source code for paper "Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems" published at the 13th ICDM Workshop on Sentiment Elicitation from Natural Text for Information Retrieval and Extraction (SENTIRE) organized during the 23rd IEEE International Conference on Data Mining (ICDM 2023)
      Jupyter Notebook
      MIT License
      0000 Updated Oct 3, 2023
    • Source code for paper "Capturing Human Perspectives in NLP: Questionnaires, Annotations, and Biases" published at the 2nd Workshop on Perspectivist Approaches to NLP at the 26th European Conference on Artificial Intelligence (NLPerspectives2 @ ECAI 2023)
      Jupyter Notebook
      MIT License
      1000 Updated Sep 17, 2023
    • 0000 Updated Jul 31, 2023
    • Liner2

      Public
      Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions and events.
      Java
      61250 Updated Jun 5, 2023
    • A simple client for doccano API.
      Python
      MIT License
      62000 Updated Apr 13, 2023
    • Temporal storage for LEPISZCZE datasets descriptions
      0000 Updated Mar 29, 2023
    • A tool for recognition of spatial expressions containing trajector, spatial indicator and landmark.
      Python
      0001 Updated Mar 24, 2023
    • Tool for named entity recognition for Polish based on deep learning.
      Python
      62911 Updated Mar 24, 2023
    • Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"
      Jupyter Notebook
      MIT License
      42900 Updated Mar 7, 2023
    • Single-page application built with Angular (http://polonjid-dictionary.clarin-pl.eu)
      TypeScript
      GNU General Public License v3.0
      0007 Updated Feb 4, 2023
    • Metadata sources for all service providers in the CLARIN Service Provider Federation
      Shell
      58000 Updated Jan 5, 2023
    • Polem

      Public
      Tool for lemmatization of multi-word phrases and named entities for Polish.
      HTML
      GNU Lesser General Public License v3.0
      4810 Updated Dec 6, 2022
    • Wordnet Visual Editor
      Java
      Other
      1104 Updated Nov 24, 2022
    • Multiword expressions detection methods based on the vector representations
      Jupyter Notebook
      MIT License
      1001 Updated Nov 2, 2022
    • Annotation Pro plugin utilizing ClarinPL speech tools and models
      C++
      MIT License
      0300 Updated Oct 12, 2022
    • MIT License
      0002 Updated Sep 23, 2022