Skip to the content.

Software

Veridika.ai

Veridika.ai

An AI agent framework for real-time fact-checking.

Online Demo

NoticIA

NoticIA

LLM finetuning and evaluation library for the NoticIA dataset of 850 Spanish news with human-written summaries.

GitHub Repository

Clickbait Fighter

Clickbait Fighter

AI that generates one-sentence summaries of clickbait news articles. Trained on 8×A100; deployed with vLLM and Ray.

Link to the app

GoLLIE

GoLLIE

Guideline-following LLM for Information Extraction; supports zero-shot schemas defined on the fly.

GitHub Repository

T-Projection

T-Projection

High-quality annotation projection for sequence labeling datasets, built on Transformers + Accelerate.

GitHub Repository

Sequence Labeling LLMs

Sequence Labeling with LLMs

Sequence Labelling with LLMs via Text2Text constrained generation built on Transformers + Accelerate.

GitHub Repository

LM Contamination Index

LM Contamination Index

Manually curated database of contamination evidence for LMs.

Web Page

Easy-Translate

Easy-Translate

Translate large text files with a single command. Easy for beginners, customizable for power users.

GitHub Repository

Easy Label Projection

Easy Label Projection

Project labels across datasets using mGiza, FastAlign, SimAlign or AWESOME to generate resources for low-resource languages.

GitHub Repository

Context-enriched NER

Context-enriched multilingual NER using knowledge bases

Candidate generation + KB linking + fine-grained classification using retrieved knowledge.

GitHub Repository

MetaVec

MetaVec

Monolingual and cross-lingual meta-embedding generation and evaluation framework.

GitHub Repository

Self Driving Car in Video Games

Self Driving Car in Video Games

Supervised deep network that learns to drive in GTA V from human-labelled data.

GitHub Repository