Software

Hypernova-60B

Hypernova-60B

Multiverse Computing · 2026

A 60B-parameter compressed LLM optimized for agentic workflows and tool use.

📒 Model

FLUX.1 Krea

FLUX.1 Krea (Krea 1)

Krea · Black Forest Labs · 2025

A 22B diffusion image model with superior aesthetic control and image quality, fully compatible with FLUX.1-dev.

📖 Blog · 📒 Model

GoLLIE

GoLLIE

HiTZ · 2024

A 34B guideline-following LLM achieving state-of-the-art zero-shot Information Extraction.

📖 Blog · 📒 Code

Latxa

Latxa

HiTZ · 2025

A Basque instruction-tuned LLM with performance comparable to GPT-4o and Claude Sonnet.

📖 Paper · 📒 Models

Medical-mT5

Medical-mT5

HiTZ · 2024

The first open-source multilingual text-to-text LLM for the medical domain.

📖 Paper · 📒 Model

Veridika.ai

Veridika.ai

Personal project · 2025

An AI agent framework for real-time fact-checking.

🔗 Online Demo

Projects & Tools

AI-Generated GTAV

AI-Generated GTAV

2025

A deep learning project that uses Diffusion Transformers (DiT) to generate Grand Theft Auto V driving footage.

GitHub Repository

NoticIA

NoticIA

2024

LLM finetuning and evaluation library for the NoticIA dataset of 850 Spanish news with human-written summaries.

GitHub Repository

Clickbait Fighter

Clickbait Fighter

2024

AI that generates one-sentence summaries of clickbait news articles. Trained on 8×A100; deployed with vLLM and Ray.

Link to the app

Sequence Labeling LLMs

Sequence Labeling with LLMs

2024

Sequence labelling with LLMs via Text2Text constrained generation, built on Transformers + Accelerate.

GitHub Repository

T-Projection

T-Projection

2023

High-quality annotation projection for sequence labeling datasets, built on Transformers + Accelerate.

GitHub Repository

LM Contamination Index

LM Contamination Index

2023

Manually curated database of contamination evidence for language models.

Web Page

Context-enriched NER

Context-enriched multilingual NER

2023

Candidate generation + knowledge-base linking + fine-grained classification using retrieved knowledge.

GitHub Repository

Easy-Translate

Easy-Translate

2023

Translate large text files with a single command. Easy for beginners, customizable for power users.

GitHub Repository

Easy Label Projection

Easy Label Projection

2022

Project labels across datasets using mGiza, FastAlign, SimAlign or AWESOME to generate resources for low-resource languages.

GitHub Repository

MetaVec

MetaVec

2021

Monolingual and cross-lingual meta-embedding generation and evaluation framework.

GitHub Repository

Self Driving Car in Video Games

Self Driving Car in Video Games

2019

Supervised deep network that learns to drive in GTA V from human-labelled data.

GitHub Repository