

Code for our (sebis) submission to ArchEHR-QA 2026 Shared Task (CL4Health @ LREC 2026)

Python 2 Updated Mar 17, 2026

[CVPR 2026] Garments2Look: A Multi-Reference Dataset for High-Fidelity Outfit-Level Virtual Try-On with Clothing and Accessories

Python 12 Updated Mar 20, 2026

[arXiv 2026] MoKus: This repo is the official implementation of "MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization"

Python 8 Updated Mar 16, 2026

Mind the Shift: Decoding Monetary Policy Stance from FOMC Statements with Large Language Models

Python 1 Updated Mar 17, 2026
Python 54 13 Updated Mar 18, 2026

Motivation in LLMs - code and data

Jupyter Notebook 1 Updated Feb 23, 2026

Unified KV cache management for multi-task VLA inference.

Python 3 Updated Mar 20, 2026

Sparsity as a Variance Regulator for Improved Depth Utilization in Language Models

Python 12 Updated Mar 17, 2026

A benchmark to measure AI progress on unsolved research problems in mathematics.

Python 15 Updated Mar 18, 2026

Reinforcing Grounded Video Reasoning via Visual-Perception Prompting

Python 6 Updated Mar 17, 2026

Project page for "Training-free Detection of Generated Videos via Spatio-Temporal Likelihoods" [CVPR 2026]

JavaScript 2 Updated Mar 15, 2026

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Python 24 1 Updated Mar 17, 2026

The official code of FineRMoE.

Python 19 Updated Mar 17, 2026

WebVR: Benchmarking Multimodal LLMs for WebPage Recreation from Videos via Human-Aligned Visual Rubrics

Python 7 Updated Mar 11, 2026

SING-analyzing-semantic-invariants-classifiers

Python 2 Updated Mar 17, 2026

[Arxiv] Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models

Python 28 1 Updated Mar 18, 2026

Meissa is a multi-modal medical agent built on a trajectory-based agentic behavior distillation framework.

Python 38 2 Updated Mar 17, 2026

ATM-Bench: A benchmark for long-term personalized memory QA spanning ~4 years of multimodal data (images, videos, emails). Features referential queries, evidence-grounded answering, and multi-sourc…

Python 11 Updated Mar 20, 2026

Code for the paper "RbtAct: Rebuttal as Supervision for Actionable Review Generation"

Python 1 Updated Mar 16, 2026

[CVPR2026] CodePercept: Code-Grounded Visual STEM Perception for MLLM

21 1 Updated Mar 12, 2026

Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams

6 Updated Mar 7, 2026
Python 9 Updated Mar 12, 2026

RETROAGENT: From Solving to Evolving via Retrospective Dual Intrinsic Feedback

Python 12 Updated Mar 19, 2026

Generate high resolution videos with a custom voice and appearance, based on LTX-2/LTX-2.3 + Identity In Context LoRA

Python 87 9 Updated Mar 20, 2026

Code for `LLM2VEC-GEN: Generative Embeddings from Large Language Models`

Python 49 2 Updated Mar 12, 2026

MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents

Python 9 Updated Mar 11, 2026

In-Context Reinforcement Learning for Tool Use in Large Language Models

Python 38 5 Updated Feb 4, 2026
Python 12 Updated Mar 16, 2026

Satellite-based causal attribution of coastal water clarity degradation to nickel smelting expansion at Indonesia's Morowali Industrial Park using Bayesian structural time series, multi-algorithm c…

Python 1 Updated Mar 10, 2026