Community recipes for serving LLMs on RTX 3090. Multi-engine (vLLM, llama.cpp, SGLang) and model-agnostic. Currently shipping Qwen3.6-27B configs for 1× and 2× cards.
Orchestrate an entire AI dev team on as little as 5GB VRAM. An AI coding agent built like a systems engineer. Ephemeral context, zero token bloat, exact-match diffs. Stop wasting money on 10k token…
A pi extension that replaces the default footer with a live observability bar and provides a full dashboard command.
Skills for Real Engineers. Straight from my .claude directory.
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
An experiment: what if Gemma had a desktop app tuned for the model and for offline scenarios?
Autonomous experiment loop extension for pi
An open-source alternative to Claude Cowork (powered by opencode)
Search-based optimizer for MLX/Metal on Apple Silicon.
vLLM Metal plugin powered by mlx-swift — high-performance LLM inference on Apple Silicon
Community-maintained hardware plugin for vLLM on Apple Silicon
First public benchmark of llama.cpp speculative decoding on Qwen3.6-35B-A3B with a single RTX 3090 (post PR #19493 merge, 2026-04-19). 19 configurations covering ngram-cache, ngram-mod, and classic…
The original local LLM interface. Text, vision, tool-calling, training. UI + API, 100% offline and private.
A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.
A native macOS app to manage skills across coding agents — Claude Code, Cursor, Copilot CLI, Codex, Gemini CLI
Pi extension for async subagent delegation with truncation, artifacts, and session sharing
🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models
A curated list of autonomous improvement loops, research agents, and autoresearch-style systems inspired by Karpathy's autoresearch.
Fast GPU OCR server. 270 img/s on FUNSD. TensorRT FP16, PP-OCRv5, HTTP + gRPC.
I set out to implement TurboQuant (PolarQuant + QJL) for Gemma 4 31B's KV cache — a 31 billion parameter model running on a single Mac. It doesn't work on this model. What I built instead is faster.
A novel Metal kernel that implements TurboQuant so that Llama 3.1 70B can run on a consumer MacBook
a recursive self-improving harness designed to help your agents (and future iterations of those agents) succeed on any task
control your applications using pi-coding-agent. fully invisible.
Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tuning & inference on M-series silicon.
AI You Control: Choose your models. Own your data. Eliminate vendor lock-in.
Ask the oracle when you're stuck. Invoke GPT-5 Pro with a custom context and files.





