AI/ML Publications
Share your AI research with the world. Publish implementations, academic projects, industry solutions, and tutorials — with code, data, and documentation. All AI/ML work welcome.
ADVANCED AI RESEARCH & APPLICATIONS
PRACTICAL SOLUTIONS FOR REAL-WORLD CHALLENGES
Chen, M. · Patel, R. · Yılmaz, A.
We propose a scalable reward modeling framework that incorporates fine-grained human preference data to achieve better alignment in large language models, demonstrating a 23% improvement in human preference metrics.
Singh, K. · Ibrahim, F. · Nakamura, T.
We introduce MemAgent, a framework for persistent episodic memory in LLM-based agents that enables coherent multi-session interactions without context window limitations.
Lee, J. · Osei, A. · Müller, H.
A novel cross-modal grounding approach that enables language-guided visual reasoning in embodied environments, achieving state-of-the-art performance on ScanQA and EmbodiedScan benchmarks.
10 publications
We propose a scalable reward modeling framework that incorporates fine-grained human preference data to achieve better alignment in large language models, demonstrating a 23% improvement in human preference metrics.
We introduce MemAgent, a framework for persistent episodic memory in LLM-based agents that enables coherent multi-session interactions without context window limitations.
A novel cross-modal grounding approach that enables language-guided visual reasoning in embodied environments, achieving state-of-the-art performance on ScanQA and EmbodiedScan benchmarks.
A comprehensive framework for systematically red-teaming frontier language models across 12 risk categories, with automated discovery of novel attack vectors using adversarial agents.
We derive new empirical scaling laws governing multi-task learning in transformer models, demonstrating that task diversity can substitute for model scale at inference time under certain conditions.
A sparse attention mechanism specifically designed for temporal reasoning tasks, achieving 3.2x inference speedup with equivalent accuracy on benchmarks including TimeQA and TempReason.
We introduce LoRA-Merge, a gradient-aware composition algorithm that combines multiple LoRA adapters trained for different tasks without the catastrophic interference observed in naive weight averaging.
A family of lightweight vision transformer architectures for real-time object detection, achieving 45 FPS on mobile hardware while matching ResNet-50 baselines on MS-COCO.
A multi-agent deep RL approach for adaptive resource allocation in distributed computing systems, demonstrating 31% reduction in latency and 18% increase in throughput over heuristic baselines.
A regularised constraint-based algorithm for causal discovery in high-dimensional time series, with theoretical guarantees under non-stationarity and practical applications in financial markets.
Your Research Deserves an Audience
Publish in Under
an Hour.
Write in our Jupyter-like interface or upload existing notebooks. Link GitHub repos, embed visualisations, and use LaTeX for equations. Top publications are featured across our social media feeds on LinkedIn, X, and more.