
# Research Seminar

A weekly seminar where we read, present, and debate recent papers in data analysis and machine learning.


## Upcoming This Week



## Past Seminars
Thursday, Apr 2 · Seminar 01
Crash course on power-law networks, message passing framework, GNNs vs Transformers. Guest talk by Maria Sukhareva on retrieval-augmented machine translation: code switching, Pareto decoding, agentic neologism translation, and contrastive ensembling.
---
Thursday, Apr 9 · Seminar 02
Natural-language-to-PDDL constraint translation via two-stage decomposition. Multi-agent framework combining LLMs with symbolic planning: domain modeling, procedural memory, and self-reflection.
---
Wednesday, Apr 15 · Seminar 03
Graph Retrieval-Augmented Generation with ego-graph retrieval, soft pruning, and dual-view prompting. FastRAG pipeline for semi-structured data: entropy-based chunk sampling, schema learning, and hybrid KG+text retrieval.
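The ego-graph retrieval step described above starts from query-matched seed entities and collects their k-hop neighborhoods from the knowledge graph. A minimal BFS sketch of that step (the function name, toy graph, and entities are illustrative, not taken from the paper):

```python
from collections import deque

def ego_graph(adj, seed, radius=1):
    """Return all nodes within `radius` hops of `seed` (BFS over an adjacency list)."""
    seen = {seed: 0}          # node -> hop distance from the seed
    frontier = deque([seed])
    while frontier:
        node = frontier.popleft()
        if seen[node] == radius:
            continue          # do not expand past the radius
        for nbr in adj.get(node, ()):
            if nbr not in seen:
                seen[nbr] = seen[node] + 1
                frontier.append(nbr)
    return set(seen)

# Toy knowledge graph: entity -> neighboring entities
adj = {
    "aspirin": ["NSAID", "headache"],
    "NSAID": ["aspirin", "ibuprofen"],
    "headache": ["aspirin", "migraine"],
    "ibuprofen": ["NSAID"],
    "migraine": ["headache"],
}
print(sorted(ego_graph(adj, "aspirin", radius=1)))
# → ['NSAID', 'aspirin', 'headache']
```

In the full pipeline the retrieved ego-graphs would then be soft-pruned and serialized into the prompt; this sketch covers only the neighborhood-extraction step.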
---
Wednesday, Apr 22 · Seminar 04
LightRAG: graph-enhanced RAG with dual-level retrieval (local entities + global themes), 600x cheaper than GraphRAG. ThoughtTerminator: calibrating reasoning models by budgeting tokens upfront and mitigating overthinking via early exit.
---
Thursday, Apr 23 · Seminar 05
Sign.MT: open-source bidirectional sign-spoken language translation with modular pipeline and three rendering options (skeletal, 3D avatar, HumanGAN). EM-LLM: human-inspired episodic memory enabling infinite-context LLMs via surprise-based segmentation and graph-theoretic boundary refinement.
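The surprise-based segmentation idea in EM-LLM places episode boundaries where a token's surprise (negative log-probability under the model) spikes relative to recent history. A simplified sketch of that idea, using a running mean-plus-deviation threshold rather than the paper's exact formulation (function name and data are illustrative):

```python
import statistics

def segment_by_surprise(token_logprobs, gamma=1.0):
    """Split a token stream into episodes at high-surprise tokens.

    A boundary is placed before token t when its surprise -log p(t)
    exceeds mean + gamma * std of all surprises seen so far.
    """
    surprises = [-lp for lp in token_logprobs]
    boundaries = [0]
    for t in range(2, len(surprises)):  # need at least two tokens of history
        history = surprises[:t]
        mu = statistics.mean(history)
        sd = statistics.pstdev(history)
        if surprises[t] > mu + gamma * sd:
            boundaries.append(t)
    return boundaries

# Mostly predictable tokens (log-prob near 0) with two surprising ones.
logprobs = [-0.1, -0.2, -0.1, -3.0, -0.2, -0.1, -4.0, -0.2]
print(segment_by_surprise(logprobs, gamma=1.0))
# → [0, 3, 6]
```

EM-LLM additionally refines these initial boundaries with a graph-theoretic criterion over token similarity, which this sketch omits.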
---
Wednesday, Apr 29 · Seminar 06
TG-Talker: adapting LLMs for temporal graph link prediction via in-context learning with background set, example set, and temporal neighbors. First framework for applying LLMs to real-world temporal graphs with MRR-based evaluation and textual explanation generation.
---
Thursday, Apr 30 · Seminar 07
ChessLLM: first LLM to play complete chess games achieving Elo 1788 via FEN representation and long-round Stockfish data (+350 Elo). Fine-Grained Evaluation: event-level metrics and thematic analysis of reasoning failures (memory distortion, dissociation, character ambiguity) in Spyfall. SimUSER: AI agents with persona matching, knowledge-graph memory, and brain model for scalable recommender system evaluation.
---
Tuesday, May 6 · Seminar 08
Theoretical equivalence between multi-head attention in transformers and message passing in graph attention networks on fully connected graphs. Transformers framed as GNNs that won the hardware lottery thanks to dense matrix-multiplication optimization on modern GPUs. The speaker's independent experiments showed a GAT outperforming a transformer on both the Cora and AG News benchmarks.
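The equivalence discussed in Seminar 08 can be checked on a toy example: when every token is connected to every other token, GAT-style per-node message aggregation with dot-product scores reproduces scaled dot-product attention exactly. A minimal sketch (single head, no learned projections; all names are illustrative):

```python
import math

def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def attention(q, k, v):
    """Standard scaled dot-product self-attention over the whole sequence."""
    d = len(q[0])
    out = []
    for qi in q:
        w = softmax([dot(qi, kj) / math.sqrt(d) for kj in k])
        out.append([sum(wj * vj[c] for wj, vj in zip(w, v)) for c in range(len(v[0]))])
    return out

def message_passing(q, k, v, edges):
    """GAT-style view: node i aggregates messages v_j from its in-neighbors j,
    weighted by a softmax over the edge scores q_i . k_j / sqrt(d)."""
    d = len(q[0])
    out = []
    for i in range(len(q)):
        nbrs = [j for (j, dst) in edges if dst == i]
        w = softmax([dot(q[i], k[j]) / math.sqrt(d) for j in nbrs])
        out.append([sum(wj * v[j][c] for wj, j in zip(w, nbrs)) for c in range(len(v[0]))])
    return out

x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]                  # token states (q = k = v)
complete = [(j, i) for i in range(3) for j in range(3)]   # fully connected graph
a = attention(x, x, x)
b = message_passing(x, x, x, complete)
assert all(abs(u - w) < 1e-12 for ra, rb in zip(a, b) for u, w in zip(ra, rb))
```

On a sparse (non-complete) edge set the message-passing version restricts attention to graph neighbors, which is where the two architectures genuinely diverge.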

---
## Topics
- Architecture design, training acceleration, benchmarks, and explainability for temporal graph neural networks.
- Embedding, forecasting, and question answering on temporal knowledge graphs.
- Retrieval-Augmented Generation enhanced with graph structures: from LightRAG to domain-specific medical and legal applications.
- Using large language models as graph learners for node classification, link prediction, and text-attributed networks.
- Multi-agent systems, task planning with LLMs, and knowledge graph-powered agent memory.
- Chain-of-thought, graph-of-thought, formal theorem proving, and scaling reasoning capabilities.
- Sparse attention, diffusion LLMs, recursive models, and parameter-efficient methods.
- Comprehensive evaluations and surveys across deep research, scientific discovery, and web agents.
- Chain-of-knowledge prompting, graph-constrained reasoning, and knowledge graph-enhanced LLM inference.
- Detecting and mitigating LLM hallucinations via knowledge graphs, contrastive decoding, and topological analysis.
- Benchmarks and methods for extending LLM context: episodic memory, long-context QA, and citation generation.
- Instruction tuning, LoRA adapters, domain adaptation, SFT, and preference optimization.
- Named entity recognition, sentiment analysis, text classification, and table-to-text generation.
- Chunking strategies, embedding models, reranking, and retrieval pipelines for RAG systems.
- Community detection, subgraph mining, random walks, and relational database benchmarks.
- Entity extraction, relation extraction, ontology construction, and entity alignment with LLMs.
- Diffusion-based recommendations, graph convolution for re-ranking, and scaling recommender transformers.
- LLM-driven synthetic data generation, curation, and evaluation frameworks.
- LLM agents in social deduction games, role-playing evaluation, and user behavior simulation.
- Adaptive test-time compute allocation, budget-aware reasoning, and token efficiency for LLMs.
- KV cache quantization and compression for efficient long-context LLM inference.
- Training LLMs to reason in continuous latent space with adaptive compute allocation.
- RLHF optimization methods for language model alignment.
- Curriculum-based pretraining and RL strategies for improving LLM reasoning.
- Catastrophic forgetting mitigation in continual LLM training and unlearning.
- Safety guardrails via synthetic data and adversarial training for LLMs.
- LLM-in-the-loop active learning for efficient data annotation.
- Domain-adaptive post-training strategies for specialized LLM applications.
- GNN-based approaches to resource-constrained project scheduling under uncertainty.