Primary supervisor
Asad MalikCo-supervisors
Large Language Models (LLMs) such as GPT, Llama, Qwen, and Mistral are increasingly used in commercial and academic applications. As more models become available, identifying which model generated a particular response becomes important for copyright auditing, model verification, and AI transparency.
Current fingerprinting methods often rely on manually selected benchmark questions. However, manually designing discriminative questions is time-consuming and may not capture unique behavioral differences between models.
This research proposes a multi-agent framework where several LLM agents collaborate to automatically generate and evaluate fingerprint questions for model identification.
Aim/outline
- Design a multi-agent system consisting of a question generator, evaluator, and refiner.
- Generate candidate fingerprint questions automatically.
- Evaluate how different LLMs respond to these questions.
- Measure the effectiveness of the generated questions for model identification.
- Compare multi-agent generated questions with manually selected questions.
URLs/references
References:
- Instructional Fingerprinting of Large Language Models (NAACL 2024)
https://aclanthology.org/2024.naacl-long.180/ - TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification (ACL Findings 2024)
https://aclanthology.org/2024.findings-acl.683/ - LLMmap: Fingerprinting for Large Language Models (USENIX Security 2025)
https://www.usenix.org/conference/usenixsecurity25/presentation/pasquini - AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
https://arxiv.org/abs/2308.08155 - CAMEL: Communicative Agents for Mind Exploration of Large Language Model Society
https://arxiv.org/abs/2303.17760
Tools:
- OpenAI API or local LLMs
- Hugging Face Transformers
- Sentence Transformers
Required knowledge
Must Have
- Python Programming
- Artificial Intelligence and Large Language Models (LLMs) Fundamentals
- Basic Data Analysis
- Research and Problem-Solving Skills
Recommended
- Natural Language Processing (NLP)
- Multi-Agent Systems
- Hugging Face / OpenAI APIs
- Prompt Engineering
- Machine Learning Evaluation Metrics
- Linux and Command Line Basics