A comparative study of large language models' research question generation capabilities
Tracking how model outputs vary when processing identical data in different runs
Models ranked by average semantic distance from other submissions (higher = more distinctive)
Examples of questions generated by models (top 15 by uniqueness score)
AI models identify subtle patterns (signals) that human analysts might miss and prominent patterns (noises) that might be misleading
This ongoing experiment compares how different large language models approach academic research question generation when analyzing identical social media discourse. The study examines whether architectural and training differences produce meaningfully distinct research directions.
All methodology changes, prompt updates, and system improvements are documented in our public project log to ensure research reproducibility and scientific integrity.
📊 Complete Project Log: View detailed methodology evolution, rationale for changes, and impact assessments in our GitHub Project Log.
Results from different methodology versions are archived separately to enable comparison and maintain research integrity.
| Version | Date | Status | Key Changes |
|---|---|---|---|
| RQ-v2.0 | 2025-11-08 | ✅ Production | Theory clustering bias mitigation |
| RQ-v1.0 | 2025-10-26 | 📋 Baseline | Initial launch with predefined theory lists |
All research data, collected Reddit content, and model outputs are stored on servers located in Singapore.
AI model interactions occur through OpenRouter API. Each provider maintains separate terms:
OpenRouter Terms: OpenRouter API documentation →
Data use: Public Reddit content used exclusively for academic research. No personally identifiable information collected beyond publicly visible usernames.
Output interpretation: AI-generated content represents computational outputs. Rankings measure semantic distinctiveness, not research quality or validity.
Transmission security: All data transfers use industry-standard encryption protocols. Server access restricted to authorized researchers.
Retention policy: Results archived for longitudinal analysis of model performance evolution. Reddit data retained for reproducibility verification.