Show HN: We fingerprinted 178 AI models' writing styles and similarity clusters
TL;DR Highlight
This study measured the similarity of writing styles of 178 AI models by analyzing them in 32 dimensions, and found that even among models with significant price differences, over 78% similar writing patterns were discovered.
Who Should Read
Developers who are 고민 중인 which AI model to use, or ML engineers researching AI-generated text detection or model selection criteria.
Core Mechanics
- The writing styles of a total of 178 AI models were quantified into 32 dimensions and their similarities were measured and clustered.
- The analysis revealed cases where models with very large price differences had over 78% overlap in writing style. For example, Gemini 2.5 Flash Lite Preview 06-17 and Claude 3 Opus recorded a similarity of 78.2%.
- The research team argued that 'using a cheaper model with the same writing style as an expensive model is equivalent to only paying for the brand.'
- Cluster analysis suggests clues as to whether a model borrowed parameters from another model or underwent a distillation process (a technique for transferring knowledge from a large model to a smaller one).
- The original website is blocked by Vercel security checkpoints, making direct verification difficult, but key claims and some methodologies of the study have been shared through community comments.
- Discussions also arose regarding whether unique patterns appearing in the writing of models (e.g., the use of '--' symbols) are natural byproducts of the RL (reinforcement learning) process, or intentionally inserted watermarks.
Evidence
- There was strong opposition to the claim that models could be substituted simply because their writing styles are similar. Users who have actually used several models pointed out that 'even if the styles are similar, there is a clear difference in their ability to understand my intentions, and the real cost is intelligence, not writing style.'
- Doubts about the reliability of the methodology were also raised. It is unclear whether the 32 dimensions used were derived through principal component analysis (PCA) or arbitrarily selected, and the criterion of '75% similarity = same writing' was criticized as being arbitrary and without justification. There were also criticisms that there was a lack of linguistic theoretical basis.
- The lack of disclosure of prompts and actual response content was criticized for making it impossible to verify the numbers. There was an opinion that 'benchmarks should show both prompts and responses to be meaningful, but without them, it's just numbers.'
- Interesting speculation emerged about whether the frequent appearance of unique symbols like '--' when using Claude or ChatGPT is a byproduct of RL training, or an intentional fingerprint to prevent AI-generated text from being re-introduced into the training data (model collapse).
- Experiences comparing several models and comparing the actual frequency of hallucinations were also shared. One user had the personal experience that Gemini lies less than OpenAI or Anthropic paid models, and speculated that this may be due to Google's better training data or higher utilization of RAG (Retrieval-Augmented Generation).
How to Apply
- If you are reviewing several models and cost reduction is your goal, do not decide on a substitute model based solely on writing style similarity, but conduct direct A/B tests on actual tasks (inference, coding, fact retrieval, etc.). Even if the styles are the same, the actual capabilities may differ significantly.
- If you are creating an AI-generated text detection system, you can extract specific writing patterns of a model (repeating symbols, sentence structure, etc.) as features, as in this study, and use them for model fingerprinting.
- If you want to monitor how much the writing style of a model distilled or fine-tuned in your service has changed from the original base model, you can refer to the 32-dimensional style analysis method of this study and build a pipeline to quantify style drift.
- If you need to share model selection decisions within your team, 'writing style similarity' should be just one reference indicator, and it is best to create a comprehensive evaluation table with actual performance (accuracy, hallucination frequency, response speed) and use it as evidence-based decision-making material.
Terminology
Related Papers
AMEL: Accumulated Message Effects on LLM Judgments
LLM을 자동 평가자로 쓸 때 이전 대화 기록의 긍정/부정 분위기가 이후 판단을 오염시킨다는 걸 75,898개 API 호출로 증명한 연구.
Language-Switching Triggers Take a Latent Detour Through Language Models
8B LLM에 심어진 백도어 트리거가 중간 레이어에서 언어 탐지기를 완전히 속이는 직교 부분공간(orthogonal subspace)으로 숨어 이동한다는 걸 회로 분석으로 밝혀냈다.
Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems
LLM이 규칙을 잘 지키고 있는지 감시하려면 LLM에게 맡기지 말고 LTL(시간 논리 공식) 기반 모니터를 쓰세요.
Bun Rust rewrite: "codebase fails basic miri checks, allows for UB in safe rust"
Anthropic이 인수한 Bun 런타임이 Zig 코드베이스를 AI로 Rust에 재작성했는데, 가장 기본적인 메모리 안전성 검사(miri)조차 통과하지 못하는 UB(Undefined Behavior)가 발견됐다는 이슈가 제기됐다.
MetaBackdoor: Exploiting Positional Encoding as a Backdoor Attack Surface in LLMs
입력 텍스트는 멀쩡한데 입력 길이만으로 LLM 백도어가 발동되는 새로운 공격 기법 발견.
Tell HN: Dont use Claude Design, lost access to my projects after unsubscribing
Claude Design 구독을 해지했더니 기존 프로젝트에 접근이 완전히 차단됐다는 사용자 경고로, AI 도구에 중요한 작업물을 의존할 때의 리스크를 잘 보여주는 사례다.