
AI Text Detector SLM

1. Client Context

The rapid proliferation of AI-generated text across professional and institutional environments
has introduced a new class of operational risk. Organizations can no longer assume that written
submissions, published content, or support communications are human-authored, and the
consequences of misclassification vary significantly by context.

Hiring teams encounter AI-generated job applications and assessments. Academic institutions
handle AI-assisted submissions. Content moderation pipelines must distinguish synthetic from
organic text. Editorial teams need to flag mixed-authorship articles before publication.
Compliance functions in regulated industries require verifiable evidence of human authorship
for audit purposes.

2. Problem

Detection must remain accurate even after deliberate paraphrasing or humanization. This
adversarial robustness requirement fundamentally reshapes training methodology, dataset
design, and feature engineering.

The key problems included:

  • Binary outputs without confidence:
    Most systems provide only AI/Human labels without risk scoring or uncertainty indicators.
  • Lack of explainability:
    Stakeholders cannot understand or validate why text is flagged.
  • Vulnerability to rewrites:
    Paraphrasing or humanization tools can bypass standard detectors.
  • High false positives:
    Structured writing styles and non-native language patterns are often misclassified.
  • No sensitivity calibration:
    A single threshold fails across different use cases.
  • No batch capability:
    Tools are limited to manual, single-input workflows.
  • No audit trail:
    Absence of documented outputs for compliance or review processes.

3. AI Approach

The solution is built on a fine-tuned RoBERTa classifier using LoRA (Low-Rank Adaptation),
trained on a hybrid dataset and enhanced with explainability and robustness layers.

Model Selection: RoBERTa with LoRA

RoBERTa was selected for its strong sequence classification capabilities and its sensitivity to
the stylistic patterns that distinguish AI-generated from human-written text.

LoRA Fine-Tuning Advantages

  • Injects trainable low-rank matrices into attention layers.
  • Reduces memory usage and training time.
  • Maintains performance while minimizing overfitting.
  • Enables rapid iteration on dataset improvements.
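The LoRA setup above can be sketched with the PEFT library. The model name, rank, and other hyperparameters below are illustrative placeholders, not the project's actual configuration:

```python
# Sketch: attaching LoRA adapters to a RoBERTa classifier with PEFT.
# "roberta-base" and all hyperparameters are illustrative assumptions.
from transformers import AutoModelForSequenceClassification
from peft import LoraConfig, get_peft_model, TaskType

base = AutoModelForSequenceClassification.from_pretrained(
    "roberta-base", num_labels=2  # 0 = human, 1 = AI
)

lora_cfg = LoraConfig(
    task_type=TaskType.SEQ_CLS,
    r=8,                                # rank of the low-rank update matrices
    lora_alpha=16,                      # scaling factor for the update
    lora_dropout=0.1,
    target_modules=["query", "value"],  # inject into attention projections
)

model = get_peft_model(base, lora_cfg)
model.print_trainable_parameters()  # only the small LoRA matrices train
```

Because only the injected low-rank matrices receive gradients, each dataset iteration retrains a small fraction of the parameters, which is what makes the rapid-iteration workflow described above practical.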

Hybrid Dataset Construction

A high-quality dataset was critical for adversarial robustness.

  • Public datasets:
    Kaggle baseline AI vs human text across domains.
  • Custom dataset:
    Includes paraphrased and humanized AI text, mixed-authorship documents, and
    domain-specific professional writing.

Including rewritten AI content taught the model to detect patterns that survive
paraphrasing, which became a key differentiator.
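A minimal sketch of how such a hybrid corpus might be assembled, assuming simple (text, label) pairs; the field names and `build_hybrid_dataset` helper are hypothetical, not the project's actual pipeline:

```python
import random

def build_hybrid_dataset(public, custom, adversarial, seed=42):
    """Merge labeled sources into one corpus.

    public / custom: iterables of (text, label) pairs.
    adversarial: paraphrased or humanized AI text; it keeps the 'ai'
    label so the model learns signals that survive rewriting.
    """
    rows = []
    for text, label in public:
        rows.append({"text": text, "label": label, "source": "public"})
    for text, label in custom:
        rows.append({"text": text, "label": label, "source": "custom"})
    for text in adversarial:
        rows.append({"text": text, "label": "ai", "source": "paraphrased"})
    random.Random(seed).shuffle(rows)  # deterministic shuffle for reproducibility
    return rows
```

Tracking the `source` of each example also makes it easy to measure accuracy separately on clean versus paraphrased text during evaluation.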

4. Technology Used

Core AI / ML Technologies

  • Fine-tuned small language models (SLMs)
  • RoBERTa-based sequence classification model
  • LoRA (Low-Rank Adaptation) fine-tuning
  • Transformer-based text classification
  • Adversarial training methodology
  • Hybrid dataset training using AI, human, and rewritten text
  • Explainability-driven modeling
  • Confidence-based risk scoring
  • Sensitivity-based threshold tuning
  • Robust pattern detection using stylistic and structural signals
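The confidence-based risk scoring and sensitivity tuning above can be sketched as a small mapping from the classifier's P(AI) to a risk band, with per-use-case thresholds. The profile names and threshold values are illustrative assumptions:

```python
# Hypothetical sensitivity profiles: (medium_threshold, high_threshold)
# on the model's probability that the text is AI-generated.
PROFILES = {
    "strict":  (0.50, 0.75),  # e.g. compliance review: flag aggressively
    "default": (0.65, 0.85),
    "lenient": (0.80, 0.95),  # e.g. hiring: minimize false positives
}

def risk_band(p_ai: float, profile: str = "default") -> str:
    """Map a probability to a low/medium/high risk band."""
    medium, high = PROFILES[profile]
    if p_ai >= high:
        return "high"
    if p_ai >= medium:
        return "medium"
    return "low"
```

The same model output thus yields different operational decisions per use case, which is the point of sensitivity calibration: one threshold cannot serve both a compliance audit and a hiring screen.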

Detection & Explainability Layer

  • Explainable AI (XAI) feature pipeline
  • Risk scoring with confidence bands
  • Span-level suspicious text highlighting
  • Linguistic pattern analysis
  • Burstiness and variability detection
  • Repetition and structure analysis
  • Model interpretability signals
  • Decision transparency mechanisms

Programming and Data Engineering

  • Python
  • Pandas
  • NumPy
  • JSON and CSV data processing
  • Pydantic for schema validation
  • Custom preprocessing pipelines
  • Feature engineering pipelines
  • Batch processing workflows

LLM Engineering / Infrastructure

  • Hugging Face Transformers
  • PyTorch
  • PEFT / LoRA fine-tuning framework
  • Efficient model training pipelines
  • Low-latency inference setup
  • Batch inference optimization
  • Model evaluation pipelines
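Batch inference optimization largely comes down to grouping texts into fixed-size chunks so the model processes padded batches instead of one request at a time. A minimal sketch, where `predict_fn` stands in for the fine-tuned model's forward pass:

```python
def batched(items, batch_size=32):
    """Yield fixed-size chunks of a list for batched model calls."""
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]

def detect_batch(texts, predict_fn, batch_size=32):
    """Run detection over many texts in chunks.

    predict_fn is a placeholder: it takes a list of texts and returns
    one score per text (e.g. a tokenizer + model forward pass).
    """
    scores = []
    for chunk in batched(texts, batch_size):
        scores.extend(predict_fn(chunk))
    return scores
```

Keeping the chunking logic separate from the model call also lets the same utility serve both the API layer and offline evaluation pipelines.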

Backend & API Layer

  • Python and FastAPI
  • REST API architecture
  • Detection serving layer
  • Batch detection APIs
  • Request validation and routing
  • Scalable backend design
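The serving layer described above might look roughly like the following FastAPI sketch. Every name here (the route, fields, and the stubbed `score` function) is an illustrative assumption, not the project's actual API:

```python
# Sketch of a detection endpoint with Pydantic request validation.
from fastapi import FastAPI
from pydantic import BaseModel, Field

app = FastAPI()

class DetectRequest(BaseModel):
    text: str = Field(min_length=1)  # reject empty submissions
    profile: str = "default"         # sensitivity profile name

class DetectResponse(BaseModel):
    p_ai: float
    band: str

def score(text: str) -> float:
    """Stub for the fine-tuned classifier; replace with real inference."""
    return 0.5

@app.post("/detect", response_model=DetectResponse)
def detect(req: DetectRequest):
    p_ai = score(req.text)
    band = "high" if p_ai >= 0.85 else "medium" if p_ai >= 0.65 else "low"
    return DetectResponse(p_ai=p_ai, band=band)
```

A batch endpoint would accept a list of `DetectRequest` items and delegate to the batched inference path, keeping validation and routing identical for both modes.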

Frontend & User Experience

  • React
  • Interactive analysis interface
  • Highlighted output visualization
  • Risk score display UI
  • Reporting dashboards

The architecture is modular, allowing the model, detection thresholds, and report templates
to be updated independently without a full redeployment.

5. Outcome / Business Value

The AI Text Detector SLM platform delivered significant operational, technological, and
compliance-driven value by transforming basic AI detection into a robust, explainable, and
enterprise-ready risk assessment system.

Key outcomes include:

  • Reducing manual effort in detecting AI-generated content across workflows.
  • Providing explainable detection outputs with highlighted reasoning and transparency.
  • Enabling risk-based decision-making through confidence scores and sensitivity calibration.
  • Improving robustness against paraphrased and humanized AI-generated text.
  • Reducing false positives, especially for structured and non-native writing patterns.
  • Supporting batch processing for large-scale enterprise text analysis.
  • Creating compliance-ready audit trails with exportable detection reports.
  • Enhancing efficiency in hiring, academic review, and content moderation pipelines.
  • Lowering infrastructure cost through efficient fine-tuned small language models.
  • Enabling scalable deployment with modular and API-driven architecture.

6. What Similar Companies Can Learn

Detection Must Be Explainable, Not Just Accurate

Many AI detection tools provide only a label, such as AI or Human, which is not sufficient in
high-stakes environments like hiring, compliance, or education.

Explainable outputs build trust, transparency, and defensibility in decision-making.

Adversarial Robustness Is No Longer Optional

With the rise of rewriting and humanization tools, simple detection models can be easily bypassed.

This requires training on adversarial and real-world datasets, not just clean benchmark data.

Efficiency Matters More Than Model Size

Instead of relying on large, expensive models, this project shows that fine-tuned smaller models
like RoBERTa with LoRA can deliver strong performance with lower cost and latency.

Companies should prioritize efficient, deployable AI systems over simply choosing bigger models.

Hybrid Data Strategy Is Critical for Real Performance

Public datasets alone do not capture real-world complexity.

Combining the following leads to better generalization and robustness in production environments:

  • Open datasets
  • Custom-curated data
  • Adversarial samples