Plagiarism Seach Sensitivity Settings

The sensitivity settings let you control exactly how the plagiarism engine compares your text against potential sources — from how long a matching passage must be, to how closely it needs to resemble the original.

Important Warning

The sensitivity settings directly affect the plagiarism detection results of all future scans. Changing these settings can cause:

Relevant plagiarism to be missed — if settings are too strict

A large number of false positives — if settings are too loose

Unless you have a specific reason to change them, we strongly recommend keeping the default values.

Overview

The Sensitivity tab in Settings controls how the plagiarism engine compares your submitted text against potential sources. The settings are divided into two sections:

Text Comparison Settings — control how matches within individual passages are detected
Per-Source Sensitivity — control at what threshold an entire source is reported

All settings can be reset to their factory defaults at any time using the Reset to Default button.

The screenshot below shows the Sensitivity tab as it appears in the Settings dialog. The following sections explain each control in detail.

Minimal Phrase Length

Slider range: 10 – 250 characters | Default: 50 characters

This setting defines the minimum length a suspicious passage must have before it is reported as a match.

A lower value (e.g. 10 chars) flags even very short matching passages — as few as 2–3 words
A higher value (e.g. 150 chars) reports only longer passages, roughly a full sentence or more

Practical Guidance

For academic papers, the default of 50 characters (approx. 6–10 words) works well for most cases

Getting too many “noise” matches from short common phrases? Try increasing to 80–100

Checking very short texts or want to catch even brief verbatim passages? Lower to 20–30

Minimal Similarity (%)

Slider range: 20 – 100 % | Default: 40 %

This is the most important setting for controlling how strictly the engine matches text. It defines the minimum proportion of matching words that a suspicious passage must contain to be reported.

100 % — only word-for-word exact matches are accepted
Lower values — paraphrased, reordered, or slightly altered text is also flagged

How Similarity Is Calculated

Within a detected passage, the engine counts matching words (words appearing in both texts, with minor spelling variations tolerated) and divides them by the total number of words in the passage.

Example: A passage of 30 words where 18 match the source = 60 % similarity.

The Adaptive Similarity Threshold

The similarity threshold does not apply as a fixed value for all passages equally. For short passages, the required similarity is automatically raised toward 100 %, because very short passages that look similar are more likely coincidental. The threshold only drops to your configured minimum for passages of about 150 characters or longer.

This adaptive behavior is visualized live in the sensitivity diagram on the right side of the Sensitivity tab — the diagram updates instantly as you move the sliders.

Practical Guidance

Setting Meaning Use when…

40 % (default) Catches paraphrased text and reordered sentences Most academic use cases

60–70 % Only flags passages where the majority of words match Too many paraphrase-style false alarms with default

100 % Strict word-for-word matching only You want to detect verbatim copying only

Setting	Meaning	Use when…
40 % (default)	Catches paraphrased text and reordered sentences	Most academic use cases
60–70 %	Only flags passages where the majority of words match	Too many paraphrase-style false alarms with default
100 %	Strict word-for-word matching only	You want to detect verbatim copying only

Use Legacy Match Mode

Switch (on/off) | Default: Off

When enabled, this switch activates the Legacy Match Mode, an older matching algorithm that predates the current similarity-based approach.

In legacy mode:

The similarity slider is disabled — similarity is not evaluated at all
Every passage where words match sequentially is reported, regardless of how many non-matching words appear between them
The minimum phrase length is fixed at 20 characters, regardless of the slider setting
The sensitivity diagram is hidden — it does not apply in this mode

Legacy Mode Is Not Recommended

Legacy Match Mode was the original matching logic and tends to produce more false positives, particularly in texts that share common academic vocabulary. It is retained for backwards compatibility only — for example, to re-run old scans under identical conditions for direct comparison with previous results.

Minimal Source Length

Slider range: 10 – 50 words | Default: 15 words

This setting controls the minimum total number of plagiarised words a source must contribute before it appears in your report at all.

A lower value (e.g. 10 words) — sources with just a few matching words are still listed
A higher value (e.g. 30–50 words) — sources that match only briefly are filtered out entirely

This acts as a final gating filter applied after all individual passages have been detected. Even if a source contains multiple small matching passages, it only appears in the report if the total word count of all its matches meets this minimum.

Practical Guidance

Receiving many sources that contribute only a handful of matching words? Increase to 20–30

Working with short texts where even a small match may be significant? Decrease to 10–12

The Sensitivity Diagram

The chart on the right side of the Sensitivity tab provides a live visualization of your current settings. It shows the relationship between phrase length (horizontal axis) and the required minimum similarity (vertical axis) for a passage to be accepted as a match.

Reading the Diagram

Element	What it means
Blue S-curve	The required similarity at each phrase length. Slopes smoothly from 100 % for very short phrases down to your configured minimum for longer ones.
Shaded area	The zone of accepted matches. Any passage whose phrase length and similarity fall within this area is flagged as suspicious.
Left dotted line	Your configured Minimal Phrase Length. Passages shorter than this are never reported.
Right dotted line (150)	The reference point at which your configured Minimal Similarity fully applies.
Y-axis labels	100 % at the top; your configured minimal similarity at the flat part of the curve.
Blue dot on left line	The required similarity exactly at your configured minimum phrase length.

Example — Default Settings (50 chars, 40 %)

Phrase length	Required similarity	Result
30 characters	~90 %	Only near-identical short passages flagged
50 characters	~75 %	Minimum phrase length reached; moderate similarity required
100 characters	~52 %	About half the words must match
150 characters or more	40 %	Your configured minimum applies — paraphrasing is detected

Moving the Minimal Phrase Length slider shifts the left boundary of the shaded zone
Moving the Minimal Similarity slider raises or lowers the flat right portion of the curve

Presets and Reset

Reset to Default

Clicking Reset to Default restores all sensitivity values to their factory defaults:

Setting	Default Value
Minimal Phrase Length	50 characters
Minimal Similarity	40 %
Legacy Match Mode	Off
Minimal Source Length	15 words

When Do Settings Apply?

Settings take effect for all new plagiarism scans started after saving. They do not retroactively change the results of already-completed scans. Each completed scan stores its own settings snapshot, which is displayed in the report for reference.

Quick Reference

Setting	More sensitive (detects more)	More strict (reports less)
Minimal Phrase Length	Lower value	Higher value
Minimal Similarity	Lower value	Higher value
Minimal Source Length	Lower value	Higher value
Legacy Match Mode	On (no similarity filter at all)	—

Overview
Minimal Phrase Length
Minimal Similarity (%)
Legacy Match Mode
Minimal Source Length
The Sensitivity Diagram
Presets and Reset
Quick Reference

Plagiarism Seach Sensitivity Settings

Important Warning

Overview

Minimal Phrase Length

Practical Guidance