Workflow Overview
Step 1: Input Your Content
Enter text directly or upload a file. The system automatically generates a unique fingerprint using MinHash and sentence-transformers.
Example Input
The New York Times reported today on latest developments in artificial intelligence. Machine learning models are transforming how we process and analyze data at scale. Companies are increasingly concerned about provenance of their training data.
Fingerprint Process
Content is tokenized, hashed, and stored as a MinHash signature in SQLite database for fast similarity search.
Supported Formats
Plain text, JSON, CSV. Future support: Images, audio, multi-modal datasets.
Step 2: Get Instant Matches
The Tracer Agent compares your fingerprint against 102+ known sources (NYT, Wikipedia, Reddit, academic papers) and returns similarity scores.
Example Response
{
"matches": [
{"source": "nyt-article-00001", "similarity": 0.92, "license": "COPYRIGHT", "risk": "HIGH"},
{"source": "wikipedia-ai-article", "similarity": 0.65, "license": "CC-BY-SA", "risk": "LOW"}
],
"risk_assessment": "HIGH",
"fingerprint": "0xa46cde1340ef41d9b929cd81099b8bc2",
"lineage_id": "lin_1234567890"
}Risk Levels
Risk is calculated based on similarity score and source license type.
Step 3: Register On-Chain
Optional: Register your dataset's provenance on Mantle L2 for immutable, publicly verifiable proof of origin.
Registration Output
{
"success": true,
"dataset_hash": "0xa46cde1340ef41d9b929cd81099b8bc2",
"transaction_hash": "0xcb3d0be2be507a7f3655d46dde00531d1aba261542dc08aba3d505894e101a85",
"block_number": 33066053,
"gas_used": "0.0005 MANTLE",
"explorer_url": "https://sepolia.mantlescan.xyz/tx/0xcb3d0be2..."
}On-Chain Benefits
- ✓Immutable records that cannot be altered
- ✓Public verification by anyone with explorer access
- ✓Timestamped proof of data origin
- ✓Derivative tracking for dataset forks
View Lineage Proof
After registration, view complete provenance chain for any dataset using lineage viewer or blockchain explorer.
Lineage Example
Dataset: 0xa46cde1340ef41d9b929cd81099b8bc2
Origins:
└─ arxiv-2301-001 (Attention Is All You Need)
License: MIT
Risk: LOW
Derivatives:
├─ fine-tuned-model-v1
└─ training-dataset-batch-2
Transaction: 0xcb3d0be2be507a7f3655d46dde00531d1aba261542dc08aba3d505894e101a85
Status: Confirmed on Mantle Sepolia Block #33066053