Data Principles
At AiDE, we believe better AI starts with better data: data that is human-generated, consented, and responsibly handled.
This page explains, in plain language, how data flows through our platform and the principles we follow when collecting, validating, and delivering datasets.
(For legally binding terms, please refer to our Terms of Service and Privacy Policy.)
Our Core Principles
1. Consent Comes First
All data on AiDE is submitted by real people who knowingly choose to participate in specific projects. Contributors decide:
what data they submit,
which projects they participate in,
and whether to continue contributing.
We do not scrape personal data or collect data without contributor participation.
2. Contributors Retain Rights
Contributors retain ownership of their original submissions.
By participating, contributors grant AiDE a limited license to:
validate submissions,
assemble datasets,
and license approved, anonymized datasets to companies or distribution partners under defined commercial terms, including downstream sublicensing where applicable.
Contributors are compensated when their data is accepted and used.
3. Companies License Data — They Don’t Own It
Companies (or Users requesting data) receive a license to use delivered datasets in accordance with their agreement, not ownership of the underlying contributor data. License structures may include internal use or commercial redistribution rights, depending on the specific dataset agreement.
This helps:
protect contributor rights,
reduce downstream legal risk,
and ensure data is used as intended.
4. Anonymization by Default
Before delivery:
direct personal identifiers are removed or excluded,
datasets are aggregated and validated,
and contributors’ identities are never shared with buyers.
Some projects may involve sensitive domains (e.g. medical, legal, customer support). In those cases, additional safeguards and restrictions apply.
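As a rough illustration of what "direct personal identifiers are removed or excluded" can mean in practice, here is a minimal Python sketch. It is not AiDE's actual pipeline: the field names in DIRECT_IDENTIFIER_FIELDS and the function strip_direct_identifiers are hypothetical, and dropping named fields is only one step of anonymization (identifiers embedded in free text, for example, need separate handling).

```python
from typing import Any, Dict

# Hypothetical set of fields treated as direct identifiers (an assumption
# for illustration; a real project would define its own list).
DIRECT_IDENTIFIER_FIELDS = {"name", "email", "phone", "contributor_id"}


def strip_direct_identifiers(record: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of the record without direct identifier fields.

    The original record is left unchanged.
    """
    return {
        key: value
        for key, value in record.items()
        if key not in DIRECT_IDENTIFIER_FIELDS
    }


# Example submission with made-up values.
submission = {
    "name": "Example Person",
    "email": "person@example.com",
    "text": "How do I reset my password?",
    "language": "en",
}
cleaned = strip_direct_identifiers(submission)
```

In this sketch, cleaned keeps only the text and language fields, while the original submission is left untouched.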
5. Validation & Quality Control
All datasets go through validation before delivery, which may include:
automated checks,
human review by validators,
and format and schema verification.
Only data that meets project requirements is included in delivered datasets.
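The format and schema check described above could look something like the following Python sketch. The schema in PROJECT_SCHEMA and the helper names are assumptions made for illustration; real project requirements would be defined per project and would typically be combined with human review.

```python
from typing import Any, Dict, List

# Hypothetical project schema: field name -> required Python type.
PROJECT_SCHEMA = {"text": str, "language": str, "rating": int}


def schema_errors(record: Dict[str, Any],
                  schema: Dict[str, type] = PROJECT_SCHEMA) -> List[str]:
    """List every way the record fails the schema (empty list means valid)."""
    errors = []
    for field, expected_type in schema.items():
        if field not in record:
            errors.append(f"missing field: {field}")
        elif not isinstance(record[field], expected_type):
            errors.append(
                f"wrong type for {field}: expected {expected_type.__name__}"
            )
    return errors


def filter_valid(records: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Keep only records that pass the schema check."""
    return [record for record in records if not schema_errors(record)]
```

Returning a list of error messages rather than a single true/false value lets a validator see why a submission was rejected, which is useful when automated checks feed into human review.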
6. Transparency & Auditability
Delivered datasets may include:
manifests describing structure and contents,
file hashes to verify integrity,
and basic provenance metadata (project, date range, validation status).
These artifacts help keep datasets traceable, reproducible, and enterprise-ready.
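To show how file hashes and a manifest support integrity checks, here is a minimal Python sketch using SHA-256 from the standard library. The manifest layout, the choice of SHA-256, and the function names build_manifest and verify_manifest are assumptions for illustration, not a description of the format AiDE delivers.

```python
import hashlib
from pathlib import Path
from typing import Dict, List


def sha256_of_file(path: Path, chunk_size: int = 65536) -> str:
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(dataset_dir: Path, project: str, date_range: str,
                   validation_status: str) -> Dict:
    """Describe every file in the dataset plus basic provenance metadata."""
    files = sorted(p for p in dataset_dir.rglob("*") if p.is_file())
    return {
        "project": project,
        "date_range": date_range,
        "validation_status": validation_status,
        "files": [
            {
                "path": p.relative_to(dataset_dir).as_posix(),
                "bytes": p.stat().st_size,
                "sha256": sha256_of_file(p),
            }
            for p in files
        ],
    }


def verify_manifest(dataset_dir: Path, manifest: Dict) -> List[str]:
    """Return the paths that are missing or whose hash no longer matches."""
    mismatched = []
    for entry in manifest["files"]:
        target = dataset_dir / entry["path"]
        if not target.is_file() or sha256_of_file(target) != entry["sha256"]:
            mismatched.append(entry["path"])
    return mismatched
```

A recipient would run verify_manifest against the delivered files; an empty list means every file matches the hash recorded at delivery time.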
7. Retention & Deletion
Contributors may request deletion of their personal account information at any time.
Approved, anonymized datasets that have already been licensed to companies cannot be retroactively removed, but personal identifiers are not retained or exposed.
8. Ethical Use of Data
AiDE is built to support responsible AI development. We work with companies seeking:
consented data,
clear usage rights,
and high-quality human input.
We do not support unauthorized resale, deceptive data collection, or misuse of contributor data. Authorized commercial redistribution is governed by explicit license agreements.
Questions?
If you have questions about how data is handled:
Contributors can contact: privacy@aidemarketplace.com
Companies can contact: sales@aidemarketplace.com
For legal terms governing data use, please review our Terms of Service and Privacy Policy.