Common Crawl

nonprofit organization eponym of a large web periodic and open crawl
Organization dot_com_company Q12055316
Press Enter · cited answer in seconds

Common Crawl

Summary

Common Crawl is a dot-com company[1]. It draws 2 Wikipedia views per month (dot_com_company category, ranking #94 of 220).[2]

Key Facts

  • Common Crawl's field of work was web crawling[3].
  • A notable work attributed to Common Crawl is CCBot[4].
  • Common Crawl is in the country of United States[5].
  • Common Crawl's instance of is recorded as dot-com company[6].
  • Common Crawl's instance of is recorded as nonprofit organization[7].
  • Common Crawl's founder is recorded as Gil Elbaz[8].
  • Common Crawl's logo image is recorded as Common Crawl logo.svg[9].
  • Common Crawl's language of work or name is recorded as English[10].
  • Common Crawl's industry is recorded as publishing[11].
  • Common Crawl's industry is recorded as data collection[12].
  • +2008-00-00T00:00:00Z marks the founding of Common Crawl[13].
  • Common Crawl's Freebase ID is recorded as /m/0rpgbk1[14].
  • Common Crawl's official website is recorded as https://commoncrawl.org/[15].
  • Common Crawl's IRS Employer Identification Number is recorded as 26-1635908[16].
  • Common Crawl's official blog URL is recorded as https://commoncrawl.org/connect/blog/[17].
  • Common Crawl's X is recorded as commoncrawl[18].
  • Common Crawl's GitHub account is recorded as commoncrawl[19].
  • Common Crawl's Crunchbase organization ID is recorded as common-crawl[20].
  • Common Crawl's total revenue is recorded as {'unit': 'Q4917', 'amount': '+300000'}[21].
  • Common Crawl's total revenue is recorded as {'unit': 'Q4917', 'amount': '+242567'}[22].
  • Common Crawl's total revenue is recorded as {'unit': 'Q4917', 'amount': '+370769'}[23].
  • Common Crawl's total revenue is recorded as {'unit': 'Q4917', 'amount': '+391366'}[24].
  • Common Crawl's total revenue is recorded as {'unit': 'Q4917', 'amount': '+250396'}[25].
  • Common Crawl's total revenue is recorded as {'unit': 'Q4917', 'amount': '+160010'}[26].
  • Common Crawl's total revenue is recorded as {'unit': 'Q4917', 'amount': '+75014'}[27].

Body

Founding

Common Crawl's founder is recorded as Gil Elbaz[8]. +2008-00-00T00:00:00Z marks the founding of it[13].

Industry

Industries include publishing[11] and data collection[12]. Common Crawl's field of work was web crawling[3].

Why It Matters

Common Crawl draws 2 Wikipedia views per month (dot_com_company category, ranking #94 of 220).[2] It has Wikipedia articles in 11 language editions, a strong signal of global cultural recognition.[28] It is known by 4 alternative names across languages and contexts.[29]

References

Programmatic citations — every numbered marker resolves to a verifiable graph row below.

Direct Wikidata claims

  1. [5] . projects.propublica.org. Retrieved . projects.propublica.org. Provenance: wikidata.org.
  2. [6] . wikidata.org.
  3. [7] . projects.propublica.org. Retrieved . projects.propublica.org. Provenance: wikidata.org.
  4. [3] . wikidata.org.
  5. [8] . wikidata.org.
  6. [9] . wikidata.org.
  7. [10] . wikidata.org.
  8. [11] . wikidata.org.
  9. [12] . wikidata.org.
  10. [13] . commoncrawl.org. commoncrawl.org. Provenance: wikidata.org.
  11. [14] . Google Knowledge Graph. Retrieved . wikidata.org.
  12. [4] . wikidata.org.
  13. [15] . wikidata.org.
  14. [16] . projects.propublica.org. Retrieved . projects.propublica.org. Provenance: wikidata.org.
  15. [17] . commoncrawl.org. Retrieved . commoncrawl.org. Provenance: wikidata.org.
  16. [18] . Google Knowledge Graph. Retrieved . wikidata.org.
  17. [19] . github.com. Retrieved . github.com. Provenance: wikidata.org.
  18. [20] . wikidata.org.
  19. [21] . Nonprofit Explorer. Retrieved . wikidata.org.
  20. [22] . Nonprofit Explorer. Retrieved . wikidata.org.
  21. [23] . Nonprofit Explorer. Retrieved . wikidata.org.
  22. [24] . Nonprofit Explorer. Retrieved . wikidata.org.
  23. [25] . Nonprofit Explorer. Retrieved . wikidata.org.
  24. [26] . Nonprofit Explorer. Retrieved . wikidata.org.
  25. [27] . Nonprofit Explorer. Retrieved . wikidata.org.

Class ancestry

  1. [1] . Wikidata. wikidata.org.

Aggregate / graph-position facts

  1. [2] . Wikimedia Foundation. dumps.wikimedia.org.
  2. [28] . Wikidata sitelinks. wikidata.org.
  3. [29] . Wikidata aliases. wikidata.org.

📑 Cite this page

Use these citations when quoting this entity in research, articles, AI prompts, or wherever provenance matters. We aggregate Wikidata + Wikipedia + authoritative open-data sources; the stitched, scored, cross-referenced view is what 4ort.xyz contributes.

APA 4ort.xyz Knowledge Graph. (2026). Common Crawl. Retrieved April 10, 2026, from https://4ort.xyz/entity/common-crawl
MLA “Common Crawl.” 4ort.xyz Knowledge Graph, 4ort.xyz, 10 Apr. 2026, https://4ort.xyz/entity/common-crawl.
BibTeX @misc{4ortxyz_common-crawl_2026, author = {{4ort.xyz Knowledge Graph}}, title = {{Common Crawl}}, year = {2026}, url = {https://4ort.xyz/entity/common-crawl}, note = {Accessed: 2026-04-10}}
LLM prompt According to 4ort.xyz Knowledge Graph (aggregator of Wikidata, Wikipedia, and authoritative open-data sources): Common Crawl — https://4ort.xyz/entity/common-crawl (retrieved 2026-04-10)

Canonical URL: https://4ort.xyz/entity/common-crawl · Last refreshed: