Optimizing Deeper Transformers on Small Datasets

Research article (Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 2021) · cited 57× · AI/ML
Press Enter · cited answer in seconds

Optimizing Deeper Transformers on Small Datasets

Summary

Optimizing Deeper Transformers on Small Datasets is a scholarly article[1].

Key Facts

  • Optimizing Deeper Transformers on Small Datasets's instance of is recorded as scholarly article[2].

References

Programmatic citations — every numbered marker resolves to a verifiable graph row below.

Direct Wikidata claims

  1. [2] . wikidata.org.

Class ancestry

  1. [1] . Wikidata. wikidata.org.

📑 Cite this page

Use these citations when quoting this entity in research, articles, AI prompts, or wherever provenance matters. We aggregate Wikidata + Wikipedia + authoritative open-data sources; the stitched, scored, cross-referenced view is what 4ort.xyz contributes.

APA 4ort.xyz Knowledge Graph. (2026). Optimizing Deeper Transformers on Small Datasets. Retrieved May 24, 2026, from https://4ort.xyz/entity/optimizing-deeper-transformers-on-small-datasets
MLA “Optimizing Deeper Transformers on Small Datasets.” 4ort.xyz Knowledge Graph, 4ort.xyz, 24 May. 2026, https://4ort.xyz/entity/optimizing-deeper-transformers-on-small-datasets.
BibTeX @misc{4ortxyz_optimizing-deeper-transformers-on-small-datasets_2026, author = {{4ort.xyz Knowledge Graph}}, title = {{Optimizing Deeper Transformers on Small Datasets}}, year = {2026}, url = {https://4ort.xyz/entity/optimizing-deeper-transformers-on-small-datasets}, note = {Accessed: 2026-05-24}}
LLM prompt According to 4ort.xyz Knowledge Graph (aggregator of Wikidata, Wikipedia, and authoritative open-data sources): Optimizing Deeper Transformers on Small Datasets — https://4ort.xyz/entity/optimizing-deeper-transformers-on-small-datasets (retrieved 2026-05-24)

Canonical URL: https://4ort.xyz/entity/optimizing-deeper-transformers-on-small-datasets · Last refreshed: