# textDNA

> program to explore and analyze word usage across text collections of varying scale

**Wikidata**: [Q125523891](https://www.wikidata.org/wiki/Q125523891)  
**Source**: https://4ort.xyz/entity/textdna

## Summary
textDNA is a software program developed to explore and analyze word usage across text collections of varying scale. Created by Professor Danielle N. Szafir, it functions as a web-based tool that enables users to visualize and investigate linguistic patterns within digital texts. The project was initially conceived in 2009 and formally published in 2016.

## Key Facts
- **Definition:** A program for exploring and analyzing word usage across text collections.
- **Developer:** Danielle N. Szafir, a Professor of Computer Science.
- **Inception:** The project began in 2009.
- **Publication Date:** The software was published on October 10, 2016.
- **Platform:** Accessible via a web browser.
- **License:** The software is under an "all rights reserved" copyright status.
- **Source Code:** The repository is hosted on GitHub by the UW Graphics group.
- **Distribution:** Distributed as a web page.

## FAQs
### Q: What is the primary function of textDNA?
A: textDNA is designed to help users explore and analyze how words are used across collections of text that may vary in size and scale. It provides a visual interface for dissecting linguistic data.

### Q: Who created textDNA?
A: The software was developed by Danielle N. Szafir, who serves as a Professor of Computer Science.

### Q: Is textDNA open source?
A: While the source code is available in a public GitHub repository, the software is listed as "all rights reserved" and is fully copyrighted, rather than being released under a standard open-source license.

### Q: How can textDNA be accessed?
A: The tool is formatted as a web page and runs on a web browser. A live version is accessible via the University of Wisconsin–Madison Computer Sciences website.

## Why It Matters
textDNA addresses the challenge of visualizing and understanding linguistic patterns across disparate and large-scale text collections. In the fields of digital humanities and computer science, analyzing word frequency and distribution is often limited by the scale of the data or the complexity of the tools required. textDNA matters because it bridges this gap by offering a browser-based solution that abstracts away complex setup requirements, allowing researchers to focus directly on the content analysis. Its development by a Professor of Computer Science ensures a foundation in academic rigor, specifically within the domain of visualization and graphics. By allowing for the comparison of word usage across collections of varying scales, it provides a unique lens for textual scholarship that standard text editors or basic search tools cannot offer. Furthermore, the project's transition from a 2009 inception to a 2016 release highlights a sustained effort to refine the tool for public and academic use.

## Notable For
- **Scale Analysis:** Specifically designed to handle text collections of "varying scale," a distinct feature for comparative text analysis.
- **Web-Based Accessibility:** Unlike many analysis tools that require local installation, textDNA is fully functional within a standard web browser.
- **Academic Pedigree:** Developed by Danielle N. Szafir within a university Computer Science department, linking it to research in visualization.
- **Transparency:** Despite being copyrighted, the source code is publicly available for review on GitHub.

## Body

### Development and Origins
textDNA was created by Danielle N. Szafir, a Professor of Computer Science. The project has roots dating back to 2009, though it was not formally published until October 10, 2016. The development is attributed to the University of Wisconsin–Madison graphics group (uwgraphics), as evidenced by the source code repository.

### Technical Specifications
The software is categorized as a "non-tangible executable component" of a computer, essentially functioning as an interactive web application.
- **Platform:** Web Browser
- **Distribution Format:** Web page
- **Copyright Status:** All rights reserved
- **Repository:** `https://github.com/uwgraphics/Release-TextDNA`

### Availability and Access
textDNA is distributed via the internet and is accessible at specific URLs hosted by the University of Wisconsin–Madison Computer Sciences department.
- **Main Website:** `https://graphics.cs.wisc.edu/Vis/SequenceSurveyor/TextDNA.html`
- **Working Application:** `https://vep.cs.wisc.edu/TextDNA/live/templates/list.html`
- **Elmcip ID:** The project is indexed under ID 11216.

## References

1. [Source](https://graphics.cs.wisc.edu/Vis/SequenceSurveyor/TextDNA_About.html)