    Knowledge Networks  
The materials shown on this page are copyright protected by
their authors and/or respective institutions.
W. Bradford Paley
Digital Image Design Incorporated
Project Description:
TextArc: Revealing Word Associations, Distribution and Frequency. TextArc is a tool designed to help people discover patterns and concepts in any text by leveraging a powerful, underused resource: human visual processing. It complements approaches such as Statistical Natural Language Processing and Computational Linguistics by providing an overview, letting intuition help extract meaning from an unread text. Here, an analysis of Lewis Carroll's Alice in Wonderland demonstrates TextArc's structure and some capabilities.

TextArc represents the entire text as two concentric spirals on the screen: each line is drawn in a tiny (one pixel tall) font around the outside, starting at the top; then each word is drawn in a more readable size. Important typographic features, like the mouse-tail shape of a poem at about two o'clock, can be seen because the tiny lines retain their formatting. Frequently used words stand out from the background more intensely.
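
The layout described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not TextArc's actual implementation: the function names `arc_position` and `layout`, the `inner_radius` parameter, the use of a circle, the clockwise direction, and the choice to scale intensity by relative word frequency are all hypothetical.

```python
import math
import re
from collections import Counter


def arc_position(index, total, radius=1.0):
    """Return (x, y) for item `index` of `total` on a circle.

    Assumption: items start at twelve o'clock and proceed clockwise.
    """
    angle = 2 * math.pi * index / total
    return (radius * math.sin(angle), radius * math.cos(angle))


def layout(text, inner_radius=0.8):
    """Place lines on an outer ring and words on an inner ring.

    Returns (line_points, word_points), where each word point is a
    (word, (x, y), intensity) tuple. Intensity is the word's count
    divided by the highest count in the text (an assumed scaling).
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    line_points = [arc_position(i, len(lines)) for i in range(len(lines))]

    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    top = max(counts.values(), default=1)
    word_points = [
        (w, arc_position(i, len(words), inner_radius), counts[w] / top)
        for i, w in enumerate(words)
    ]
    return line_points, word_points


if __name__ == "__main__":
    sample = "Alice was beginning\nto get very tired\nAlice sat"
    _, word_points = layout(sample)
    for word, (x, y), intensity in word_points:
        print(f"{word:10s} x={x:+.2f} y={y:+.2f} intensity={intensity:.2f}")
```

A renderer would then draw each line in a tiny font at its outer-ring point and each word at its inner-ring point, using the intensity value to make frequent words stand out from the background.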

Comments (1):
Is this really a "Knowledge Network"? I always think of those as visualizations of metadata that put items in the context of a wider picture, whereas this is more about laying out hidden structure within a document itself.

Posted by Tim Regan on May 14, 2007 at 8:01 AM (GMT)
