The Shape of Our Words

Conversation Fractal and the Structure of Writing

It is odd to think that our words, which we consider to be ours uniquely, as having a defined global structure that can be mapped as a predictable form.

However, research shows that our speech is actually shaped as a fractal and has a predictable, mappable internal structure. Even more interesting, this structure can be seen across all people and time.

Intriguing new research from Google DeepMind shows that language has a deep memory that is set and can be described mathematically. The variable calculated is the Hurst variable. This variable establishes that human language carries a measurable long-range memory (that we live with without perceiving), and what we say is shaped by everything that was said before it, and this holds across all cultures, all domains, and all scales of expression. The conversation fractal can be mapped and compared by text.

How This Tool Works

This conversation fractal tool maps text to analyze its internal structure shape. Each sentence is broken down into twelve data points. These include word count, average word length, lexical diversity, punctuation density, question density, exclamation density, first person density, second person density, negation density, number density, mid-sentence capitalization, and syllabic complexity. Those twelve points are assembled as an aggregate and analyzed using Principal Component Analysis to establish three axes where the largest number of variations occur. The sentences are plotted on a 3D axis and connect as they occur in the text.

To the side, you can see two calculations. The Zipf slope establishes whether word frequency follows a fractal shape based on word distribution frequency, and the Hurst exponent measures whether the sentences were shaped by previous sentences or if they exist independently.

What is displayed is the map of the entirety of your text, and for natural writing, it does indeed form the shape of a 3D fractal.

CONVERSATION MANIFOLD
12 FEATURES · PCA · 3D
min 10 sentences for meaningful structure
CONVERSATION MANIFOLD
DRAG · ROTATE  ·  SCROLL · ZOOM  ·  HOVER · INSPECT

This tool is designed the show the beauty of these words we write.

Directions for Use

To use the Conversation Fractal tool, enter your text in the box or upload a file, and then click analyze.

The tool will return a 3D image, with Zipf and Hurst value to the side, as well as an interpretation of the results.

Frequently Asked Questions

What is the Zipf Law applied to language?

The Zipf-Mandelbrot Law applied to language postulates that when words are sorted by magnitude of appearance, word frequency is often inversely related to word rank.

What is the Hurst variable in language analysis?

The Hurst variable is a variable that looks at the fractal dimension of data. Values between .5 and 1 indicate that the data follows memory of the past (like a fractal with its established pattern), while 0 to .5 shows no such order.

What are twelve text points that the tool analyzes to graph variance?

Word count — the length of the sentence
Average word length — the mean number of characters per word
Lexical diversity — the ratio of unique words to total words
Punctuation density — frequency of commas, semicolons, and colons
Question density — how often question marks appear
Exclamation density — how often exclamation marks appear
First-person density — frequency of I, me, my, we, our, and related words
Second-person density — frequency of you, your, yourself, and related words
Negation density — frequency of not, never, no, can’t, won’t, and related words
Number density — how often numerical figures appear
Mid-sentence capitalisation — unexpected capital letters indicating proper nouns or emphasis
Syllabic complexity — the average number of syllables per word

What do the three color modes show?

Sequence mode colors each sentence point by the order it appears in the text; early sentences are blue, moving through to gold as the text progresses. This shows you the path of the text over time.
Energy mode colors each point by lexical complexity; sentences with longer, more unusual, and more varied words glow brighter.
Personal mode colors each point by first and second person density, how much the words I, me, my, we, and you appear in each sentence.