Advanced Essay

What might essays look like in the age of AI? (šŸŒ± First draft)

In the 1970s, decades before computers began to compete at the highest levels of chess, a new variant of the game was invented, called Advanced Chess. The idea was that instead of a human player competing against a computer, the human and machine would play together on the same side.

Thinking about this made me wonder about other forms of humanā€“AI collaboration. In particular, I became curious about how the existence of AI models on the readerā€™s side might fundamentally change the essay form.

First, what is an essay?

Whenever you start thinking about how to cleave apart knowledge into its constituent components and dynamically reassemble them, thereā€™s a risk of blurring the lines between the various forms of media entirely. So, for the purposes of this discussion, Iā€™m going to say that an essay consists of a sequence of words and other elements intentionally arranged by its author in a particular order.1 This includes computational essays, in which the essay itself is formed from a computational medium in which text and images can exist alongside of dynamic illustrations, simulations, and other forms of interactive media.2

Constraints on essays

Traditionally, essays have been written for humans to read, which imposes a constraint on the essay form (and other art forms), which is that human attention is finite and valuable. Every second, you are gifted with exactly one second of the readerā€™s attention, and you had better make good use of it.3

Iā€™ve sometimes wished for the ā€œextendedā€ version of an essay, or for a briefer version. But today, essays are typically provided in a one-size-fits-all fashion that does not adapt to the reader, and that the reader cannot easily adapt to themselves.

How does this change in the presence of reader-side AI models?

Compared to human attention, AI attention is potentially much, much cheaper. This lowers the cost of adding extra information to an essay so long as it does not interfere with the ā€œmain trackā€ designed for people to read.

Imagine if the essay came bundled with a whole bunch of extra material:

The idea of curating a context for a particular piece of writing seems like a very powerful idea.

So, what can the AI do with all of this extra information?

It can take advantage of not only the material provided by the author, but also the context it understands about the reader. The writer knows things the reader doesnā€™t, and the reader knows things the writer cannot.4

For example, the AI canā€¦

The User Interface

How would this look from the readerā€™s perspective? A few quick points here. Iā€™d love to experiment with this, but I donā€™t currently have the time!

Existing examples

Thereā€™s a lot of prior art, though of course none of it has been designed with the intention of LLM use, since LLMs only appeared on the scene after these works were published.

Foot-sidenotes

So, thatā€™s it for now. If I had had more time, I would have written a shorter essay. But in fact, there may be an advantage to this style of writing, and a future AI can condense the ideas context-specifically for individual readers based on their interests. šŸ™ƒ

In service of that future, here are some extended side-footnotes (or foot-sidenotes) that I (or my AI) might return to if I decide to tinker with this concept later.


1

This is probably only roughly right, but the key idea Iā€™m thinking about here is how we can augment a traditional essay, given the new possibilities and constraints of AI assistance on the reader side.

2

To date, the best instantiations of the computational notebook idea are found in Mathematica, which introduced the format, and Observable notebooks, which are an innovative browser-based take on some of the same ideas.

3

I spend a lot of my time thinking about data visualizations, which are not entirely unlike the written word. In that context, one consequence of the finiteness of human attention is that all of the visual elements on the screen are competing for the finite, precious resource of the readerā€™s attention, which makes it necessary to be very intentional about choosing what to show, emphasize, or hide. But there is no best visualization ā€“ itā€™s a function of not only the data, but also the audience and their interests and goals. Presenting an expert with a visualization design for a beginner will often cause them to be dissatisfied, as the simplifications that were necessary for basic comprehension precluded some of the more advanced insights, or omitted some important controls. . Fortunately, thereā€™s more of a practice of providing alternate views of the same data so that you can meet the reader (or user) where they are.

4

This feels a bit like late binding in dynamic programming languages, and like Juliaā€™s just-in-time-ahead-of-time compilation strategy. The idea is that the author of the library has written the logic, but does not know what concrete types their function will be called with. The user of the library has the values in their hand at the time they go to call the function, so the compiler just-in-time specializes the code and compiles an optimized version of the function for the exact types of values the user wants to call the function with. This is analogous to a writer who knows some things but lacks the full context of the readerā€™s experience level and interests, where the AI model can dynamically adapt the work to the interests of and capabilities of the reader.