20 Comments
Comment deleted
Expand full comment

Are you familiar with the 'smiling curve' of value add? Ben Thompson writes about this a lot.

https://stratechery.com/concept/aggregation-theory/smiling-curve/

I suspect that what these LLM's and Image Generation models will do is end up reducing a lot of 'intellectual grunt work' involved in a content pipeline that currently looks roughly like:

ideation -> implementation -> distribution

One big change recently was distribution going from 'expensive broadcast media' (newspaper, radio, TV) to 'cheap multicast media' (email, social media). That caused all kinds of problems.

What the LLM's might do is simply cheapen the _implementation_ portion of the pipeline, which could have the effect of moving all the value creation to the 'ideation' portion of the pipeline so that it looks more like this

ideation -> (neural network) -> (social media)

For example, i have a novel, kind of written. By this, I mean I have a one sentence, one paragraph form, one page form, etc. If I could feed this into an LLM trained on _my writing_ personally, that would be awesome.

This suggests something like the horses and railroads section for written content.

I seriously doubt the 'LLM's get better and everyone gets fired' scenario plays out, because most of what i pay for, when i pay for content, is the relationship itself. The thing I think I'm really paying for is "whatever that thing is that drives ideation", which i think is, itself, a function of personal values and lived experience.

It's also far easier to imagine lots of indie games with graphics that look pretty decent, since a HUGE portion of game development costs involve the absurd quantities of content developers need to create. I think you might even see an interplay of LLM's and art geneation software.

An artist puts into the LLM: "a description of a science lab in a high school in the 1980's, in the dark, with one computer screen on." The LLM then turns this description into a bunch of text describing all the objects and their positions. Then this text is parsed by an 3d image generation tool, which uses the text to generate a scene graph, and then renders all the individual objects in the scene graph in parallel.

The end result is we could be in a world where someone could write up a movie script and turn it into a decently produced feature length film, or write up a description of a short game: "players cooperate to herd cats through an escher-like landscape while pursued by the sleepwaking ghost of Werner Herzog" and a future version of unity spits out an actual playable binary.

Combine 'really cheap prototypes' with modern entrepreneurial models, and i think should likely see an explosion of cool indie shit coming around the corner.

Expand full comment

Great post. Exposing the many paths forward is a great thought exercise and has opened my mind to a structured way to explore these different progress metaphors. Thank you.

Expand full comment

Great post, thank you! This question is a bit normative, but which of these outcomes do you think would be the best for humanity? Which would you prefer to see? Feel free to interpret "best" in whatever way aligns with your values.

Expand full comment

In the early days of desktop publishing (aka 30 years ago) there was the great JPEG debate. Scanned images for four-color catalogs and magazines (there was very little digital photography yet) were stored as TIFF files with lossless compression. They were huge, particularly for for the storage and network standards of the time. But there was a fight on, with some people advocating JPEG compression, which sacrifices some accuracy for the ability to make the image smaller. At the time, the algorithms used for compression were crude compared to today, and they could leave ugly, visible artifacts in the images, particularly when they were printed at large sizes on high-quality presses. But the people pushing the tech said it would get better. Or that the general public wouldn't care, because they wouldn't notice the difference. And by the time digital cameras (which all use lossy compression) were commonly available a decade later, they were right. Now, everyone uses some form of lossy compression for images from the ones you see on the screen to glossy magazine covers. The original images taken by the camera are lossy, and each resizing or resaving just introduces new artifacts and nobody ever notices.

Expand full comment

The chess analogy does not quite go through yet, because chess is a competitive game with a fixed and fully transparent metric for success. There is never any doubt who won, and you can aggregate performance over multiple games to get a robust and undeniable sense of who or what is doing best. There are many domains that are like this, but writing isn't one of them. You could operationalize writing by making it into a competition, but that would always be vulnerable to people saying "that's not the kind of writing I want."

In that vein, there are some experiments that I'd like to do, such as formal debates or moot court sessions. Once LLMs start winning those competitions against humans, we could start to say that they really are better. And, as you say "who cares?" becomes an appropriate question. What if lawyers who rely heavily on LLMs begin to outperform those who don't? What if that starts to happen in cases argued before real courts? I think we would HAVE to care. I'm not sure how that would play out.

Arguably, being able to make appropriate use of technical means is a reasonable requirement for lawyers and other professionals. Indeed, some professions have specific rules about being up to speed technically. After all, the professionals have a duty to their clients, and if technical means can help, they should use them. A civil engineer who insisted on using only pencil and paper for calculations that a computer can do more easily and reliably would be disbarred in a heartbeat.

Expand full comment

One book I read years ago was by a woman economic historian (name forgotten) whose topic was the history of technology advancements and how their usage and application evolved over many years. Her main point was about how initially, humans would just hook up the new technology to the old process or plant and equipment and carry on. For example, we would take the old flour mill that water had driven and plug it into an electric motor. Real breakthroughs, often many years later, only came when we started to understand the distinctive essence of the new technology and we would design and build our process from the ground up to take into account the full potential of that new capability. So what LLMs transform may take a while to emerge…

Expand full comment

Viclav Smil (How the World Really Works) said that forecasting the effects of new technology is a mugs game, but if you have to do it, start by throwing out the Armageddon scenarios and the techno-utopian ones (like the Singularity), and then basically do what you did in this post: look what happened before when radically new technologies came along.

Even if forecasting is a mugs game, I agree with apxhard that LLMs will be used to do all the essential writing that no one wants to do - user manuals come to mind. The speed of LLMs might also make them valuable for generating up-to-the-minute news content. There's money in getting the news out first after all. Once you remove all the tedious and stressful writing, why would we want to stop doing the enjoyable kind?

Writing, when it comes down to it, is about communicating the thoughts in one persons brain via text into someone else's brain. We don't have a reason to care what an LLM is thinking, so if we know something is written by an LLM, will we care enough to make the effort to read it, once the novelty wears off?

You could argue that it's the quality of the idea that makes something interesting, but that's only true in some circumstances. Scientific ideas are judged on how well they represent reality and how much predictive power they have. Emotional writing, writing with emotional content, will probably always seem fake coming from an LLM. I think humans are likely to be almost as good at detecting fake emotional writing as fake facial expressions, but even if we fall for it at first, as soon as we find out there was no human behind it, I think we'll want to ditch it and go looking for the real thing.

Just a side note re # 8, check out the pre-fab housing market again before making your next real estate purchase. The state of the economy aside, housing seems almost as ripe for technological disruption as writing. The number of new companies and products coming out is almost as interesting to watch as the LLM space.

Expand full comment