Porter Olsen: It’s my great plea­sure to intro­duce Darius Kazemi. He began his career as a game devel­op­er, which he did for ten years and worked on mas­sive­ly mul­ti­play­er online games includ­ing Lord of the Rings Online, which I played a great deal. But of course if it said Tolkien I was inter­est­ed to at least check it out. Then he decid­ed to write an arti­cle titled Fuck Video Games” and that was the end of that.

But he pur­sued his career as an author. He’s recent­ly released a book on Jagged Alliance 2, a turn-based squad strat­e­gy game. He’s per­haps most well-known for his work in the weird Internet,” specif­i­cal­ly bot-making and a pro­gram that he wrote called Random Shopper, which goes to Amazon and sys­tem­at­i­cal­ly spends fifty dol­lars a month buy­ing him stuff which you actu­al­ly buy, right? [To Darius] And then it gets shipped to you and you see what you get.

He also writes Twitter bots includ­ing a muse­um bot that goes to a col­lec­tion and ran­dom­ly shows you some of the con­tent that is not nor­mal­ly seen in the nor­mal course of vis­it­ing the muse­um. I’m very much look­ing for­ward to the time at which Random Shopper buys for you Jagged Alliance the book. Thank you for being here. We’re excit­ed to hear your talk, and I’ll turn it over to you.

Darius Kazemi: Hi, every­body, I’m Darius Kazemi and I hope I can live up to the abstract that I sent in ever so long ago. I do want this to be a dia­logue and not just me up here so I do have a talk pre­pared. It it not going to fill a full hour because I expect to talk to you all while we’re here.

That was a pret­ty good intro­duc­tion. As Porter men­tioned I spent the first part of my career in video games. I’m actu­al­ly going to talk about that a lit­tle bit because it relates to what I want to dis­cuss today.

Screenshot; Darius standing at a podium with a slide showing a large list of his 2013 projects

Stuff I made in 2013″ at Darius’ blog

I make a lot of stuff. This is the projects that I released in 2013. Actually a few more than that. This is like 18 months of projects, and each one of these is—my book that took me a year to write is one entry on here, and lit­tle projects that took me an hour to release are also list­ed on here. I con­sid­er them all of equal val­ue because I firm­ly believe that there is almost no cor­re­la­tion between how much work into some­thing and whether or not it will make an impact, peo­ple will like it, that sort of thing. All I know is that I like all this stuff.

I’m here at MITH today, and I want­ed to talk a lit­tle bit about dig­i­tal human­i­ties from my posi­tion as an inter­est­ed out­sider. I’ve always kept a fin­ger in acad­e­mia, at first through game stud­ies and peo­ple study­ing video games, and more recent­ly through elec­tron­ic lit­er­a­ture and those fields. I’m not going to go into a what is it?” debate because I know every­one who’s in dig­i­tal human­i­ties is very tired of those, but we know when we see it, right? But I do want to talk about what an out­sider thinks dig­i­tal human­i­ties is, which is prob­a­bly best encap­su­lat­ed by the Ngram Viewer. It’s this sort of Big Data approach to the human­i­ties that has been writ­ten about a lot, and peo­ple have writ­ten about why it’s bad from a cul­tur­al and insti­tu­tion­al cri­tique and those stand­points, and how it’s tied to Silicon Valley trends, tied up with the cor­po­ra­ti­za­tion of the uni­ver­si­ty, and so on and so forth. I’m not that inter­est­ed in cov­er­ing those angles, but what I would like to bring to this for a bit is my own expe­ri­ence using these sorts of ana­lyt­i­cal tools onto the at least some­what human­is­tic enter­prise of the video game.

I spent five years work­ing as a data ana­lyst for online worlds. I start­ed work­ing at Turbine in Boston. That was in 2005 and we were just begin­ning to under­stand what we had on our hands in terms of data. When I start­ed work­ing there on Dungeons & Dragons Online, we had a serv­er that was col­lect­ing data about what our play­ers did on the servers, but we had no report­ing on it. No one was using it. They were just these data­bas­es that got slow­ly filled over time, until occa­sion­al­ly some admin would go and write a script that would blow away any­thing over six months. That data wasn’t even engi­neered for the pur­pose of track­ing play­er behav­ior. It was most­ly engi­neered by engi­neers who in the course of their bug-fixing decid­ed it would be help­ful to write a track­er to do this, or a track­er to do that. By the time the game was launched, we had all these track­ers that were col­lect­ing data that was meant to be tech­ni­cal, but could also be ana­lyzed to learn things about the way that play­ers act­ed.

But no one was look­ing at this infor­ma­tion at all. I learned about this serv­er and I was this enter­pris­ing 22 year-old with an engi­neer­ing degree and a minor in rhetoric, so I thought I could do any­thing. I fig­ured that I could build soft­ware that turned that data into dash­boards of infor­ma­tion that we could then act on. So I did that. I was work­ing as a tester at the time, but I got per­mis­sion to do this project on the basis that it would save us mon­ey and time test­ing if we could out­source a lot of our test­ing to the play­ers them­selves. We could find out that peo­ple are wip­ing out in a par­tic­u­lar dun­geon if we get sta­tis­tics on whether or not peo­ple are suc­ceed­ing in a dun­geon.

This was a huge suc­cess inter­nal­ly at Turbine. In an 18 month span, we went from no for­mal report­ing appa­ra­tus at all to a team of four peo­ple ded­i­cat­ed almost exclu­sive­ly to col­lec­tion and inter­pre­ta­tion of data, with myself as the chief inter­preter. I like to say that we cre­at­ed this cul­ture of data at Turbine. We went from a com­pa­ny that didn’t talk about these things to a com­pa­ny where design­ers would come to me before they even had imple­ment­ed a new idea that they had, and said, I have these con­cerns. What can we do to track data about our play­ers that can then tell me once this fea­ture goes live whether or not it was a suc­cess?”

And this was new in the indus­try at the time, as well. I believe there was pret­ty much one oth­er com­pa­ny that was doing data analy­sis at this lev­el, and that was EA on The Sims Online. Most of the oth­er com­pa­nies were sim­ply scrap­ing log files and occa­sion­al­ly look­ing at num­bers here and there. But at best a com­pa­ny would have a sin­gle engi­neer who would do this part-time because it seemed like a good idea. So this was all very new. This was about four years before Zynga real­ly pop­u­lar­ized social online games and col­lect­ing met­rics and tun­ing their game based on that, to the point where I had to go out to con­fer­ences and try and con­vince peo­ple in video games that it might be a good idea to col­lect data on their users.

Virtual worlds are real­ly inter­est­ing for this kind of work because in a vir­tu­al world, you’re pre­sent­ed with what seems like this closed epis­te­mo­log­i­cal sys­tem, where in the­o­ry if it’s hap­pen­ing in the sim­u­la­tion, you tech­ni­cal­ly have per­fect data on what’s hap­pen­ing with­in the para­me­ters of that sim­u­la­tion. It seemed like we were solv­ing epis­te­mo­log­i­cal prob­lems inher­ent to user stud­ies and play-testing. We no longer had to deal with all that messy Well are they real­ly play­ing like they would if we’d brought them into the com­pa­ny? Are they say­ing nicer things about the game because we bought them piz­za?” All the way down to hairi­er prob­lems than that as well. So it seemed like we were doing this very sci­en­tif­ic thing.

What I’ve learned from that expe­ri­ence is that using data can get you very far…until it doesn’t any­more. One of the hard­est lessons that I learned, and I learned it slow­ly over the course of three years, is that you’re almost always con­firm­ing sus­pi­cions that you already had. It’s unclear whether you’re com­ing in ask­ing biased ques­tions, and in fact even the very nature of the sta­tis­tics them­selves can be called into ques­tion as to whether or not they’re accu­rate or mean any­thing.

We built all sorts of visu­al­iza­tions. This was from my con­sult­ing com­pa­ny that I had after that. We looked at peo­ple play­ing Quake and built these beau­ti­ful visu­al­iza­tions of how they did it, and you could look at that and make con­jec­tures about it. Next up is an exam­ple that I don’t think I’ve shown before. This is a cap­ture from a tool that I built inter­nal­ly at Turbine. This isn’t even live data, this is from our beta serv­er. I built this tool called the Economy Explorer, and we used it to look at trade net­works in the game and sort of fer­ret out who the gold farm­ers were, and who’s doing real­ly strange things trad­ing with a bunch of place­hold­er char­ac­ters and then fig­ur­ing out who the mule is and cut­ting them out of the net­work, and all that kind of stuff.

[Here Darius plays a clip ~11:1211:55 in the video show­ing the appli­ca­tion.]

It was very excit­ing work, but this was the build­ing of a tool to help an author­i­tar­i­an appa­ra­tus in the game, essen­tial­ly. This was the tool that we used to ban peo­ple for doing things that we didn’t like in the game.

When it game down to actu­al insight and fig­ur­ing out fun­da­men­tal things about our game and learn­ing about it, I used to get this ques­tion at con­fer­ences all the time and I always hat­ed it, which is, What’s the most sur­pris­ing insight you uncov­ered through your data?” I nev­er had a good answer for this ques­tion. Usually I’d default to some­thing like Well the big sur­prise was that there were no big sur­pris­es.”

And yeah, we’d uncov­er unusu­al stuff but it usu­al­ly just mys­ti­fied us. We were smart peo­ple, I had a strong human­i­ties back­ground in addi­tion to a tech back­ground. I knew things, I had cross-disciplinary knowl­edge, like I thought I was smart enough to slice and dice it and fig­ure out what this stuff meant, but the more I sliced and diced it, the more it was like infi­nite regres­sion of nest­ing dolls. You just open up this box and then there was anoth­er box, and then you open up that box and anoth­er box. It’s prob­a­bly a human head at the very bot­tom.

We’d uncov­er things like a whole bunch of peo­ple are buy­ing this par­tic­u­lar kind of item. Sometimes we’d look at it and at best we would notice that the item’s pret­ty over-powered for the price. Well great, that’s use­ful, but it’s not sur­pris­ing. It’s not an insight that our play­ers are opti­miz­ing for the sys­tems that we told them to opti­mize for. Over and over and over again, it just told us things are going the way you prob­a­bly expect them to go, but maybe they’re going worse or maybe they’re going a lit­tle bet­ter” and that was pret­ty much it.

People would want to know about mon­ey and pur­chas­ing habits, and that’s what we actu­al­ly got out of the realm of the epis­te­mo­log­i­cal shield that we had set up because now you’re deal­ing with humans with cred­it cards, and there were pri­va­cy laws and all these oth­er things. So I sort of said okay, I’ll do the busi­ness ana­lyt­ics on that side because I have to for the busi­ness peo­ple, but on this oth­er side is the game stuff look­ing at the game as a work on its own mer­it, and I can still slice and dice that and learn things. Over and over again we either learned that our sus­pi­cions were found­ed, which either meant that our sus­pi­cions were always right or that the way we were col­lect­ing this data was just set up to con­firm our sus­pi­cions, or we found weird things. For exam­ple, often­times when we’d find some­thing weird it was like this sto­ry I’m about to tell.

So we have mon­sters run­ning around this world, because of course you’re not going to have a beau­ti­ful vir­tu­al world with­out mon­sters get­ting in your way all the time. We put in a few hooks to the arti­fi­cial intel­li­gence of the mon­sters because some of the engi­neers were wor­ried about the pathfind­ing for the mon­sters; that they would get stuck in cer­tain places. So essen­tial­ly a mon­ster would walk around, and at any point in its pro­gram if it didn’t know what to do next and said I lost my march­ing orders. I’m com­plete­ly lost. I have no idea what to do.” it would ping out those coor­di­nates and it would just say Okay, I got con­fused here.”

So said I could make a map of all these places the mon­sters get con­fused and maybe I could find some weird place where the geom­e­try is bad and needs to be fixed in the game. So we pull up the map, and the map looks almost like an exact grid, like all these dots but the dots are all clus­tered along these lines mak­ing this grid. I had no idea. I thought there was a bug in my code that col­lect­ed this data or some­thing. I took it to a much more expe­ri­enced engi­neer who took a look at it and said, I know exact­ly what that is.” Our sys­tem is divid­ed into land blocks, and the AI just gets con­fused when it’s basi­cal­ly cross­ing a latitude/longitude line into anoth­er par­cel of the serv­er. It gets momen­tar­i­ly con­fused and then reori­ents itself. Invisible to a play­er, but got sent out to the serv­er.

So that’s weird and sort of inter­est­ing, but it reminds me of this. Anyone who’s ever searched for fuck” on the his­tor­i­cal ngrams site, you prob­a­bly were mild­ly sur­prised to note that there’s a lot of f-bombs being dropped in 1650, until you go to Google Books and actu­al­ly search for that word in that time peri­od. Then it turns out that it’s just the word suck” or some­times such” depend­ing on the font, and the OCR picks it up the wrong way. That reminds me of find­ing that weird grid pat­tern with those arti­fi­cial intel­li­gence enti­ties in the game. It’s equiv­a­lent to those AI seams.

This infor­ma­tion often tells us more about the soft­ware plat­form col­lect­ing the infor­ma­tion and the appa­ra­tus that we’re using than the actu­al con­tent that we are con­cerned with, which is fas­ci­nat­ing to me because I like meta stuff and I like soft­ware plat­forms and all that sort of thing. I think that’s great and inter­est­ing and I would read a paper on OCR errors in Google Books and what that means. But yeah, data analy­sis often tells you more inter­est­ing things about the method­ol­o­gy than the sub­ject.

So we end­ed up with these mas­sive epis­te­mo­log­i­cal prob­lems any­way. We weren’t avoid­ing them at all, we weren’t being tricky or sneaky and find­ing some secret way to have an objec­tive view of what we were look­ing at. We were just cre­at­ing new epis­te­mo­log­i­cal prob­lems. We were just mov­ing things around.

I often equate data analy­sis with cri­tique as a prac­tice. They’re both based on decom­pos­ing things, break­ing them out into parts, fig­ur­ing out how they work. I want to talk a lit­tle bit about this idea of com­po­si­tion over cri­tique. When I say com­po­si­tion, I mean this in the sense of Bruno Latour’s Compositionist Manifesto” pub­lished back in 2010. When Latour talks about com­po­si­tion, he sets it up not in oppo­si­tion but sort of as an alter­na­tive to cri­tique. He comes from a sci­ence and tech­nol­o­gy stud­ies back­ground, so he’s an STS guy, and his point is that cri­tique only gets you so far. The way he puts it is if you have a sledgehammer—not a reg­u­lar hammer—or a wreck­ing ball, there’s pret­ty much one thing you can do with that, and it’s super use­ful for clear­ing out old debris and all that sort of thing. But what hap­pens when you’re done with that? I’ll quote him at length here. He’s talk­ing about the dif­fer­ence between cri­tique and com­po­si­tion. He says:

The dif­fer­ence is not moot, because what per­forms a cri­tique can­not also com­pose. It is real­ly a mun­dane ques­tion of hav­ing the right tools for the right job. [The sledge­ham­mer can] break down walls, destroy idols, ridicule prej­u­dices, but you can­not repair, take care, assem­ble, reassem­ble, stitch togeth­er. It is
no more pos­si­ble to com­pose with the para­pher­na­lia of cri­tique than it is to cook with a see­saw. Its lim­i­ta­tions are greater still, for the ham­mer of cri­tique can only pre­vail if, behind the slow­ly dis­man­tled wall of appear­ances, is final­ly revealed the nether­world of real­i­ty. But when there is noth­ing real to be seen behind this destroyed wall, cri­tique sud­den­ly looks like anoth­er call to nihilism. What is the use of pok­ing holes in delu­sions, if noth­ing more true is revealed beneath?
Bruno Latour, An Attempt at a Compositionist Manifesto’”

This in turn reminds me of some­thing that I read ear­li­er this year in the dig­i­tal human­i­ties milieu from Mark Sample. He wrote a blog post called Difficult Thinking about the Digital Humanities” and I’ll quote him as well here. I’m sort of tak­ing him a lit­tle bit out of con­text, but I hope he’ll for­give me. He says

Two hall­marks of dif­fi­cult think­ing are imag­in­ing the world from mul­ti­ple per­spec­tives and wrestling with con­flict­ing evi­dence about the world. Difficult think­ing faces these ambi­gu­i­ties head-on and even pre­serves them, while facile think­ing strives to elim­i­nate complexity—both the com­plex­i­ty of dif­fer­ent points of view and the com­plex­i­ty of incon­ve­nient facts.
Mark Sample, Difficult Thinking about the Digital Humanities

He’s sort of draw­ing a line between that facile think­ing and cri­tique, and he says, I’m dis­sat­is­fied with that word crit­i­cal” and all its variations—that’s why my for­mu­la­tion empha­sizes dif­fi­cult think­ing over facile think­ing.” I don’t think it’s a coin­ci­dence that at the same time that he wrote this essay, he was also in the mid­dle of an incred­i­bly pro­lif­ic peri­od of writ­ing about and build­ing Twitter bots. He wrote this great essay about bot arche­types where he talks about closed bots ver­sus green bots. @everyword is a great exam­ple of a closed bot. It takes a dic­tio­nary and it iter­ates through the dic­tio­nary and it tweets every word in that dic­tio­nary, and then it’s done. That’s a closed bot.

A green bot is some­thing that might source from the greater world. Sample’s bot @whitmanfml is a great exam­ple of that. It takes quotes from What Whitman which are a closed cor­pus and then mix­es them with the #FML hash­tag, source from Twitter which is an open cor­pus, and that’s what he’s talk­ing about when he talks about green bots and how green bots have this capac­i­ty to sur­prise and all these oth­er things.

I don’t think it’s a coin­ci­dence that he was think­ing about these things at the same time. Another great exam­ple of a bot that he built is more of an activist bot, sort of the equiv­a­lent of— It’s activist in the sense that a Brechtian play is activist. He calls it an exper­i­ment in spec­u­la­tive sur­veil­lance, and it comes up with all these tweets about who NSA PRISM has caught doing what. If you fol­low this, it keeps you on your toes, essen­tial­ly, as you fol­low it through your feed.

What I want I want to chal­lenge peo­ple in dig­i­tal human­i­ties to think about is how you can reverse the polar­i­ty, so to speak, on the tools of dig­i­tal human­i­ties. If you’re learn­ing things about the way texts are con­struct­ed or how they’re sit­u­at­ed, that kind of thing, and you’re learn­ing these things through analy­sis be it through data like ngrams or any­thing like that, by all means pub­lish the results of that sta­tis­ti­cal analy­sis or visu­al­iza­tions or what­ev­er. But I think it’s a real­ly impor­tant exer­cise to also try turn­ing that research on its head and using your find­ings to gen­er­ate things that go out into the world.

A real­ly enlight­en­ing exam­ple for me is the reg­u­lar expres­sion. For those who don’t know, a reg­u­lar expres­sion is a com­put­er sci­ence con­struct, it’s math­e­mat­i­cal con­struct, it’s a lin­guis­tic con­struct. It’s one of these great things that sort of cuts across all sorts of dis­ci­plines, and you can write math­e­mat­i­cal proofs that a reg­u­lar expres­sion is also a state machine from this oth­er field, and it’s also iso­mor­phic to this and that. But for peo­ple in dig­i­tal human­i­ties espe­cial­ly you might be famil­iar with the reg­u­lar expres­sion because it is one of the most com­mon tools for extract­ing text from oth­er text.

So if I want to find every set of comma-separated phras­es in a book, I could write a reg­u­lar expres­sion that essen­tial­ly says find a block of let­ters and then a com­ma, fol­lowed by an arbi­trary num­ber of blocks of let­ters and a com­ma, fol­lowed by maybe an and’ maybe not if it’s an Oxford Comma or what, and then ter­mi­nat­ing in a ter­mi­na­tor like a peri­od or an excla­ma­tion point or any­thing like that. (I’m a pro­gram­mer; it took me ten years to real­ly inter­nal­ize reg­u­lar expres­sions.) You feed that in, and then you feed in the book, and then it spits out on the oth­er end the match­es for all that. You can also use reg­u­lar expres­sions to find and replace things. So if I want to find all of those comma-separated phras­es and replace them with the word meow,” I can do that very eas­i­ly with a reg­u­lar expres­sion.

What I hadn’t real­ized in ten or twelve years of using reg­u­lar expres­sions is that they also can be turned on their head. You can use a reg­u­lar expres­sion as a gen­er­a­tive tool. There are tools out there where you can put in a reg­u­lar expres­sion that might say— If you wrote a reg­u­lar expres­sion that said find every instance of two vow­els adja­cent to one anoth­er” you could also feed it into a machine that says giv­en the rule two vow­els adja­cent to one anoth­er’ gen­er­ate every string of char­ac­ters that val­i­dates that rule.” and then it would spit out a few hun­dred vow­el pairs for you. You could get weird­er than that. You can feed in sets of syl­la­bles and you could have it gen­er­ate whole new lan­guages and that sort of thing based on this tool that is nor­mal­ly used for analy­sis and decom­po­si­tion and tak­ing things apart. You can turn it on its head and make it into some­thing gen­er­a­tive.

I want to show anoth­er exam­ple here. This is on my mind right now because this is some­thing that is hap­pen­ing at the moment. I orga­nize this thing every November called NaNoGenMo, and it’s like NaNoWriMo, which is National Novel Writing Month where peo­ple sign up and they pledge to write a 50,000 word nov­el over the course of the month of November. Last year I posit­ed that we could do NaNoGenMo where you’d pledge to write code that gen­er­ates a 50,000 word nov­el over the course of November. So we have a bunch of peo­ple who sign up and do this. Academics, pro­gram­mers, peo­ple who are nei­ther. We have 61 projects going right now. Some of them are already done. This one’s great; it’s called 50,000 Meows, and you just feed it a book and we can get the new all-meow ver­sion of Moby Dick. Much improved, I think. It retains the syn­tax and cap­i­tal­iza­tion and all that sort of stuff. So it still has the shape of Moby Dick.

I’m work­ing on some­thing where I’m look­ing at peo­ple who have stud­ied plot­ting for nov­els and reverse-engineering their analy­sis of the way plots in nov­els work to gen­er­ate some­thing approx­i­mat­ing a nov­el and part of what this does is it points out the holes in their mod­el as well. So it does sort of oper­ate as cri­tique as well. I pulled all this stuff from one of these how to write a nov­el for a begin­ner” things, and this per­son has the ten steps” of a nov­el. You start with sta­tus quo, then some­thing hap­pens, then the char­ac­ter makes a deci­sion to act. So I read through this whole thing and took it extra­or­di­nar­i­ly lit­er­al­ly, and I built a short first pass of this, which is

If you’re inter­est­ed, there’s a resource thread where peo­ple post resources they think might be use­ful, any­thing from tools for nat­ur­al lan­guage pro­cess­ing to cor­pus­es of data that they think might be inter­est­ing. I learned about some­one who com­piled 10,000 plot ele­ments that could pos­si­bly be in a book, so I’m prob­a­bly going to dig into that pret­ty soon as well. To me as an out­sider, this is all dig­i­tal human­i­ties work as well. We’re ana­lyz­ing the form of what’s out there and then inter­nal­iz­ing it. In this case we’re writ­ing an algo­rithm based on that, and then spit­ting it back out in the world.

This is the same thing as when I did @twoheadlines. This was me look­ing at a lazy form of joke on Twitter and going, I could para­me­ter­ize this and just gen­er­ate this joke for­ev­er.” I don’t know if Mark Sample would con­sid­er this an activist bot, but I con­sid­er it activist for a very nar­row band of cul­tur­al activism where I want to kill a par­tic­u­lar kind of joke. And I’ve sort of suc­ceed­ed. On a week­ly basis, some­one tells a lazy joke like this and then some­one else rather innocu­ous­ly goes, Oh that’s just like @twoheadlines!” And I’d like to think that they look at it and feel bad.

Much of what I do is just notic­ing pat­terns in how peo­ple com­mu­ni­cate in dig­i­tal media and fig­ur­ing out how to repli­cate that and send it back out in the world. When I see peo­ple doing just analy­sis, I’m always hold­ing my breath wait­ing for the next step, which is what are you going to do with that? What are you going to make with that? Analysis for its own sake is fine, I like analy­sis for its own sake, but at that point that I’m wait­ing for some­one else to take that and see what they’re going to do with it.

I think that’s it for the planned part of my talk here. I’m inter­est­ed in open­ing it up to ques­tions or com­ments or any­thing.

This is just a lit­tle bonus. I built this this morn­ing while I was work­ing on my talk because I acci­den­tal­ly built it. This just gen­er­ates a ran­dom ngram. So if you want to do some analy­sis, there’s a tool for that now. This was sort of inspired by this great web site Spurious Correlations. This algo­rithm has a zil­lion dif­fer­ent data sets and finds ones that cor­re­late well, and then it says, Well here you go. Drawn your own con­clu­sions.”


Audience: But you're not actually looking for sequences that are correlated, they're just…

Darius: No. They're just random words from the dictionary, but sometimes you'll notice that "polarization" and "folklore" are pretty well correlated, and what does that tell us about the history of the written word?

Audience: That'll be for your next talk.

Audience 1: I'm wondering if your novel generation algorithm could be forked and turned into a scholarly article generator.

Darius: There are already tons of scholarly article generators. That's a well-trodden path. It's usually people from the hard sciences, who create generators that generate sort of deconstructionist tomes and then submit them. [cross-talk comment from audience member] Random post-modern stuff, and then submit them to actual journals and see who they can get to actually accept them for publication. And this has happened. But it's also gone the other way. There are scientific papers as well that have been equally, sadly, accepted by journals, too. You should Google it. There's a lot of good stuff there.

Audience 2: My day job is over in the English department, so I'm going to try to ventriloquize a question from a kind of cranky senior colleague who, were such a person sitting here, he or she might point out that if you look at Literature with a capital "L," the kind of literature that we expect to do real cultural work. You look at titles and get things like War and Peace, you get Crime and Punishment. These are the big questions, right. And the projects that you've shown, one of them is about killing a certain type of joke on Twitter. How would you address a kind of cranky skeptic who sort of wanted to know something about the potential for this form to address the big questions, the human condition?

Darius: I'd show them [Last Words] to start with. This is occurrences of the word "love" in last words of executed Texas death row inmates. This is a corpus that I got my hands on. Someone just pointed me to it, and this is one of those 45 minute projects for me. I didn't even set out to build this, I just said "I have this weird set of data." I built software that scraped out all the last words. I wrote a program that just put the entire text of people's last words in front of me and just sort of flashed it by like flash cards. I noticed the word love appeared many many times. I said "what if I filter for the word 'love?'" It came up with this, and I said, "I'm done. That's it. It's done. Ship it." That was over lunch at work one day. And this has been used by actual anti-death penalty advocates. They use this in their workshops. A Christian newspaper wrote an article about this and Christ and the death penalty, and can Christians really support the death penalty, that sort of thing.

So maybe not NaNoGenMo, yet. I don't think we're going to have War and Peace, but I do think there's plenty of potential to address larger issues.

Audience 2: And one version of that question is kind of this cranky hypothetical colleague, but another version is the concern that a lot of what we often see is either derivative or parasitic in the sense that it's a kind of in-joke if you will. So the 50,000 meows is funny, but because we have National Novel Writing Month, so you already have to be in on that as a kind of Internet meme or phenomenon in order to then appreciate this wonderful derivative work, and that's a kind of barrier to entry.

Darius: Right. NaNoGenMo itself is my loving jab at NaNoWriMo because it is very strange to have an event where the definition of novel is "50,000 sequential words." So yes, I do agree there's very often a bit of a barrier there. On the other hand, I think you could make the argument that what I'm seeking to do— It's that whole public intellectual versus university and publications and that sort of thing type of debate. I'm interested in deploying things to public networks like Twitter and Facebook and the Internet at large, where people are more likely to laugh at a reference to NaNoWriMo than they are to laugh at a reference to Foucault.

Audience 2: But ultimately the same sort of underlying message…

Darius: Yeah. I think it's a lot of the same work that's being done. And much like with the debates over the public intellectual and their role, they can go on forever as well.

Audience 3: How do you see the field of software criticism or cultural criticism through software evolving? What's next?

Darius: I don't know what's next for the evolution of cultural criticism or opposition through software. I see more and more work done with— I was talking earlier about this today. In my head for a long time, I've had this divide between software that does work like utility software, and software that does art. You have utility bots that you can tweet at a bot and say "Remind me at 12pm to have lunch." and then it would tweet at you at 12pm and say, "Have lunch." That's a Twitter bot. It's not really an art bot, I don't think, but maybe you could make a case for it. As opposed to my Metaphor-a-minute, which just cranks out metaphors all the time, which is much more situated as an art thing.

But it's strange because there are also activist bots. I'm hosting something called Bot Summit on Saturday [November 8, 2014] in Boston, where a bunch of practitioners are getting together both online and in person to talk about this sort of stuff. One of the things we're going to talk about is this whole question of activist bots. There's a strange tension between activism through bots and the terms of service of places like Twitter, which don't let you unsolicited reply to people. So I've had people say, "Can you help me make a bot that finds people tweeting bad facts about climate change and gives them the data to help them come around on it." And I tell them no, I can't do that because that bot will get banned automatically by Twitter as spam, because technically it is spam. They did not sign up to get harassed by an entity for things that they are tweeting. So it makes it very hard to do activist work on Twitter, but within activism you have Mark Sample's NSA PRISM bot, which is activism. It is also art. I drew a connection to Brecht on purpose there. It's possible to do art and activism at the same time. It's also possible to write something on a sign that says "Down with So-and-so" and go out in the street, and that is activism as well, and that's not art.

So I'm very interested in the future of activist bots and activist software. There's something that someone built called Block Together, where you can as a community come up with a blacklist of people who are toxic to your community. And then everyone in your community can authenticate with Block Together and press a button and simultaneously block all 5000 of those people. So members of your community can be protected from having to deal with members of another community they might find toxic. But it stays within the bounds of Twitter's terms of service. That's very interesting work as well.

In terms of trends, I've noticed a lot more visual things happening on Twitter recently. There's ⋆✵tiny star fields✵⋆ by Katie Rose Pipkin, which has 20,000 followers, blows away any of my creations, and just tweets these beautiful, serene little visual compositions. And I love @ARealRiver, which tweets individual things that look like a sort of scene by a river. As you scroll down you'll notice it's continuous, and you can just scroll forever and follow this river. There's a boat in there, there's a horse. If you go all the way down, you'll hit a mountain range that persists for quite some time, and then there's metropolitan areas it's passed through. It's a persistent landscape on Twitter. It's very interesting and it's very exciting to me. So I'm noticing a lot more visual stuff playing with emoji as well, but I can't predict the future. Hopefully I get a better sense when we have Bot Summit this weekend.

Audience 5: I wanted to ask you about the term "Weird Internet." Can you give us the background on that term, and also talk more broadly beyond Twitter where you see the weird Internet?

Darius: Sometimes when I say Weird Internet, some people thing I'm talking about Weird Twitter, which is its own subset of surreal Dadaist humor that happens on Twitter. But when I say Weird Internet I mean that very specifically. I'm talking about almost a return to a folk Internet where there's a place for things that aren't just sanitized of your birthday party that your grandmother can click Like on. People have recently referred to it as like the return of Web 1.0. There was an article recently about that. I don't know if you've seen tilde.club but you sign up—there's a huge waitlist now, like eight thousand people or something on the waitlist—but I think Paul Ford set this up and it's literally just… I don't know if universities do this anymore, but I know they did when I was in school where it's like, "Okay here's your shell account, and here's the Unix server and your username is blahblahblah.edu/~dkazemi" and then you can upload whatever you want on there and host it on there. This is an attempt to recreate that, and if you go to people's sites […] it's got this lo-fi look to it. But beyond that, it's got a lo-fi apparatus as well. One of my friends was saying "I have friends who are completely not technical asking me for help with ssh because they have a tilde.club account and they want to update their site." I consider this Weird Internet, I consider bots, especially non-utility bots to be Weird Internet. Anything that makes you stop in your tracks or surprises you at all I would consider part of that milieu. I just want strange and unexpected beautiful things to happen all the time.

Audience 6: I don't know this for a fact, but it was my impression that that started out as a joke kinda similar to how some of the things you talked about have a comical aspect to them, but Paul was quickly overwhelmed by all these people that wanted to use that site. There were some people that I think were part of the joke, like Matt was talking about being on the inside—

Darius: Right. Some of it's almost like an ironic, hipster, "I'm gonna use ssh in addition to building a bike out of my woodshed."

Audience 6: But then there were all these other people that got in on it because they actually liked the idea of it. Like, "Actually, no, I don't mind ssh'ing to this computer and creating an HTML file and having a web site there." And people that are rediscovering that that was the way that the web started. There's a new generation of people now that are [observing?] this is actually how it began.

Darius: Myself, over the last few years I've been slowly extricating myself from blogging platforms like WordPress. I archived my WordPress site and it exists in static form now but I can't log in, there's no database or anything like that. Instead if I want to create a blog post, I open up a text editor and I type "" and then I start like I used to when I was in 7th grade and making little web pages. So now my essays look more like this. I just write it all by hand in a text editor.

But it also gives me the freedom to do things like embed software and make it way more interactive than it otherwise would've been because on WordPress it's like, "Well can I find a plugin that does the thing that I want? Do I have to write a plugin?" As a technical person, this gives me more freedom. But I think also even if you're not that technical and you're just learning ssh, it can be exhilirating to have— I remember when I was a teenager I think, and I first learned that I could log into a computer remotely somewhere else, log into my home computer from someone else's computer and get files off of it, that was this incredible, exhilirating thing for me not just because of that connection, but also it was this thing that I controlled. I could literally install software on my home computer from someone else's machine, and that was very cool and it felt empowering at the time, and I think you get a whiff of that by joining something like tilde.club as well, even though it's more managed. But it's certainly less managed than Facebook, for example.

Audience 6: I guess what I meant to ask was have you noticed this sort of arc in some of the work you've done where it started out with this initial purpose of doing something maybe with a comic value or whatever, but then like with tilde.club it took on this other meaning. Like you started out with a particular idea but it sort of morphed into something else that had more use than you anticipated.

Darius: Definitely. A great example of that is Museum Bot. I came home from work one day and my wife was already home, and she said, "Hey, I don't know if you saw, the Metropolitan Museum of Art released 400,000 high-resolution images along with metadata about its deep, deep back catalogue." And I as like, "That's really cool!" And she was like, "You might want to do something with that." And I was like, "Yeah, that would be cool to do." And I sat there for a half an hour, forty-five minutes, looking at their collection and figuring out what I could do with it, and I couldn't come up with anything good. I was just like, "I can't think of anything interesting. But you got me all excited so I guess I'll just build the most boring possible degenerate case." Which is it just grabs something at random and tweets it.

At the time I was making it, it was a throwaway. But because it's this autonomous agent that goes out into the world and is sort of consistently surprising. The Met's collection often has, I mean you know, "Petals of a Composite Cornflower Pendant," that's weird; An unlabeled just…vase; a drink tumbler, like for your whisky. I would not have expected to see this stuff in the catalog at the Met. People really liked this, and it also says things about curation. People can look at this and look at what's on display at the Met and go, "Oh. What's on display at the Met is nothing like what a true random sampling of items in the Met's catalog is." So people start to learn things about curation by engaging with this bot, and then for some people it's almost a status thing. It's like, "Oh, I'm signing up for culture." It's like listening to NPR so you can tell your friends that you listen to NPR. "Have you heard the latest Serial episode? Come on."

So there's a lot going on here, and I had zero zero zero idea when I built it. I built it and I had dinner, and I was like okay, I can forget about this forever. And it keeps coming back up with more and more people who are interested in it and want to talk about it and so fort.

Audience 8: I want to go back to your example where you were saying you sort of drifted away from existing platforms and started building your own stuff including blog posts and tied that to the activism stuff that you were talking about earlier. Of course, activists have really embraced the Internet including some of the weird stuff as well. But they will obviously and probably do see the value in moving off platforms like Facebook and Twitter because those are owned by private companies and they exert their control over algorithms, over stuff they censor over their terms of service, and activists don't like that because that gets in their way. On the other hand, as soon as they want to move away they face a different barrier, which is they need to learn how to make this stuff, and a lot of them—the majority of them, probably—don't know how to do this stuff. So what would you say is like…there's obviously kind of a good thing about more people learning about the Weird Internet, but there's also this barrier of having skills to build their own stuff. And also then letting other people know that stuff's out there, because unless its on Facebook very few people will learn that it's there because it's just some little web site.

Darius: To the second point first, I think that's unavoidable. I think you see that analog in just traditional activism as well. As an activist, you both need to be able to go out into a public square and protest and maybe do it by getting a permit and following local laws, or maybe not so much. But you also need a private space in your own home or a building somewhere or a public park or something, where you can just gather and do something a little more private on your own terms as well. So to me, I see that as a loose analog to running your own server and services versus going out into the world. I don't ever want to stop putting my stuff on Twitter as a platform because whether we like it or not, that's sort of like the public forum, or one of them.

On the education gap, I agree with you completely. I think I'm ethically obligated, or morally obligated to teach people how to do this kind of stuff, so I publish the source code for almost everything I do. But publishing source code is like whatever, it doesn't help 99.9% of people. So I write prose tutorials, I run workshops, I run things like Bot Summit, I go speak at places, that sort of thing. That's a personal choice that I make as a creator, to actually give people these tools that've helped me so much in me doing those things, because I like seeing other people's work and what they do with it.

Audience 9: Are you not afraid that at some point there will be so many people doing Weird Internet that it'll stop being weird?

Darius: For a long time I was the only person doing game analytics and then everyone was doing it, and I moved onto Weird Internet. And I suppose if I really feel crowded out I'll just move on to something else.

Audience 10: I noticed that when you talk about your bots, often you refer it's like, "Oh it took fifteen minutes, it took me forty-five minutes." I was wondering how you wish your audience get out from that kind of comment.

Darius: I think I mention that because I consider making a Twitter bot on the same level as most other creative endeavors, so I'm a big fan of— There's a guy who I met recently, Jonathan Mann, he does a song a day on YouTube and has been doing it for many years. He passed the 2000 song mark recently, and much like me not even close to a majority of his stuff is a big hit or anything, but [we] probably have roughly about the same hit rate, about one out of every fifteen things that we make gets some kind of response other than the usual background noise of like, "Oh that's cool."

I have this blog post I wrote called "Thoughts on Small Projects" that goes in detail on this. I think it's very important as a creative person to get to a place where not only have you skills gotten to a certain level where you can create things very quickly, but also your expectations have been brought down as well. I think it's very important to do. I spent until I was maybe 27 years old or something agonizing all the time over whether what I was doing was good enough, and I just sat on so many projects and had so many half-finished things sitting in folders. I don't have half-finished things that sit in folders anymore because I decide that they're finished and I just put them out there. It's been creatively rewarding for me, and I hope that other people can take stuff away from that, too. That's the great irony or big joke, that Museum Bot, which I made in an hour or two is more popular and I get more people saying they liked it and all that sort of stuff than the book that I spent a year writing. I'm still proud of that book, but it just didn't happen to be what was right at the time or whatever. I don't know.

Mostly when I talk about how quick it is, I just want to encourage people passively to do whatever it is they do quickly, whether it's making comics or whatever. I don't mean to showboat "I'm so fast and this is so easy," but also I try to provide the tools to make it easy for people as well. The reason I can do Twitter bots quickly is I have a template, which is a program that I run and it just asks me a set of questions about what software I'm going to need and then I press enter and it generates a placeholder Twitter bot that just tweets "Hi" every hour. Then I don't have to write any of the scaffolding. It's like here's all the scaffolding, now you can just work on the core of it. That saves me significant time on every project as well. But I made sure to publish this and I know other bot makers who use this, and it saves them time as well. So it goes back to that moral obligation that I feel to enable other people to be able to do this stuff.


Help Support Open Transcripts

If you found this useful or interesting, please consider supporting the project monthly at Patreon or once via Square Cash, or even just sharing the link. Thanks.