<?xml version="1.0"?>
<?xml-stylesheet href="/transform" type="text/xsl"?>
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" xmlns:bibo="http://purl.org/ontology/bibo/" xmlns:bs="http://purl.org/ontology/bibo/status/" xmlns:ci="https://vocab.methodandstructure.com/content-inventory#" xmlns:dct="http://purl.org/dc/terms/" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:xhv="http://www.w3.org/1999/xhtml/vocab#" xmlns:xsd="http://www.w3.org/2001/XMLSchema#" lang="en" prefix="bibo: http://purl.org/ontology/bibo/ bs: http://purl.org/ontology/bibo/status/ ci: https://vocab.methodandstructure.com/content-inventory# dct: http://purl.org/dc/terms/ foaf: http://xmlns.com/foaf/0.1/ rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns# xhv: http://www.w3.org/1999/xhtml/vocab# xsd: http://www.w3.org/2001/XMLSchema#" vocab="http://www.w3.org/1999/xhtml/vocab#" xml:lang="en">
  <head>
    <title property="dct:title">Why We Care About Plagiarism</title>
    <base href="https://doriantaylor.com/why-we-care-about-plagiarism"/>
    <link href="23d7e373-91f6-45ad-b8e8-1d1266e5637d" rev="ci:document"/>
    <link href="elsewhere" rel="alternate bookmark" title="Elsewhere"/>
    <link href="this-site" rel="alternate index" title="This Site"/>
    <link href="http://purl.org/ontology/bibo/status/published" rel="bibo:status"/>
    <link href="" rel="ci:canonical" title="Why We Care About Plagiarism"/>
    <link href="person/dorian-taylor#me" rel="dct:creator" title="Dorian Taylor"/>
    <link href="person/dorian-taylor" rel="meta" title="Who I Am"/>
    <link about="./" href="3f36c30c-6096-454a-8a22-c062100ae41f" rel="alternate" type="application/atom+xml"/>
    <link about="./" href="this-site" rel="alternate"/>
    <link about="./" href="elsewhere" rel="alternate"/>
    <link about="./" href="e341ca62-0387-4cea-b69a-cdabc7656871" rel="alternate" type="application/atom+xml"/>
    <link about="./" href="f07f5044-01bc-472d-9079-9b07771b731c" rel="alternate" type="application/atom+xml"/>
    <link about="verso/" href="3f36c30c-6096-454a-8a22-c062100ae41f" rel="alternate" type="application/atom+xml"/>
    <link about="verso/" href="this-site" rel="alternate"/>
    <link about="verso/" href="elsewhere" rel="alternate"/>
    <meta content="What did Claudine Gay actually do?" name="description" property="dct:abstract"/>
    <meta content="2024-01-19T02:54:06.270818+00:00" datatype="xsd:dateTime" property="dct:created"/>
    <meta content="2024-01-19T04:32:32+00:00" datatype="xsd:dateTime" property="dct:modified"/>
    <meta about="person/dorian-taylor#me" content="Dorian Taylor" name="author" property="foaf:name"/>
    <meta content="summary" name="twitter:card"/>
    <meta content="@doriantaylor" name="twitter:site"/>
    <meta content="Why We Care About Plagiarism" name="twitter:title"/>
    <meta content="What did Claudine Gay actually do?" name="twitter:description"/>
    <object>
      <nav>
        <ul>
          <li>
            <a href="//dorian.substack.com/p/so-long-farewell-auf-wiedersehen" rev="dct:references" typeof="bibo:Article">
              <span property="dct:title">So Long, Farewell, Auf Wiedersehen, Good Night</span>
            </a>
          </li>
          <li>
            <a href="23d7e373-91f6-45ad-b8e8-1d1266e5637d" rev="ci:document" typeof="qb:Observation">
              <span>urn:uuid:23d7e373-91f6-45ad-b8e8-1d1266e5637d</span>
            </a>
          </li>
        </ul>
      </nav>
    </object>
  </head>
  <body about="" id="E6grDw72M11alVdILMkRCK" typeof="bibo:Article">
    <p>I was occupied over the last several weeks, <a href="ibis-gets-a-tune-up" rel="dct:references" title="IBIS Gets a Tune-Up">as I chronicled elsewhere</a>, and doubly so with travel over the holidays, so I missed most of the drama around the coordinated right-wing assault against the presidents of both Penn and Harvard (and <abbr>MIT</abbr>, but she seems to be hanging on), effectively shepherded into voicing an insufficiently full-throated support of Israel's ongoing prosecution of its war in Gaza during a <abbr>US</abbr> congressional hearing. This performance was enough to take down Penn's president <a href="https://en.wikipedia.org/wiki/Liz_Magill" rel="dct:references">Liz Magill</a>, but not <a href="https://en.wikipedia.org/wiki/Claudine_Gay" rel="dct:references">Claudine Gay</a>, the president of Harvard. As a result, the <a href="https://en.wikipedia.org/wiki/Ratfucking" rel="dct:references">ratfuckers</a> switched tack and went after her scholarship.</p>
    <aside role="note">
      <p>My own understanding of this conflict is that <a href="https://en.wikipedia.org/wiki/Hamas" rel="dct:references">the terrorist organization Hamas</a>, which does not represent most Palestinians, launched what has been characterized as <a href="https://www.foreignaffairs.com/israel/catastrophic-success-hamas" rel="dct:references">a <q>catastrophically successful</q> attack</a>&#x2014;insofar as they caught the Israeli security apparatus completely pants-down, and were likely much more effective than they had anticipated. During this <a href="https://en.wikipedia.org/wiki/2023_Hamas-led_attack_on_Israel" rel="dct:references">prolonged assault on October 7 2023</a>, Hamas militants raped, tortured, and shot their way through over a thousand Israeli civilians, and took another 200-plus more hostages with them back to Gaza. The sitting Israeli government, which does not represent most Israelis (let alone, presumably, most Jewish people) responded in turn by leveling Gaza (or what was left of it from the last go-round), displacing nine-tenths of its two-million-strong population, and killing 25 times as many civilians (and counting) as the Hamas attack to which it putatively responds. Knesset members have also <a href="https://www.nbcnews.com/news/world/gaza-nakba-israels-far-right-palestinian-fears-hamas-war-rcna123909" rel="dct:references">made open calls for another nakba</a> (Arabic for <q>catastrophe</q>, referring initially to <a href="https://en.wikipedia.org/wiki/Nakba" rel="dct:references">the 1948 expulsion</a> of three quarter of a million Arabs from towns in what became Israel) <a href="https://www-mako-co-il.translate.goog/news-military/2024_q1/Article-44f6b501cf5dc81026.htm?sCh=3d385dd2dd5d4110&amp;pId=1898243326&amp;_x_tr_sl=auto&amp;_x_tr_tl=en&amp;_x_tr_hl=en&amp;_x_tr_pto=wapp" rel="dct:references">publicly expressed designs on settling Gaza</a> as they have been doing to the West Bank for decades. The stated military objective of <q>eliminating Hamas</q>, furthermore, has been judged by analysts as ranging from poorly-defined to infeasible on its own terms, and ultimately incompatible with rescuing the remaining hostages. Finally, there is a personal incentive on the part of prime minister Benjamin Netanyahu to continue prosecuting this war indefinitely, because the moment it's over, for mostly-unrelated matters, there's a solid chance he goes to prison.</p>
      
      <p>As I am naught but a simple parenthesis farmer, I don't have much else to say about the conflict itself, other than unconditional support for one side or the other (to the extent that <q>side</q> even accurately delineates the actors) leads to some pretty grim conclusions. The most-educated people in North America appear to understand this subtlety, which is what the university presidents were ultimately (and disingenuously) invited to respond to. It is likely&#x2014;at least in private&#x2014;members of the United States Congress understand it as well.</p>
    </aside>
    <p>Unsatisfied with the outcome of the congressional hearing, the would-be character assassins, <a href="https://www.wsj.com/articles/how-we-squeezed-harvard-claudine-gay-firing-dei-antisemitism-culture-war-a6843c4c" rel="dct:references">figureheaded by one Christopher Rufo</a>, went digging through Dr. Gay's publications, looking for material to support a narrative that she (a Black woman, if you had somehow managed to get through the last month without encountering that fact) held her position for reasons other than merit. The most substantial evidence they found was a paragraph in her PhD dissertation which was lifted from a paper by a professor and colleague in her doctoral program (and which inverts the operative verb) without proper attribution.</p>
    <aside role="note">
      <p>This is about where I caught up, the day of, or perhaps the one after Gay's resignation. My question was something on the order of <q>okay but what did she actually <em>do</em>?</q> In response, <a href="https://bsky.app/profile/literalbanana.bsky.social/post/3ki3dp63ize2s" rel="dct:references">a helpful anthropomorphic banana directed me</a> to <a rel="dct:references" href="https://freebeacon.com/campus/this-is-definitely-plagiarism-harvard-university-president-claudine-gay-copied-entire-paragraphs-from-others-academic-work-and-claimed-them-as-her-own/">the right-wing blog post</a> that featured animated highlights of the ostensible smoking gun.</p>
    </aside>
    <p>An entire paragraph which is mostly word-for-word identical to a paragraph in another person's work, without quotation marks around it, is unequivocally bad optics&#x2014;especially when the offending document is the doctoral dissertation of the leader of the most august academic institution on the continent. It is also a glaring hypocrisy: an affront to every student who has ever had their knuckles rapped&#x2014;or worse&#x2014;for doing the same. If you look at the actual <em>content</em> of the text which one of the authors of the original avowed was <q>technically plagiarized</q> (a detail Gay's opponents gleefully latched onto, but not before the customary excision from its context) you will notice it is a piece of pure expositional boilerplate. It has the tone and rhetorical weight of an instruction manual for a dishwasher&#x2014;Stephen Voss, a coauthor of the original paper, <a href="https://kentuckylantern.com/2023/12/21/lessons-from-a-lynch-mob/">basically said as much.</a></p>
    <p>Here is where I enter with my motivation for entertaining this subject. About twelve years ago, <a href="la-forme-de-flaner" rel="dct:references" title="La Forme de Fl&#xE2;ner">I wrote on my personal website</a> what I consider to be a B-minus meditation on how a preoccupation with <q>efficiency</q> was a suboptimal frame for the quaternary sector, because results tended to be <a href="https://en.wikipedia.org/wiki/Stochastic_process" rel="dct:references">stochastic</a> and <a href="https://en.wikipedia.org/wiki/Fat-tailed_distribution" rel="dct:references">fat-tailed</a>&#x2014;meaning that both the value and the timing of outcomes were random. Since there is no meaningful way to make a process like that more <q>efficient</q>, perhaps we, as information-sector professionals, should stop obsessing over efficiency, and focus on other values instead. Three months later, I get messages from no fewer than three different people&#x2014;unbeknownst to each other&#x2014;notifying me that a version of my article, with names and details changed, <a rel="dct:references" href="https://uxmag.com/articles/the-flaneur-approach-to-user-experience-design">had just been published in an industry magazine</a> by an aspiring thought leader.</p>
    <p>My partner at the time was an instructor at a private college that mainly focused on university prep for international students, meaning that she dealt mainly with kids from all over the world that were somewhere in between high-school-aged and freshman. It was her Sisyphean task to drill into her students the importance of quoting their sources, because, as she would tell them (I paraphrase), <q>I'm interested in <em>your</em> ideas, and if you don't quote your sourced text, I can't tell which ideas are yours.</q> More to the point, though, <em>not</em> doing so contravened the school's academic code&#x2014;as it presumably does with all of them&#x2014;and would result in disciplinary action. <em>She</em> was the one who took it upon herself to reach out to this publication that they had published an article that had been plagiarized, and their response was (something on the order of) while there are similarities in both the structure and overarching argument, the article they published <em>can't</em> be plagiarism, because it doesn't reproduce any passages verbatim.</p>
    <aside role="note">
      <p>Again, three different people who didn't know each other reached out to me about the suspicious similarities between the two articles.</p>
    </aside>
    <p>This is why I felt impelled to write about this event: this would-be thought leader (at least, convincingly enough to those who brought the matter to my attention) used my article as scaffolding to appear clever and worldly in a trade publication (while diluting the message into a platitude fit for a motivational poster), while Claudine Gay recycled a few lines of text that amount to no discernible argument&#x2014;not even so much as a witty turn of phrase&#x2014;at all. One of these acts is an inexcusable crime of plagiarism, while the other is ostensibly fair game.</p>
    <p>I want to challenge the notion that sequences of words are the only things that can be plagiarized. Rather, there are semantic constructs all up and down the scope of a work that are amenable to copying. Consider&#x2014;mainly because my brain is insisting on using this as an example and can't think of any others&#x2014;<a href="https://en.wikipedia.org/wiki/Ran_(film)" rel="dct:references">Akira Kurosawa's film <em>Ran</em></a>, which is more or less <a href="https://en.wikipedia.org/wiki/King_Lear" rel="dct:references">Shakespeare's <em>King Lear</em></a> transposed to feudal Japan. Kurosawa did not try to claim otherwise (though interestingly, he learned about and read the play only <em>after</em> he had started, and drove his project in its direction), but imagine if he <em>did</em>. <q>Shakespeare? Never read him. Total coincidence.</q> Who would buy that?</p>
    <p>Coincidences happen, and it's not impossible that two individuals can come up with the same informational structure with no prior contact. (This, after all, is what Kauffman's Adjacent Possible is about.) I'm going to assert, though, that the explanatory power of coincidence drops off the more overlapping detail there is. Words, for instance, are concrete symbolic objects arranged in a definite topological structure, and plus there are a lot of them, so the chance that you're going match more than a few in a row in any two documents at random is going to be pretty slim&#x2014;and slimmer the more words you add to the sequence. If you get rid of filler words and abstract out to what the WordNet people call <em>synsets</em> (which are what they sound like: synonyms grouped under an identifier), you could probably match longer strings that are still defensible as coincidental. This is because the way language works is that you have to use roughly the same words in roughly the same order if you want to communicate roughly the same things, so this isn't that surprising. Indeed, if you're trying to write something like a legal or scientific document, there's going to be a sort of paint-into-a-corner effect where you <em>have</em> to use certain syntactical constructs to say what you're trying to say, because your hyperspecialized language won't have any synonyms.</p>
    <aside role="note">
      <p>Melodies are worth considering because there aren't that many musical notes in total, and likewise not many valid ways (relative to <em>in</em>valid ways) you can configure a <em>sequence</em> of notes, especially when the set of possible melodies is compressed under the operation of key transposition. It's not <em>that</em> uncommon for musicians to come up with a tune, only to find out that it's already part of some other song. In other words, the smaller the symbol alphabet&#x2014;along with, moreover, rules about what symbols are allowed to follow others&#x2014;the more likely it is that longer sequences of symbols will collide.</p>
    </aside>
    <p>The plagiarism question arises when a work contains features found in an earlier work arranged in a configuration too implausible to be a coincidence. Word sequences are an easy target because everybody agrees on what a <em>word</em> is. If you wanted to claim plagiarism of some other construct, you'd need to analyze the situation using some kind of surrogate model. Then, of course, you can have an argument about whether the model is valid, or, if you're a dodgy trade publication, you can just decline to hear the evidence.</p>
    <p>I'm not terribly interested in the psychology of plagiarism&#x2014;not being a fan of remote psychoanalysis in general&#x2014;but it should be pretty uncontroversial to surmise that a plagiarist necessarily recognizes some instrumentality in the original work, and figures that nobody will notice they copied it (or otherwise that copying is legitimate). As for not proactively citing the source, maybe they don't believe the original author deserves the credit; maybe they're concerned their take will look weaker in comparison. <em>Or</em>, perhaps they judge that the content <em>itself</em> is unworthy of a cite. It really is quite remarkable just how <em>dull</em> the passages Claudine Gay is accused of lifting are. It's almost as if she pasted them in with the intent of changing them into her own language&#x2014;which she <em>did</em>, here and there&#x2014;but ultimately decided she couldn't be bothered.</p>
    <aside role="note">
      <p>In 1997, when Gay wrote her dissertation, the technical, and moreover the <em>economic</em> feasibility of storing, indexing, and searching the entire corpus of all text ever written in English, was several <a href="https://en.wikipedia.org/wiki/Moore's_law">Moore doublings</a> away. The idea that you would eventually be able to just <em>scan</em> a document for text that matches <em>any</em> other document was probably not on many people's radar back then. It was probably a reasonable expectation that as long as you kept your borrowings to unremarkable boilerplate, nobody would notice. Even now, <a href="https://www.theatlantic.com/technology/archive/2024/01/plagiarism-war-claudine-gay/677020/">it's <em>still</em> an expensive, albeit monopoly-mediated proposition</a>, so the only people who are going to check up on a 27-year old document are those who are sufficiently motivated. And sure, this revelation would probably have been disqualifying if it had been caught on the front end, but now it has to be weighed against a nigh three-decade-long career. Moreover, people are bound to change their behaviour over time: nowadays, if you wanted to rip off a substantial chunk of somebody's work without getting caught, you could just ask ChatGPT to paraphrase it first. A service like <a href="https://en.wikipedia.org/wiki/Turnitin">Turnitin/iThenticate</a> is going to have a <em>lot</em> more trouble with that.</p>
    </aside>
    <p>While it's poor scholastic hygiene, terrible optics, and a galling hypocrisy, what the plagiarized material is <em>not</em>&#x2014;at least to <a href="https://en.wiktionary.org/wiki/Gell-Mann_Amnesia_effect" rel="dct:references">these admittedly untrained eyeballs</a>&#x2014;on account of its stultifying blahness, is evidence that Claudine Gay doesn't know what she's talking about. Nowhere does it try to be smart, or clever, or interesting; it's the kind of boring boilerplate one would nowadays consign to ChatGPT. Contrast this with one Neri Oxman.</p>
    <p>In a somewhat predictable twist, because hedge fund billionaire&#x2731; Bill Ackman, who was agitating for the removal of Gay (and threatening to rescind a large donation to Harvard's endowment over the matter), happens to be married to Oxman (a <abbr>MIT</abbr> Media Lab/<abbr>TED</abbr>/Edge/Epstein alum), <a href="https://www.businessinsider.com/neri-oxman-plagiarize-wikipedia-mit-dissertation-2024-1" rel="dct:references"><em>her</em> academic record found itself facing scrutiny.</a> And what a find! Wikipedia! <a href="https://twitter.com/sTeamTraen/status/1745937835121631369" rel="dct:references">Her own students!</a> Paragraph after paragraph unceremoniously cut and pasted! Here is somebody who has very much made a career as a showperson, and <em>unlike</em> Claudine Gay, the quantity&#x2014;and critically, the <em>content</em>&#x2014;of the plagiarism makes me immediately wonder aloud if putting on a slick performance is all Oxman knows how to do.</p>
    <aside role="note">
      <p>&#x2731;&#x2009;It is quite a feat that Ackman made it to billionaire despite having such a reputation for losing money. Funnily, I was aware of both him and Oxman but had no idea they were married to each other until this whole debacle.</p>
    </aside>
    <p>This, I propose, is one leg of why we <em>really</em> care about plagiarism. If you produce some sort of idea, or insight, or synthesis, or explanation, or argument, or story, that's a sign of <em>your</em> brain working; <em>you</em> understanding something. If you copy somebody else's, where's the evidence for <em>you</em>? How do we know <em>you're</em> competent? (Again, what makes the Gay plagiarism so remarkable is that it contains no discernible insights.) The other leg one might be inclined to describe in terms of intellectual property, but I think it's subtler than that. When you copy somebody's work (again, idea, insight, etc.) without attribution, you're diverting attention and accolades to yourself that would otherwise have gone to them. So in addition to harming the true progenitor and defrauding your audience about your own capabilities, accomplishments, and potential, you're also depriving <em>them</em> of access to the genuine article.</p>
    <aside role="note">
      <p>Again, consider the Kurosawa-Shakespeare hypothetical: plagiarism only has a hope in hell of working if your target is relatively obscure.</p>
    </aside>
    <section>
      <h3>Coda: Gamergate II</h3>
      <p>Taking the actual, usually fairly innocuous conduct of people you don't like and having a public conniption about <q>ethics</q> is unambiguously the playbook of <a href="https://en.wikipedia.org/wiki/Gamergate_(harassment_campaign)" rel="dct:references">GamerGate</a>. The difference is that while the original GamerGate was quasi-organic, this time around it's premeditated. Rufo himself announced in advance he would be doing this, and then crowed about his victory over Gay in the Wall Street Journal and said to expect more of the same. Institutional leaders have had a decade to prepare for this brand of onslaught, but they don't appear to have done much of it. I bet they're scrambling now.</p>
      <p>Again, the GamerGate strategy is to take something that you actually <em>did</em>, no matter how banal, and make a huge public fuss about it. These people will make a parking ticket sound like a double homicide. All they care about is scoring the hit; they aren't bothered if you don't deserve it, and have no compunction about ruining your career or your life. You also can't hit them back in any meaningful way, because they're all avowed scumbags&#x2014;it's what they call mudwrestling a pig.</p>
      <p>The way you protect yourself from gamergaters is not to have a spotless record&#x2014;because they will keep digging until they find even the faintest blemish&#x2014;but rather to make the spots not matter. Something Harvard potentially could have done to at least partially neutralize these attacks&#x2014;I'm spitballing here&#x2014;is stage a <q>technically-plagiarism jubilee</q>&#x2014;that is, a one-time amnesty for people who have copied boring bits of academic boilerplate without attribution: <em>technically</em> plagiarism, but not of the species that defrauds the audience or usurps any great idea or effort. <a href="https://bsky.app/profile/morungos.bsky.social/post/3ki3g3dzhbp2d" rel="dct:references">As a mutual on the socialnets said</a>, it was a teaching moment, and Harvard failed.</p>
    </section>
    

  </body>
</html>
