> oh my gosh: how did I not know about this counting-out system called Yan-Tan...?!
>
> I was looking up something about the nursery rhyme Hickory Dickory Dock, which some people think is a counting-out rhyme, and that led to this British sheep-counting system: https://t.co/NWfJgyicB6
>
> — Laura Gibbs (@OnlineCrsLady), November 15, 2019
This newsletter has not been written by a GPT-2 text generator, but you can now find a lot of artificially created text that has been.
For those not familiar with GPT-2, it is, according to its creators OpenAI (a socially conscious artificial intelligence lab overseen by a nonprofit entity), “a large-scale unsupervised language model which generates coherent paragraphs of text.” Think of it as a computer that has consumed so much text that it’s very good at figuring out which words are likely to follow other words, and when strung together, these words create fairly coherent sentences and paragraphs that are plausible continuations of any initial (or “seed”) text.
The underlying idea isn’t new or especially difficult, and its foundations are well laid out by John R. Pierce in *[An Introduction to Information Theory: Symbols, Signals and Noise](https://amzn.to/32JWDSn)*. The book is full of interesting tidbits about language and structure from an engineering perspective, including why crossword puzzles work.
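Pierce describes Shannon-style approximations of English built from word-following-word statistics, which is the same basic idea GPT-2 scales up enormously. As a minimal sketch of the principle (a toy bigram model, not anything resembling GPT-2's actual architecture, and assuming only the Python standard library):

```python
import random
from collections import defaultdict

def train_bigrams(text):
    """For each word, record which words follow it and how often
    (repeats in the list stand in for higher probability)."""
    words = text.split()
    follows = defaultdict(list)
    for prev, nxt in zip(words, words[1:]):
        follows[prev].append(nxt)
    return follows

def generate(follows, seed, length=10, rng=None):
    """Continue the seed word by repeatedly sampling a plausible next word."""
    rng = rng or random.Random(0)
    out = [seed]
    for _ in range(length - 1):
        options = follows.get(out[-1])
        if not options:  # dead end: the model never saw this word mid-text
            break
        out.append(rng.choice(options))
    return " ".join(out)

corpus = ("the cat sat on the mat and the cat saw the dog "
          "and the dog sat on the log")
model = train_bigrams(corpus)
print(generate(model, "the"))
```

The output is locally plausible (every adjacent word pair occurred in the training text) while globally meandering, which is the toy version of the "fairly coherent sentences" effect described above.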
November 13, 2019 at 08:33AM
The most interesting examples have been the weird ones (cf. HI7), where the language model has been trained on narrower, more colorful sets of texts, and then sparked with creative prompts. Archaeologist Shawn Graham, who is working on a book I’d like to preorder right now, *An Enchantment of Digital Archaeology: Raising the Dead with Agent Based Models, Archaeogaming, and Artificial Intelligence*, fed GPT-2 the works of the English Egyptologist Flinders Petrie (1853–1942) and then resurrected him at the command line for a conversation about his work. Robin Sloan had similar good fun this summer with a focus on fantasy quests, and helpfully documented how he did it.
Circle back around and read this when it comes out.
Similarly, these other references should be an interesting read as well.
November 13, 2019 at 08:36AM
From this perspective, GPT-2 says less about artificial intelligence and more about how human intelligence is constantly looking for, and accepting of, stereotypical narrative genres, and how our mind always wants to make sense of any text it encounters, no matter how odd. Reflecting on that process can be the source of helpful self-awareness—about our past and present views and inclinations—and also, some significant enjoyment as our minds spin stories well beyond the thrown-together words on a page or screen.
And it’s not just happening with text; it happens with speech too, as I’ve written before in “Complexity isn’t a Vice: 10 Word Answers and Doubletalk in Election 2016.” In that case, looking at transcripts actually helped reveal that the emperor had no clothes: so much was missing from the speech that the bare text couldn’t fill in the gaps the way the live delivery did.
November 13, 2019 at 08:43AM
This demo enables forensic inspection of the visual footprint of a language model on input text to detect whether a text could be real or fake.
The rapid improvement of language models has raised the specter of abuse of text generation systems. This progress motivates the development of simple methods for detecting generated text that can be used by and explained to non-experts. We develop GLTR, a tool to support humans in detecting whether a text was generated by a model. GLTR applies a suite of baseline statistical methods that can detect generation artifacts across common sampling schemes. In a human-subjects study, we show that the annotation scheme provided by GLTR improves the human detection-rate of fake text from 54% to 72% without any prior training. GLTR is open-source and publicly deployed, and has already been widely used to detect generated outputs.
From pages 111–116; Florence, Italy, July 28 - August 2, 2019. Association for Computational Linguistics
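GLTR’s core statistic is simple: for each word in a text, ask how highly the language model ranked that word among its predictions given the preceding context. Machine-sampled text tends to stick to top-ranked words; human writing reaches further down the list. GLTR itself uses GPT-2’s own probabilities; as a rough illustration of the rank idea only, here is a toy version where a bigram frequency table stands in for the language model (my stand-in, not GLTR’s method):

```python
from collections import Counter, defaultdict

def train(text):
    """Bigram frequency table: for each word, a Counter of the words
    observed following it."""
    words = text.split()
    table = defaultdict(Counter)
    for prev, nxt in zip(words, words[1:]):
        table[prev][nxt] += 1
    return table

def token_ranks(table, text):
    """For each word after the first, its rank among the model's
    predictions for that position (1 = the model's top choice;
    None = never seen after the previous word)."""
    words = text.split()
    ranks = []
    for prev, nxt in zip(words, words[1:]):
        ordered = [w for w, _ in table[prev].most_common()]
        ranks.append(ordered.index(nxt) + 1 if nxt in ordered else None)
    return ranks
```

A passage whose every word is the model’s top pick scores all 1s and reads “machine-like”; frequent high ranks (or misses) suggest the surprising word choices typical of human writing. GLTR presents exactly this kind of per-word coloring, just computed with a far stronger model.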
For years, tech companies have relied on a rhetorical sleight of hand. It’s not working anymore.
Medieval scholar: "Sorry, folks, 'proto-Romance language' is not a thing."
The Voynich manuscript is a famous medieval text written in a mysterious language that so far has proven to be undecipherable. Now, Gerard Cheshire, a University of Bristol academic, has announced his own solution to the conundrum in a new paper in the journal Romance Studies. Cheshire identifies the mysterious writing as a "calligraphic proto-Romance" language, and he thinks the manuscript was put together by a Dominican nun as a reference source on behalf of Maria of Castile, Queen of Aragon. Apparently it took him all of two weeks to accomplish a feat that has eluded our most brilliant scholars for at least a century.
"Meritocracy" was coined as satire; the messaging for and against Medicare for All; and Dutch economic historian Rutger Bregman.
A college admissions scandal has highlighted what people refer to as "the myth of meritocracy." But actually, meritocracy itself is a myth. This week, On the Media looks at the satirical origins of the word and what they tell us about why the US embraces it. Plus, the messaging for and against Medicare for All, as well as a historical look at why we don't have universal healthcare. And economic historian and Tucker Carlson antagonist Rutger Bregman.
Loved hearing about the early origins of the meaning of meritocracy. Obviously we haven’t come close to helping level the playing field.
Seeing “crisis” and “creators” so close together in this text, I can’t help thinking the neologism “crisis creators” is what we should be talking about instead of “crisis actors”—a phrase that seems to have been coined by exactly those crisis creators!
The Atlas of Endangered Alphabets is a collection of “indigenous and minority writing systems”, gathered in hopes of documenting these scripts and reviving interest in them. From the about page:
In 2009, when I started work on the first series of carvings that became the Endangered Alphabets Project, times were dark for indigenous and minority cultures. The lightning spread of television and the Internet were driving a kind of cultural imperialism into every corner of the world. Everyone had a screen or wanted a screen, and the English language and the Latin alphabet (or one of the half-dozen other major writing systems) were on every screen and every keyboard. Every other culture was left with a bleak choice: learn the mainstream script or type a series of meaningless tofu squares.
Yet 2019 is a remarkable time in the history of writing systems. In spite of creeping globalization, political oppression, and economic inequalities, minority cultures are starting to revive interest in their traditional scripts. Across the world, calligraphy is turning writing into art; letters are turning up as earrings, words as pendants, proverbs as clothing designs. Individuals, groups, organizations and even governments are showing interest in preserving and protecting traditional writing systems, or even creating new ones, as a way to take back their cultural identity.
The earliest fragments of English reveal how interconnected Europe has been for centuries, finds Cameron Laux. He traces a history of the language through 10 objects and manuscripts.
They suggest not just inadequate manners or polish, but inadequate thought.
An open letter to newsrooms everywhere
In this episode, Haley interviews Natalia Komarova, Chancellor's Professor of the School of Physical Sciences at the University of California, Irvine. Komarova talks with Haley at the Ninth International Conference on Complex Systems about her presentation, which explored using applied mathematics to study the spread of mutants, as well as the evolution of popular music.
There’s some interesting-sounding research described here. Be sure to circle back around to some of her papers.
“If he'd gone to some proper cockney, like me, we'd have got a bit more background.”