Mapping New Activity in My Neighborhood Post-Covid-19

Posted on December 11, 2021 by Kai Prenger

Introduction – Optimism

Notionally, and for some time now, I’ve planned on memorializing the changes in my neighborhood’s public space use after the start of the COVID-19 in New York City in March 2020. Like most communities, Sunnyside Queens spontaneously organized activities in public spaces to compensate for virus transmission risks indoors during times of high infection and hospitalization rates. Religious services, organized sports leagues/tournaments, exercise classes and nighttime teenager socializing blossomed in Torsney/Lou Lodati Park across from my apartment building over 2020, all of which persists throughout 2021, and all of which remains unsanctioned by a governmental or non-governmental organization. While I understand the role government can play in organizing efforts in large populations, autopoietic social activities strike me as immanently interesting, and the thought that a community with a diverse set of heritages and social needs unaddressed could coordinate usage of shared space during government shutdown of indoor buildings reassured me emotionally. In the depth of a crisis, the idea that my neighbors could collaborate together to help each other helped sustain my confidence in human beings during a period of time it was easy to feel unsure.

While the format and medium for my extended project were unspecified, I took the mapping praxis project as an opportunity to try GIS as a potential tool for communicating the changes I saw in my neighborhood. After all, a map seemed an obvious way to depict evens in space. My initial plan for the praxis assignment focused on two types of events: regular, recurring events distributed through different dayparts of my own devising, and mobile events such as protests in the summer of 2020. While I’ve worked with geographical coordinates and plotting in the past, I wanted to venture out slight into drawing my own layers, as a nod to the subjective nature of the project as a whole.

Mapping with QGIS – Idealism, Disappointment, Naivety

My choice of tools for mapping was based on a varying ideological commitment to open source software over the last twenty years of my life. QGIS was only open source tool I found on the list of recommended tools provided for the purposes of this praxis project that didn’t require Javascript (Leaflet), a language I’m familiar with but far from expertise. At the time, I was using FreeBSD at home, so installing the software was handle by running `pkg install qgis-ltr`

in the terminal. Once I had QGIS installed, I found Klas Karlsson’s superlative Youtube channel had a reasonable enough primer to get me up and running.

My choice to use free and open source software for my praxis assignment caused difficulties in executing what I envisioned for three reasons. The first was there were some rough edges in storing polygon shapes while saving map layers that were hard to troubleshoot in a graphical user interface this complex. For instance, I had trouble saving both temporary layers as perminent layers.

Error while trying to convert a temporary scratch layer to a permanent one

Because I was saving to a simple flat file database (SQLite), I guessed that the error above related to some permissioning set in the package provided in the FreeBSD repositories, though I couldn’t find any support channels to resolve this problem. I had to redraw polygons a quite a few times and in the right order of operations to save them as layers on my map. My second hurdle came when I wanted to save my map. The purpose for saving the map data was two fold: first, it would be handy to continue my work on another computer (a laptop in this case) and secondly, I wanted ideally to have my mapping data available to others. However, I wasn’t able to save my data, no matter how much I futzed with the permissions on the files specified in the error message below:

My third complication in using QGIS to render my project as I wanted can be attributed to either naivety if we’re being generous, and foolishness if we’re not. My research ended at the words “open source”, which meant I didn’t realize that QGIS “only” supports static maps. The problem with this limitation given my project is that it would take considerable effort to connect more maps at more than one zoom level, a necessity for plotting the marches/protests in my neighborhood and relating that to the park across from my apartment building. More on this limitation later. In the end, I compiled several maps plotting out various regular activities in the park for different dayparts which you can review below:

Maps showing (1) the contours of the park (2) the different subsections – highlight on the space where community activities happened (3) a version of weekday activity (4) a version of weekend activity (5) late night / early morning activities

With all of these challenges, there was recourse in the Grad Center community to learn and grow in terms of learning the tools of the trade. While the GIS workshop this semester focused what I might call quantitative approach to mapping (i.e. taking spacial data and plotting it on a map), I found the conversations with the much smaller DARC + Sound Studies working group a more productive space to explore how a seasoned GIS scholar might approach my project. When presenting my struggles with this praxis project there, attendees advised me to understand the relative strengths of software used in mapping. QGIS is extremely powerful for creating plot points, geometry and layers. These scholars recommended exporting the spacial data and creating the interactivity in either ArcGIS (various formats) or even using a tool like Leaflet (GeoJSON is best). Unsurprisingly, in mapping picking the right tools for the job should be considered above some sort of ideological purity test (loyalty to free and open software). The reality is, mapping is a complex topic. Look no further than the three tiered top GUI menu that awaits you when you launch QGIS among many other interface elements presented to a new user.

Merely the top tiered menus available in QGIS

In spite of the difficulties I faced in this praxis assignment, I’m not entirely sure that traditional mapping satisfies the subjective nature of representing change in space as geographically small as a neighborhood. Perhaps a SpecLab inspired visualization trying to demonstrate the sociability of each type of activity in the Torsney Park would be more insightful. For instance, a cookout might be heavily social, whereas an exercise class or religious service may not. Instead, what I produced in these maps might be better suited as tabular data with type activities (exercise, religious, sports, socializing), a categorical dimension like time of day and day of the week and age of attendees.

Data as Digital Material for Subjective / Embodied Knowledge

Posted on December 7, 2021 by Kai Prenger

During this semester, we’ve understood data to mean numerical facts and statistics. Scholars can represent those descriptive numbers and aggregations along two dimensional coordinate systems and maps, communicating an understanding of the texts or medium under study by abstracted details into summaries. Contributions like Manovich’s Exploring One Million Manga Pages with Supercomputers and HIPerSpace “ use visualization and/or mathematical models to describe the space of possible and realized variations” by clustering the amount of gray scale or measuring the level of detail in illustration among 1,074,790 pages. Others employ methods like distant reading to process words and sentences in large corpora as way of confirm the coherency of writerly social characteristics, and ponder the examples which evade their models (see So and Roland’s stellar Race and Distant Reading). These flavors of analysis view data as a means push away from subjective knowledge of artistic output, proposing dimensions and measures as appropriate tools for grappling with humanities as scholarly sport. The quantization champions of digital humanities offer mostly rigorous and often legitimate intellectual insight to their subjects of study. Still, there is a more capacious views on data, a more digitally native observation, beyond the statistical definitions we’ve been exposed to this semester.

CUNY’s own Kevin L. Ferguson stands as starting point for many members of this minority group of data “analysts.” Looking at his frequently updated tumblelog Film Visualization, it’s easy to mistake his methods as art (in fact, he courted some controversy when an artist appeared to ape his methods). As a part of his scholarly work, Ferguson produces summed frame visualizations. Screenshots of a movie are superimposed on themselves by loading them into the open source medical imaging software ImageJ, provided by the NIH.

Kevin L. Ferguson, montage of the summed frames of 54 films produced by Walt Disney Animation Studios, 1937–2014

While comparisons on the corpus and categorical level are made by “automatic process which reveals otherwise unconscious information about film texts” (DHQ: Digital Humanities Quarterly: Digital Surrealism: Visualizing Walt Disney Animation Studios ) what differentiates these summed frame ouputs from more common visualization methods is two fold. Firstly, this method takes digital data, in the form of images, as material from which to create a subjective understanding of the objects under study. While there are numerical transformations involved in the production of these images, the artifacts generated require context and knowledge of the specific films under analysis, and are not summarized by statistics. Contrast, color and framing of subjects can be determined for each movie, or for the movies of a corpus, but the visualizations themselves are only a component in the over all conclusions one can draw. Secondly, these summed frames suggest a mode of experiencing or “reading” that is more embodied than typical distance reading. The scholar may see a perpondance of close-ups in a given corpus rather than talling the number of such shots, or may observe the lack of vue variation in a color film.

Another digital humanist with an expansive view of data and its expression is Kim Brillante Knight. Her project “Danger, Jane Roe!”, described here, cleverly grounds itself in a tradition of feminist praxis. Knight fashioned embrodery of reproductive anatomy undergirded by LilyPads, a flavor of Arduino microcontroller for wareable technology. Little LEDs light up in the decoration based on how many #prolife hashtags are posted on Twitter over a specified period of time. Knight links the projects to many linages of practice.

The adoption of decorative textiles, an artform fogged by traditional modes of feminitity, for a politically potent topic is one example. Leaving the application stitching visible by using a contrasting color with the reproductive organs “can make explicit the workings of a circuit” more visible by implication, implying “polarity, connectivity, and flow” that most digital hardware products obscure. A key aspect emphasized in the wearing this visualization on the body is to “remov[e] data visualization from the screen or page…relocates discourse around reproductive justice onto the site of legislative inscription—the body that may be affected by pregnancy.” If you read the article linked above, you’ll be treated to a thick fabric of theory and practice that legitimizes what could be glossed as an non-academic pursuit if you aren’t considerate or don’t read the whole article. As it relates to an expansive sense of data, we can appreciate an embodied experience of data that has it’s origin in technology on the screen, but seeks to escape it. Once can imagine a version of this wareable, perhaps untenable for long periods of usage, that vibrates when a word from a hashtag dictionary is posted. Imagine that as an embodied experience.

Writing about a large sense of what data can mean in a digital humanities project leaves to invigored by how many potential new modes and methods are left underexplored in a field. Digital humanities, in whatever manifestation or metaphor of organization, seems to experimentation and not wedded to received ideas of scholarship, which I find appealing.

Proposal for Digital Archive on Mainframe Computers

Posted on November 22, 2021 by Kai Prenger

I’m proposing the creation of a digital archive for textual and visual material related to mainframe computers. As a part of the curation process, we will perform glitches or additional manipulations of the material as a method interrogate the assumptions we have about business computing and mainframe computers.

Both minicomputers and super computers are well understood in academia. Digital humanities emerged in a period when computing costs dropped precipitously through innovations related to the microprocessor. Scientific computing relied on supercomputers addressed on complex mathematical calculations. Although its birth place can be traced to the academy, mainframe computers mostly found application in business contexts. Given its pedigree, mainframe computing is an under explored topic in digital humanities. Yet concepts made popular by mainframe computing live on in today’s software infrastructure in the form of batch processing, multitenancy in systems, timesharing of compute / storage resources and transactional databases. The age of Big Iron also continued the contribution of women to information technology often made less visible than those made by men. For example, the theory of machine-independent programming languages and the creation of FLOW-MATIC language by Grace Hopper, which was extended as COBOL, the principal program language used program mainframe computers.

This project will handle subject matter germane to the many facets and specters of digital humanities. Using frameworks like Wax from the minimal computing group ensures some longevity of the archive in question. Mainframe computers continue to power the majority of credit card transaction processing. The prominence of this type of hardware in governmental payment process came to light when stimulus checks weren’t issued in a timely manner during the pandemic. Maintenance (or lack thereof) of our critical technology is a major concern of digital infrastructure scholarship. Historical research will be critical to reconstruct a cultural particularities of time and place. It would also be advisable to

I am also keen on inviting collaborators with different skill sets. Some may want to work through the issue of cataloging, while others write essays explaining how mainframes function, what work they do or what the outputs are like. If we want to take a deformative approach to the archive, we could have someone coming from a design perspective (Adobe tools, PhotoMosh), another from an informed naif (data bending with Audacity/converted text file), while a third collaborator could deploy an algorithmic approach (software libraries like glitch-this or pixelsort). Regardless of how we approach the archive, I expect curation and critical writing will be instrumental in guiding audiences through the material.

A Cursory Exploration of Fielding’s Preoccupation with Violence

Posted on November 11, 2021 by Kai Prenger

Premise

A couple years ago, I read Henry Fielding’s The History of Tom Jones for an 18th century British novel class. One aspect about that work that interested me was how the protagonist is repeated on the edge of entering a duel or engaging in a fistfight, mostly as a result of his philandering. The framing of this dynamic in most scholarship revolves around Fielding’s abhorrence of the honor culture amongst the aristocracy in Britain, and its deleterious effect on the English soul by way of escapades abroad that included the colonization of lands and wars. That said, I found that most of his considerations about violence centered on lawlessness within England, where Highwaymen could pillage as they pleased. Fielding’s founding of the Bow Street Runners, what eventually became the Metropolitan Police Service in London, also highlighted for me what seemed an unusual preoccupation with public violence in 18th century Britain.

For this assignment, I wanted to begin interrogating my intuitions about his interest in addressing crime performing a distant reading of Henry Fielding in comparison to a small corpus of 18th century British literature.

Tools

While I played with the Natural Language Toolkit in Python, seeing how I could remove stop words, case everything in lower case, remove punctuation, I felt I’d get tangled too much in tweaking things to get on with the analysis in an initial exploration like this assignment. Using Voyant Tools not only allowed for quicker distant reading, but also serves as an excellent place to develop research questions that could be handled in detail with more flexible tools at another time. I also used n-gram as a gut check against a large corpus than I could put together by hand.

I chose line graphs and stacked bar charts for visualizations, as they are reasonably easy to reason about 😉 .

Analyses

Tom Jones

My first stop was to add a text file version of Tom Jones via Project Gutenberg, a ready at hand source to compile corpora for Voyant Tools to consume.

Here’s what a gleaned from viewing Tom Jones in Voyant Tools.

Searching with the wildcard violen* when compared to happiness, happy and joy reveals the prevalence of violence mentioned throughout Tom Jones. From memory, I can imagine the highest peaks of violence revolving around (1) when Tom saved Jenny Jones (Mrs. Waters) from Northerton attempted rape. As each of these search strings represent about 1% of unique words in the text (13,019 unique words in total), comparing the relative frequency of these terms seemed useful, without major distortions in scaling. It seems as though violen* features heavily in the text, though one might assume that it is as equally represented as happiness/joy if combining words with like sentiments. The idea of symmetry between these terms dovetails well with Fielding’s love of symmetry in plotting.

Violen* vs. Happy, Happiness and Joy in *Tom Jones*

Joseph ANdrews

I have also read Joseph Andrews, another novel by Fielding, and outside of one particular scene where Highwaymen rob a carriage Joseph is on, and strip him naked, I couldn’t remember there being too many threats of violence.

Here’s what I gleaned from viewing Joseph Andrews in Voyant

Two points to make when viewing Joseph Andrews work frequency for similar strings. One is that each of these search strings make up less of the unique words in this text vs.Tom Jones (each less than .3% of the 9454 unique words). However, we do is the same sort of symmetry, with the terms connoting joy and violence showing similar parabolic shapes in the middle of the document segments.

Violen* vs. Happy, Happy* in *Joseph Andrews*, by Henry Fielding

Corpus of other 18th British Novels

At this point, I wanted to see these terms in relationship to each other in a larger corpus. I created small corpus of other 18th century British novels I’ve read over the past few years, including, in this order:

Clarissa and Pamela by Samuel Richardson
Evelina and Camilla by Fanny Burney
Tristram Shandy by Laurence Sterne
Moll Flanders and Robinson Crusoe by Daniel Defoe
The Female Quixote by Charlotte Lennox
The Monk by Matthew Lewis
The Mysteries of Udolpho by Ann Radcliffe
Gulliver’s Travels by Jonathan Swift

Here’s what I gleaned from viewing this corpus in Voyant Tools

This corpus represents as broad cross section of published novels during this time, from the scandalous (Lewis) to the popular (Burney and Richardson) to the less canonical (Lennox and Radcliffe). It also delivers an order of magnitude higher number of words when compared to the individual Fielding novels (millions vs. hundreds of thousands). I found the line graph of violen* and the happy keywords to be a little harder to read in this instance, though it certainly seems like violence hovers below the peaks we saw in Fielding’s novels, and the trend is downward over even later part of the corpus, which includes some of the more grift spectrum of the corpus (order of the texts masters in this context). If I had to generalize, Fielding looks more concerned about violence instead of happier words,

Our keywords from a larger corpus of 18th century British novels

Stacked Bar charts of Fielding Novels

Viewing these keywords across reading time in Tom Jones offered an insight into the plot structure of comedy, though I suppose this is not something that distant reading alone as uncovered by comedy as a genre.

Violence is overrun by happiness over time in *Tom Jones*

What was more surprising is the persistent of violen* in Joseph Andrews. I wonder if this is because this novel revolves around Joseph trying to “preserve his chastity” through a series of encounters with desirous women. In the case of Tom Jones, the protagonist is having sex throughout the novel, with only occasional spikes in violence during his expulsion for his adoptive father’s estate, and the incident with Northerton and other encounters with violent men.

Violence persists throughout Joseph Andrews

N-Gram Viwer

As a further point of comparison and perspective, I looked at my keywords in Google Books N-Gram Viewer in British English from 1600-1900. I’m struck by the increas in happy, happiness and violence starting in the mid 1750 (around the time Fielding was writing). Given this larger corpus, I find it hard to dismiss the null hypothesis: that Fielding wasn’t more preoccupied by violence than his peers.

Reflections

I will share three reflections having completed this exercise

It is definitely the case that even a cursory exploration as shown above can generate more specific research questions. For instance, does the relationship between sex-seeking (Tom Jones) vs. sex-averting (Joseph Andrews) account for the differences seen temporally in each of those texts? How is that related prescribed gender expectations of men and women in 19th century England? Some of the critiques of distant reading straw man the method as an unreflecting and totalizing effort. But there are a number of sophisticated text analysis methods to perform, not contemplated here, that could be enlightening in conjunction will closer readings of each text.
Some of Voyant tools are helpful, while others seem to represent spaces for further analysis. One example: the context table is hard to make sense of in the tool as opposed to the visualizations. Though I suppose Voyant Tools is aware that they aren’t the last word in text analysis
Call me old fashioned, but a purely distant reading of any string of characters is missing the point of what a text is. As Roland Barthes put it in the opening of S/Z:

There are said to be certain Buddhists whose ascetic practices enable them to see a whole landscape in a bean. Precisely what the first analysts of narrative were attempting: to see all the world’s stories (and there have been ever so many) within a single structure: we shall, they thought, extract from each tale its model, then out of these models we shall make a great nar- rative structure, which we shall reapply (for verification) to anyone narrative: a task as exhausting (ninety-nine percent perspiration, as the saying goes) as it is ultimately undesirable, for the text thereby loses its difference.

And this is explicit task of distant reading: to lose the difference of individual texts and align them in larger sociologies of literature. Like Barthes, I view a text as product of the interaction of a reader with the work presented. I certainly couldn’t have reasoned about the “temporal” changes in the keywords in Joseph Andrews or Tom Jones having not experienced these books through reading.

Understanding the “Back-End”

Posted on October 21, 2021 by Kai Prenger

One of the hallmarks of Johanna Druckers’ scholarship is an interest in how knowledge is produced. In SpecLab, her experiments at the titular laboratory centered on “reconceptualization of premises and parameters, not a reassessment of means and outcomes.” This sentiment gives a generous acknowledgment of the benefits computer-assisted quantitive methods made in humanities research, while also recognizing the “invisible” constraints orthodox methodologies inherit from the disciplines and industries of origin (Drucker, 27). Recent publications build on some of her earlier experiments to demonstrate that what is presented to us as self-explanatory artifacts actually require more thought, context and interpretation than we’re trained to deploy in our every day encounters with, say, data visualizations. Along similar lines, Drucker draws our attention to the hidden costs of the transition away from physical media to digital media for academic publication this week. One small nuance present in Pixel Dust caused me to reflect on why I’m prone to accepting Drucker’s lines of augment.

Drucker demonstrates an understanding of relevant digital technologies outside of an academic context which views the “back-end” as cruical to the frontend. The thoroughness of comprehension merges in the specificity of one example used when description some misunderstandings of the transition to digital media for academic publication. The company named in “the misperception…that everything digital is available on Google” strikes a chord with me given the origin PageRank, the key algorithm in that companies competitive advantage in the 1990s IT tech ecosystem. I would argue that Drucker knowingly references Google because of the influence of citation analysis on this algorithm, borne out of finding a quantitative method of evaluating the importance of published scientific articles. As Google put describes the logic:

PageRank works by counting the number and quality of links to a page to determine a rough estimate of how important the website is. The underlying assumption is that more important websites are likely to receive more links from other websites (citation courtesy of the Wayback Machine)

Much like her example of the codex, Drucker emphases the continuities in the backend logic between older forms of media and their descendants. In this case, leaving discovery up to Google search results likely reinvigorates the same forces that constricted traditional academic publishing, both financial and institutional, through backlinking other SEO strategies that favor

Drucker’s implication of the backend as critical subject matter to understand the transition to digital formats reflects the true costs, rather that the price. One misconception has to do with the distribution of costs over the lifetime of an academic product. Whereas physical books and journals had large upfront costs, the cost of digital publication goes beyond licensing fees, or even the “author-pays” model mentioned in Fitzpatrick’s chapter in Generous Thinking. Drucker also counts the cost of maintaining the digital medium itself, often not appended to price tags. To draw the analogy using a physical book, it’s as if there were occasions where the pigments in ink stopped rendering, and someone needed to jostle them to recompose as letters and text on a page. Or the pages would occasionally fall out of the binding, become disordered on the ground.

I think there are reasonable responses to these problems offered in the motivations behind Gil’s Ed. And in general, the reading from this week and last about infrastructure shows a gradual movement towards systems thinking. As someone who’s being propelled rapidly into a more “backend position” in a professionally, I find a lot of sympathy with Drucker’s instinct to scrutinize the complex tradeoffs in the “invisible work” required to create and maintain digital platforms.

The Digital Humanism of the Early Caribbean Digital Archive

Image

I found the Early Caribbean Digital Archive (ECDA) to be a succinct representation of the many valences in the digital humanities’ expanded field.

The archive participates in some “traditional” motivations of digital humanities. The project follows Ramsey’s urging for digital humanists to “build things” from “Who’s in and Who’s Out.” By creating a digital archive like, the ECDA hues closely to the camp of digital humanists “who use new digital tools to aid relatively traditional scholarly projects” (Gold, 2012). The emphasis on catalogs and manuscripts harkens to an operating mode where DH comprises a set of methods to make traditional archival material more accessible via digital technologies. Moreover, the metaphors used on the EDCA homepage ground the project in the conventional notion of digital humanists that an observer outside of the field might expect. Imperatives like “take a tour,” the categorization of material as “archive” and “exhibits” fall within the tropes of physical museums. The “curated collections” indicate some level of expertise deployed for creating “ new entryways into the archive,” carving out grooves by which a “guest” can navigate or “explore” the archive. These motivations to build a tool for researchers, to digitize archival material, and provide structured guidance – couched in purview of what typical institutions may expect from digital humanities – are visible in how the EDCA frames its project. These motivations go some way to understanding not only why Northeastern University would host this archive, but also the library partnership that the project attracts.

The project’s palatability to institutional forces doesn’t diminish the EDCA’s participation in the more unorthodox practices in DH. Sidestepping the colonial project inherent in these texts goes beyond the preternatural “recombinatory” and nonlinear benefits of digitalization mentioned in the “Decolonizing the Archive: Remix and Reassembly” section of the website. The tagline “is it possible to decolonize the archive?” hints at the experimentation in how the archival material is arranged to try and answer that question. Slave narratives embedded in books written by European colonial authors are broken free from that context in the EDCA “to form a new digital anthology of narratives that speak to one another (beyond the context of the words of Bryan Edwards or similar texts) in new ways and across new contexts.” This reassemblage meshes well with the “constellation of terms…curiosity, play, exploration” that Spiro identifies as experimentation, a unifying value she sees for the DH community. Breaking up these early Caribbean texts also aligns the EDCA (albeit in a very small “c” conservative way) with DH scholars that draw their lineage from McGann and Samuel’s Deformance and Interpretation. By reading a text in reassembled order — backwards, with words removed, or the order of paragraphs, sentences or stanzas reassembled etc. we can generate new knowledge about the text in question, and reveal the structures that undergird it. In the spirit of Spiro’s elevation of collaboration as an inherent DH value, I appreciate that the EDCA curators set the extracted slave narratives in conversation with one another, having those voices collaborate on a new narrative with the potential to

DH 700 Fa21

Introduction to the Digital Humanities

Author Archives: Kai Prenger