A series of thoughtbursts on topics concerning (e)book history, (e)print culture, & digital humanities.
Don't wanna be here? Send us removal request.
Text
What Does Generosity Have to Do With Linked Open Data?: A Lit Review

The following is a partial lit review of recent conversations surrounding Linked Open Data within the humanities with a focus on what generosity means in the context of the Semantic Web. Iâve experimented with form in this post by lifting key voices out of the text and styling them as block quotes in order to better visualize the range of opinions presented in this review. I have also set up a corresponding (open) annotated bibliography on Zotero which you can find here. The annotations can be found in both the Notes attached to each citation, as well as under Extra so that readers may quickly move through the citations, rather than having to open each Note separately. Where possible, I have processed the text of the essays, blog posts, and articles featured in this annotated bibliography using Voyant Tools and included up to ten tags of the most frequently used words.
The dream of the Semantic Web and the emergence of LOD
Popular buzz surrounding the Semantic Web has been going on since the early 2000s, later epitomized in a TED talk Tim Berners-Lee delivered in 2009. In his talk, Berners-Lee announced to the world that he needed some help revamping the World Wide Web: a world in which web documents and data coexist. What he asked for was a collective push towards making raw data available on the Web, by which he meant machine-readable data. The same year Berners-Lee, Christian Bizer, and Tom Heath published a paper in the International Journal on Semantic Web and Information Systems titled, âLinked Data - The Story so Far.â In it, they describe Linked Data as âa set of best practices for publishing and connecting structured data on the Webâ which opens up the possibility of establishing a âglobal data space.â Where Open Data means data that is freely accessible on the web in non-proprietary form, Linked Open Data (or LOD) at its most basic is hyperlinked data, meaning data that references and connects to other data on the Web. Structurally, LOD is created through the use of URIs (Unique Resource Identifiers), the vocabularies that identify and define relationships between resources (which make up web ontologies), and RDF (Resource Description Framework). If the World Wide Web is made up of documents, then the Semantic Web is a Web made up of data. For a complete technical overview on LOD, I recommend a look at linkeddata.org, the W3C, or the Berners-Lee and co. paper referenced above. There is still much confusion surrounding the relationship between the Semantic Web and Linked Open Data. Where some believe the Semantic Web and LOD to be one and the same, others understand the Semantic Web as made up of Linked (Open) Data -- this review subscribes to the latter. The lack of consensus, however, is interesting and perhaps representational of the spirit of Linked Open Data in that it reflects both its charm and difficulty, that is, the nature of LODâs conflicting ontologies and unregulated vocabularies.Â
Tempting the star-collecting achiever in us all, Berners-Leeâs Five Stars of Open Data is a LOD deployment scheme which urges users to free their information from documents that rely on proprietary software so that others may access their data. These five stages towards open data are perhaps best represented in the following graph and legend (taken from their posh site and pasted here):
At the heart of Berners-Leeâs five star systems is a desire for people to make available the data they have now and worry about refining the structure of that data later, a point made clear in his talk:
We want unadulterated data. OK, we have to ask for raw data now. And I'm going to ask you to practice that, OK? Can you say "raw"?...Can you say âdataâ?...Can you say ânowâ?
Although this approach is effective at getting data out and onto the Web, the question of how many return to refine or clean up their data, let alone work up the five-star ladder, is still up for debate (see amazing article on âmetacrapâ). Perhaps the most crucial moment in his talk is a reminder that âdata is relationships,â where each node is connected to another and that node to another, making up a complex network of relationships. LOD, then, is a social practice that relies on shared labour for the greater good. This spirit of social responsibility fuels the collective work, a philosophy summarized in the concluding remarks of Berners-Leeâs talk: Â
It's about people doing their bit to produce a little bit, and it all connecting. That's how linked data works. You do your bit. Everybody else does theirs. You may not have lots of data which you have yourself to put on there but you know to demand it.
The structural politics of LOD
LOD is valuable in its ability to publish data that is interoperable and to quickly build up networks of connectivity. In the last ten years, the ecosystem that supports linked and open datasets, more formally known as the LOD cloud, has grown 47.5 times since it was first captured in 2007.
A screenshot of the LOD cloud in 2007 featuring 12 datasets.
A screenshot of the LOD cloud in 2014 featuring a total of 570 datasets (hereâs a link to an explorable graph).
Like these LOD cloud graphs emphasize, the structure of RDF itself represents the âdata is relationshipsâ philosophy in its subject-predicate-object statements, which describe the relationships between resources within local as well as external datasets. Whatâs more, LOD supports meaningful, that is context-based, connections between data from a wide range of sources, aided by the easy integration of RDFâs forgiving non-hierarchical structure. In âZen and the Art of Linked Data,â Dominic Oldman, Martin Doerr, and Stefan Gradmann praise the use of RDF for humanities driven research, writing
Of particular significance to humanists is that semantics can be embedded (rather than described separately) within exactly the same structure. This provides far greater potential for integrating vast repositories of data using the standard Web protocol, and provides the foundation for additional technology layers with increasingly sophisticated levels of expressivity. It also provides the type of flexibility that researchers require to quickly incorporate new information and data structures that are necessary as their research progresses, and creates the opportunity for consistent forms of knowledge representation for all research activities.
In other words, RDF serves as a kind of common language in the world of Linked Data with which to establish semantic connections across the Web. This history of a shared interest in knowledge representation is charted in James Smithâs chapter on âWorking with the Semantic Web,â in which he explains
The Semantic Web and linked data are computational applications of pre-existing scholarly practices: linking to primary and secondary sources, signalling trusted vocabularies and authorities, and positioning a work in a larger conversation.
In other words, humanities scholars are uniquely qualified to participate in the creation of the Semantic Web in that the standards of Linked Data mirror the methods and practices we employ in our own scholarly writing. Beyond how we create content, John Unsworth points out the need for increased humanist inquiry in the field of LOD, writing
In some form, the semantic web is our future, and it will require formal representations of the human record. Those representations â ontologies, schemas, knowledge representations, call them what you will â should be produced by people trained in the humanities.
For Unsworth, the creation of âformal representations of the human recordâ need humanities-authored ontologies with a particular focus on their expertise in the mechanics of knowledge production and representation. Though a still emerging field, Alan Liu reminds us that the task of the digital humanities now is to bring the values of the humanities back into computation and consider âhow the digital humanities advances, channels, or resists todayâs great postindustrial, neoliberal, corporate, and global flows of information-cum-capitalâ as a way of addressing the lack of cultural criticism that âblocks the digital humanities from becoming a full partner of the humanities.â Digital humanists, in other words, need to get into the habit of thinking critically about their metadata, about the web applications and tools they use to conduct their research, and about the culturally-bound infrastructures that support those technologies. As Tara McPherson reminds in her essay âWhy are the Digital Humanities so Whiteâ, as much as computation responds to culture, âwe must remember that computers are themselves encoders of culture.â With this history in mind, McPherson (and others like Amy Earhart, Lisa Nakamura, Moya Bailey, and Kim Gallon) urge for attention to be paid to the white epistemologies that underlie the structures of our digital world, writing
We need to privilege systemic modes of thinking that can understand relation and honor complexity, even while valuing precision and specificity. We need nimbler ways of linking the network and the node and digital form and content, and we need to understand that categories like race profoundly shape both form and content. (McPherson)
There are a handful of projects (Linked Modernisms, Linked Jazz, InPho and Huviz (in beta), to name a few) that have begun some this work -- but there is much work that lay ahead.
Corinna Bath takes the task of modelling the future of the Semantic Web as one that must rely on feminist ethics. She draws on the work Donna Haraway and Karen Baradâs concept of diffraction as a way of facing the challenges automatic reasoning pose in an environment that supports competing ontologies within the LOD cloud (3). Pointing to Baradâs term âonto-epistom-ology,â Bath calls for more attention to be paid to the misleading division between ontology and epistemology when creating LOD, especially when conceptualizing ontologies as representational of the âreal worldâ(4). This call for more attention to be paid to feminist ethics as sources of knowledge modelling is echoed in the works of Anita Gurumurthy and Nandini Cham in âData: the new four-letter word for feminismâ. In their article, Gurumurthy and Cham argue the importance of reclaiming data from hegemonic rule, writing
Assuming that data can indeed enable a powerful reconstruction of reality, the process by which it constitutes knowledge for transformative change must be based in deeper ethical-political debates. Unhinged from the complexity of ethics and politics, a world of data â as we are witness to â can end up as an absolutism that endangers the very essence of democracy as feminism would know it.
Whatâs at stake, then, is a world of data without critical thinking -- a world in which the processes by which data is generated, contained, and accessed are left unchallenged. Jeni Tennison expresses similar anxieties surrounding the social processes that govern the production and dissemination of information on the web, asking
Is it the case that opening data simply increases the gap between the information haves and have-nots, and that leads to wider economic inequality, or does everyone benefit when information is more widely available? Are there tipping points of availability at which we start realising the benefits of open data? What is the role of government in encouraging data to be more widely available and more widely used? To what extent should government invest in data infrastructure as a public good? How can local or specialist cooperatives pool resources to maintain data?
Others like Ingrid Mason remain weary of the standards (or lack thereof) surrounding the representation of people in data. Put simply, âPeople matter and representing âpeopleâ in data and turning that into linked open data is no small feat.â For Mason, one way of tackling the complexity of representing identity on the Web and avoiding harmful representations of people in LOD -- harmful in the sense of placing people in categories that overlook the discursive categories of gender and race -- is through collaboration. The organization of post-Summit meetings (ie. Linked Open Data in Libraries, Archives, and Museums Summit 2017) is one small step towards addressing the challenges surrounding the treatment of data about people, but crucial nonetheless.
More than a feeling: cultural challenges, social responsibility, & LOD
If we are to promote broader engagement with LOD and widen the field to include the humanities as full partners, formal standards must be established when it comes to how we publish Linked Data on the Web (ie. context, provenance, and data integration). Despite the incredible growth of LOD within digital humanities and cultural heritage sectors (#LODLAM), the recycling of data, however, what Michele Barbera calls âcreative reuse,â has been limited despite the recent technological advances that make it possible (91). What this suggests, Barbera argues, is a need to shift social and cultural habits of digital scholars from humanities and cultural heritage backgrounds. The discomfort around sharing content and collaborating online is a feeling that continues to persist in the humanities. Where collaborative scholarship may be business as usual in the sciences, the humanities still have much work to do in establishing a culture that not only supports but encourages collaborative work. Digital collaboration -- indeed collaboration of any kind -- will likely always require an initial leap of faith. When done right, however, this kind of work, this effort to make oneself open to the possibilities of working with others, exchanging best practices, and sharing the burden of research and writing (while celebrating the pleasures too) proves powerful and worthwhile. For recent work on collaborative scholarship online see Susan Brownâs âTowards Best Practices in Collaborative Online Knowledge Productionâ and Natalia Mehlman Petrzela and Sarah Manekinâs âThe Accountability Partnership: Writing and Surviving in the Digital Age.
To return to Barbera, beyond the discomforts of sharing content online, cultural heritage and digital humanities researchers continue to remain caught in so-called âtwo-dimensional paper thinking,â that is, reproducing print technologies on the web rather than designing projects that derive from and are built for the Semantic Web (96). We cannot continue to rebuild old models with new technologies, we must, as Berners-Lee urges, encourage âthinking in the graph.â Likewise, technological innovation in the field of LOD cannot flourish if the shifting cultural demands of the Semantic Web are not first addressed. One way of bringing about the kind of cultural change required to support a rich and diverse linked open data economy, I propose, begins with what Kathleen Fitzpatrick calls âgenerous thinking.â
Generous scholarship: towards critical cyberinfrastructures
Fitzpatrick's work (her excellent blog can be read here) is known for advocating for scholarship that is open to displaying works-in progress and honest about mistakes made along the way -- including the countless drafts (or version, if you like) a project goes through before âcompletion.â Her latest project focuses on âthe possibilities that might open up for scholars not just in doing more of their work in public but in doing more of that work in conversation with the public.â Drawing on the recent critiques of criticism by Bruno Latour and Rita Felski, âgenerous thinkingâ is offered as a way to encourage better practices of communication within the academy. In its most basic form, generous thinking roots the humanities in a practice of generosity, meaning, âthe practices of thinking with rather than reflexively against both the people and the materials with which we workâ while fostering âmore productive relationships and conversations not just among scholars but between scholars and the surrounding communityâ (Fitzpatrick). For Fitzpatrick, now is as good a time as any to tackle our institutional problems:
We have the opportunity, if we take that care seriously, to create a kind of dialogue that might help further rather than stymie the work we want to do â and that might not simply improve the standing of the humanities in the popular imagination, but dramatically transform the relationship between the university and the broader public.
This philosophy of academic life is compelling in its emphasis on cultivating small moments that affect great change, including: âa greater disposition toward listening, toward patience, toward engaging with what is actually in front of us rather than continually pressing forward to where we want to goâ (Fitzpatrick). When faced with the question of what the humanities offer universities and the general public, Fitzpatrick points to the many possibilities we open up when we think generously. For her, âgenerosity of mindâ encourages genuine dialogue that builds rather than stifles a work, an attitude that places value on the importance of listening for the sake of understanding rather than a means to an end (Fitzpatrick). This is the difference between paying attention to your colleague during their talk instead of focusing on what youâre going to say during Q&A (guilty). At the core of Fitzpatrickâs model is a desire to learn and build better together, to work collectively with a reminder to pay attention to fellow collaborators, to honour the subjects we study, and to âencounter the other in all its irreducible otherness.â Itâs about trying to slow down the demands of the academy and focus on true engagement, whether itâs with perspectives that are not our own or making time to revisit that project you keep putting off. Itâs about hard work, yes, but of a different kind: work that cultivates the ability âto listen â to the text, to our communities, to ourselves â without attaching or rejectingâ (Fitzpatrick).
Other voices have been pushing for generosity too. Mitchell Whitelaw takes up the âethos [of] generosityâ in his work on âgenerous interfaces,â writing:
The qualities of generosity I am interested in here are âto be liberal in giving or sharingâ; also to be âlarge, abundant, ampleâ . Both of these qualities seem well aligned with the aims and missions of cultural collections. Our digital collections are certainly large, abundant and ample; and the charters of our cultural institutions place a high value on sharing these riches liberally with the public. Generosity seems to be very much in line with the aims of our cultural collections. (2)
Within this context, generosity in interface design means presenting the user with the richness of a collection and empowering them to explore its contents in ways that are both intuitive and delightful. Arguing for a different kind of generosity, Miriam Posner voices her concerns regarding how data is conceptualized within the digital humanities, noting, âmost of the data and data models we have inherited deal with structures of power, like gender and race, with a crudeness that would never pass muster in a peer-reviewed humanities publication.â Returning to Fitzpatrickâs definition of âgenerosity,â the bulk of digital humanities work has been rather ungenerous, that is, not paying attention to the white epistemologies that continue to inform the ways in which concepts like race and gender are treated in our datasets and represented on the Web. To borrow again from Posner, we mustÂ
. . . stop acting as though the data models for identity are containers to be filled in order to produce meaning and recognize instead that these structures themselves constitute data. That is where the work of DH should begin. . . [we need to be] more ambitious, to hold ourselves to much higher standards when we are claiming to develop data-based work that depicts peopleâs lives.
She goes on to challenge criticism that would paint calls for more engagement with race and gender theory as âa kind of philanthropic activity.â Generous thinking too can be read can be read in a similar light -- but this is of course nonsense. Rather than scoff at attempts to rally efforts and challenge systems of oppression in all its shapes and forms, Posner reminds us that
DH needs scholarly expertise in critical race theory, feminist and queer theory, and other interrogations of structures of power in order to develop models of the world that have any relevance to peopleâs lived experience. Truly, it is the most complicated, challenging computing problem I can imagine, and DH hasnât even begun yet to take it on.
What does generosity have to do with LOD?
Iâd like to end with some thoughts on âthis most complicated, challenging computing problemâ (Posner) and imagine a Semantic Web made up of generous Linked Open Data. If the voices Iâve gathered here in this review have demonstrated anything, it's that the academic community needs to reclaim its sense of responsibility by conducting research that builds rather than fragments, remaining ever conscious of the needs of the communities they serve, and creating a kind of digital legacy worth investing in. In this sense, the hard work that lays ahead for humanities-driven LOD has more to do with Fitzpatrick and Whitehallâs radical application of generosity than it does with technological innovation. Where âgenerosityâ means generous as in linked (an abundance of meaningful connections to external resources), generous as in open (free for others to access, reuse, or build on), and generous as in thoughtfully managed data (with attention paid to how data is categorized, represented, and made explorable).
Major Works Cited
Barbera, Michele. âLinked (Open) Data at Web Scale: Research, Social and Engineering Challenges in the Digital Humanities.â JLIS 4.1 (2013): 91-101.
Bath, Corinna. âTowards a Feminist Ethics of Knowledge Modeling for the Future Web 3.0.â 10th IAS-STS Annual Conference. May 2011. Graz, Autria. Abstract.
Berners-Lee, Tim. "The Next Web." TED. Feb. 2009. Lecture.
Bizer, Christian, Tom Heath, and Tim Berners-Lee. "Linked data-the story so far." Semantic Services, Interoperability and Web Applications: Emerging Concepts (2009): 205-227.
Fitzpatrick, Kathleen. âGenerous Thinking: Introduction.â Planned Obsolescence. Last updated 5 October 2016.
Gurumurthy, Anita, and Nandini Chami. âData: The New Four-Letter Word for Feminism.â GenderIT: Feminist Reflection on Internet Policies. 31 May 2016.
Mason, Ingrid. âPeople in Linked Open Data.â Summit2017: Linked Open Data in Libraries Archives and Museums. 29 Nov. 2016.
Oldman, Dominic, Martin Doerr, and Stefan Gradmann. âZen and the Art of Linked Data.â A New Companion to Digital Humanities. Ed. Susan Schreibman, Ray Siemens, and John Unsworth. John Wiley & Sons, Ltd (2015): 251â273.
Posner, Miriam. âWhatâs Next: The Radical, Unrealized Potential of Digital Humanities.â Debates in the Digital Humanities 2016.
Smith, James. âWorking with the Semantic Web.â In Compton, Lane, and Siemens (eds.) Doing Digital Humanities. Routledge, 2016.
Tennison, Jenni. âAgent-Based Model of the Information Economy: Initial Thoughts.â Jeniâs Musings. 9 Feb. 2016.
Whitelaw, Mitchell. âGenerous Interfaces for Digital Cultural Collections.â Digital Humanities Quarterly 9.1 (2015).
Photo credit:Â Milada Vigerova via Unsplash
0 notes
Text
(Towards) A Very Merry LOD Season

This post is a product of having swapped readings with a fellow Linked Open Data classmate who comes from an Art History background. I was very glad for the chance to approach this topic from another angle. She recommended Matthew Lincolnâs âThe Art Historianâs Macroscope: Museum Data and the Academy,â a blog post based on a talk he gave last year in May at the Cultural Programs of the National Academy of Sciences in Washington D.C. The following is a quick overview and some thoughts on collaborating with(in) academic institutions.
Although admittedly weary of the term âbig data,â Matthew Lincoln applies Shawn Graham, Ian Milligan, and Scott Weingartâs The Historianâs Macroscope to the field of art history in his talk âMuseum Data and the Academy.â He takes their tools and methodologies surrounding data-driven analysis âin concert with traditional historiographical methodsâ in art history, like the microscopic âclose lookingâ (Lincoln). Lincoln notes that although the concept of data analysis is not necessarily ânewâ to art historians (see Roger de Pileâs work in 1708 on quantified style), never before has there been such an abundance and access to art historical data.
This ever growing collection of data produced by an âincreasingly digitized museum worldâ (Lincoln) allows for print historians like Lincoln to take on the sheer number of existing Dutch engravings and etchings. Beyond access, the question of how to âhonour the specificity of individual artists and artworksâ (Lincoln) becomes central for Lincoln. In other words, how can we pay continued attention to the macro-trends of history (what literary students know as âdistant readingâ), the complex web of networks these artists worked within (close-ish reading), as well as the particular lives and works of key figures (close reading). Perhaps this is where Linked Open Data catches the eye of humanities scholars â what if LOD could help solve this problem of scale?
Lincoln pauses to remind the reader that âmuseums are repositories of artwork, yes, but also of repositories of knowledge structured dataâ -- not a far cry from Tim Sherrattâs âWe have data too!â â a compelling point for art historians who  are working in a moment where âmore than a century of curatorial work describing collectionsâ history is finally starting to make it into publicly-accessible databases.â Lincolnâs own work draws on the British Museums Semantic Web Collection project that pulls from the CIDOC Conceptual Reference Model.* Access to these kinds of records is particularly exciting for print historians who know that a print is often the product of many hands (designers, etchers, inkers, printers, painters, publishers): with linked data, they can chart professional relationships within printing communities. For Lincoln, this opens up the possibility of asking new questions like, âdid different regional networks experience their own patterns of centralization and decentralization?â Which central figures are remembered in our art history and which have been overlooked or forgotten? And how can we map printers hubs or chart the circulation of prints geographically?
His talk ends with two questions, what can museums do in light of these new and exciting art history projects made possible by linked data? And how can universities help support and contribute to this burgeoning field?
Lincoln has a few things in mind, points Iâve reimagined here in the form of a LOD holiday wish list!
đ˛ Museums need to expose the curatorial knowledge stowed in their content management systems and work to structure and clean that data. This echoes Tim Berners Leeâs five stars of open data system, which urges people to first get their data out and then work to refine its structure.Â
đ˛ Museums should not solely support digital database development (focused on user-facing tools that allow users to easily sift through data on web and mobile devices) but also work towards âbulk datasets built for complexity, not just for speed and convenienceâ (Lincoln).Â
đ˛ Whenever possible, make data interoperable and avoid heavy customization â Lincoln acknowledges that this is perhaps the âhardest goal.â
đ˛ Universities must âreimagine how we can describe and permute our knowledge in digital formatsâ (Lincoln). This will require for art historians to work closely with librarians and information scientists within their institutions as well as reach out to DH scholars, borrowing tools and methodologies.Â
đ˛ Humanities departments must be willing to support macroscopic research and âhypothesis-driven experimentation.â This requires a re-imagining of humanities scholarship that makes room for the possibility of âquasi-scientific testingâ (Lincoln) in combination with the kind of interpretive work weâve been carefully refining for centuries.
đ˛ We must recognise that digital humanists share priorities, but their interests also diverge. Lincoln draws on Sheila Brennanâs piece, âDH Centered in Museumsâ to remind us that âMuseums have done DH for a long time, and they have their own prioritiesâ â namely, are collection driven (from exhibition to preservation). DH for Art Historianâs, however, as Lionelâs project would suggest, is research question driven and more interested in locating (and filling) knowledge gaps.
Sadly, these âwishesâ are not for some over-cookied, twinkly-eyed fellow to stuff down our chimneys. These calls to action are, of course, something we â digital humanists broadly construed â must work on together if they are to ever be delivered. Although a crucial start, it is not enough to come up with a list for museums and universities to consider. We must engage with the groups that run these institutions directly, establish fruitful relationships where possible, and collaborate where resources (like time) permit. We must continue to resist division (whether departmental, institutional, ivory towered) as we work towards building the kind of infrastructures that support âhypothesis-driven experimentationâ within the humanities if the kind of scholarship we produce is to be valued and preserved.
*The British Museum Semantic Web Collection Online was down at the time I was writing this post, taking with it both the digital collection, SPARQL endpoint, and HTML user interface. It is now back up.Â
Watch this space for a review.
Photo: Maria Mekht Unsplash
0 notes
Text
âGenerous Thinkingâ and the Future of Data Economies

This weekâs response is two-tiered. The first takes up Michele Barberaâs call for âa lively data economy with a rich ecosystem,â one that requires a âprofound cultural shift [in the ways] data is produced, managed and disseminatedâ (91), while the second considers the problems faced in the linked open data (LOD) community and brings them into conversation with Kathleen Fitzpatrickâs concept of âgenerous thinking.â
In his article âLinked (Open) Data at Web Scale: Research, Social and Engineering Challenges in the Digital Humanities,â Barbera provides an efficient survey of the current technological and cultural landscape of linked data, locating major gaps and spaces for scholarly intervention. He flags three major areas that require attention, streaming linked data, versioning, and the social challenges of a linked data economy.
Streaming linked data
With the ubiquity of mobile devices that are embedded with data-generating sensors (Barbera 93), live streaming data is more possible now than ever. Keeping this new abundance of data in mind, Barbera calls for more commercial attention to be paid to the possibility of linking live streamed data. He fails, however, to address the ethics that surround the collection of this kind of data which, if linked data projects are to be pushed beyond research communities, need to be taken into account as a part of the larger cultural landscape.
Versioning
Beyond the interest of tracking the evolution of RDF graphs for the purposes of generating a history, versioning is the knee-jerk response to digital collaboration â an attempt to keep checks and balances while establishing trust within a community. Barbera, however, does not spend much time discussing the need for versioning protocols and tools in LOD. Instead, he moves on to a more pressing issue within the LOD community, one that requires work beyond âtechnological innovationâ (93).
Social challenges and nurturing a linked data economy
Barbera puts forth the controversial idea that cultural heritage and digital humanities researchers are trained to inherently think two-dimensionally and ultimately find it hard to think âin the graphâ (96). He urges this group to resist a âtwo-dimensional thinking derived from the paper-world,â a world in which the limitations of paper are âmimicked rather than revolutionized in the digital worldâ (96). For Barbera our minds are influenced by the organizational logics of the tools we employ; within this logic, tabular structures are aligned with two-dimensional âpaper-worldâ thinking and stunt the progress of the linked open data community. Or, how can we build up a robust and dynamic linked open data economy if we are unable to conceive it? And how are we to inhabit new structural logics if we remain shackled by old models? According to Barbera, the time to invest in innovation, especially on a commercial scale, is now. In his concluding remarks, he brings up âmonopolistic threatsâ and the danger they pose to âpublic goodâ (98), but does not go much further except to urge for a careful strategy that âprotect[s] common knowledge-heritageâ and â(linked!) public goodâ (99).
Earlier in his paper, Barbera reminds us that with the ârapidly growing amount of data available in the linked open data cloud and in enterprise linked data repositoriesâ the existence of a single, centralized computation of all data is simply not possible (92). The necessity of a decentralized system, then, is promising in its ability to foster a shared management of our data economies but does not come without its own share of complications. After all, a decentralized system relies deeply on the ability for online communities to not only relinquish total control, which comes hand-in-hand with collaboration, but to pay attention to one another and build together. This concept of âpaying attentionâ may seem like an obvious point, and yet the protocols practised âout in the wildâ serve as a shocking reminder of how little we look outside ourselves and consider the projects of others.
One way of approaching this problem of digital collaboration and âpaying attentionâ is to turn to what Kathleen Fitzpatrick calls âgenerous thinking,â a concept that urges academic institutions and their agents to âcultivate a greater disposition toward listening, toward patience, toward engaging with what is actually in front of us rather than continually pressing forward to where we want to goâ (Fitzpatrick). In this context, generosity is not meant in the sense of âgivingâ but as âgenerosity of mind,â a kind of deep listening that goes beyond waiting for an opportunity to speak (Fitzpatrick). At the core of Fitzpatrickâs model is a desire to learn how to engage in genuine dialogue, collaborate, and build better together ânot only with our colleagues but with our objects of study, our predecessors, and the many potential publics that surround us.â
Thereâs no question that humanities disciplines have much to offer the LOD community (see last post). However, before we join the LOD community and potentially lose ourselves in the exciting features and unexpected insights to be gained from linked data, the question of âwhat exactly do we bring to the tableâ and âhow does LOD help us think through x in our own fieldsâ needs to be addressed if we are to take Barberaâs âpaper-thinkingâ critique seriously. Whatâs more, we must dig deeper into what exactly it means to, as Tim Berners-Lee says, âthink in the graphâ and what it would look like to do so collectively. I end with a passage from Fitzpatrickâs post that gestures towards an âopenness to possibilityâ and offers a potential answer to the question of what the humanities can offer:
All of these possibilities that we open up â engaging perspectives other than our own, valuing and evaluating the productions and manifestations of our multiplicitous culture, encountering the other in all its irreducible otherness â are the best of what the humanities offer to the university, and the university to the world, and we must allow them to teach us just as much as we teach others. And all of these possibilities begin with cultivating the ability to think generously, to listen â to the text, to our communities, to ourselves â without attaching or rejecting. (Fitzpatrick)
Works Cited
Barbera, Michele. "Linked (Open) Data at Web Scale: Research, Social and Engineering Challenges in the Digital Humanities." JLIS 4.1 (2013): 91-101.
Fitzpatrick, Kathleen. âGenerous Thinking: Introduction.â Planned Obsolescence. Last updated 5 October 2016. <http://www.plannedobsolescence.net/generous-thinking-introduction/#more-2828>.
Photo credit:Â Anthony DELANOIXÂ via Unsplash
0 notes
Text
sameAs is not yet closeEnough: on knowledge representation and identity in linked data

Iâd like to offer some preliminary thoughts on identity and representation within a linked data context in conversation with Harry Halpin, Ivan Herman, and Patrick Hayesâ short paper, âWhen owl:sameAs isnât the Same: An Analysis of Identity Links on the Semantic Webâ (2010). Although the paper is now a bit stale, the sameAs issue they outline is one that continues to persist.
In the age of Web 2.0, Linked Open Data (LOD) emerged as a decentralized system for identifying, classifying, and linking information made open on the semantic web. One way of establishing relationships between existing open data on the web is by attributing OWL properties like owl:sameAs. As its name would suggest, sameAs âindicates that two URI references actually refer to the same thing: the individuals have the same âidentityââ (W3C). However, as Halpin, Herman, and Hayes make very clear, âout in the wildâ sameAs is often used as if it were âcloseEnoughâ (2010; 2011). This issue is heightened by the fact that sameAs is one of the most widely used properties, or â(ab)used,â within the linked data community (Halpin et al. 2010, 1). This widespread misuse of owl:sameAs poses a potential threat to linked data when considering the impact of inference in a system that builds by way of referral and interlinking. In their paper, Halpin, Herman, and Hayes (shortened here to âH3â) present four alternative readings of owl:sameAs, concluding with âalternative identity links that rely on named graphsâ (Halpin et al. 2010, 1). The four alternative readings of owl:sameAs are Same Thing As But Referentially Opaque, Same Thing As But Different Context, Represents, and Very Similar To, which Iâll quickly recap now,
Same Thing As But Referentially Opaque occurs when two URIs point to the same thing but donât necessarily share all of the same properties, rendering the reference âopaqueâ (Halpin et al. 2010, 2). This means that the URI âcannot be substituted for anotherâ as it would violate the Principle of Substitution (Halpin et al. 2010, 2). Same Thing As But Different Context refers to the problem when two URIs refer to the same thing and share the same properties, but cannot be re-used in a different context because those same properties, however true, simply do not matter (Halpin et al. 2010, 2). The main claim here is that âthere are âforms of referenceâ appropriate to a context, especially in social contextsâ (Halpin et al. 2010, 2). Represents tries to parse the difference between signifier and signified, working with an âintuitive definitionâ of ârepresentationâ where a URI, like a photograph, represents a thing but is not the âthing itselfâ (Halpin et al. 2010, 3). Problems arise, according to H3, when identity and representation are conflated, not to be confused with instances of âdisplaced referenceâ (Halpin et al. 2010, 3) which acts synecdotally, where a thing, like an email address, represents the identity of an entity, like a person; or, as H3 would define it, where something is referenced âaccidentally or contextually to refer to somethingâ (Halpin et al. 2010, 3). Very Similar To makes up most of the so-called ânoticeable errorsâ (Halpin et al. 2007, 3) where two things that are closely related but not exactly the same are labelled as identical. H3 use the example of Paris and the Department of Paris in Cyc, for instance (Halpin et al. 2010, 3).
Although this article provides a useful gloss of the sameAs problem, it struggles with cohesion and at times feels a bit rushed -- especially to a linked data outsider. Where, for example, is the knowledge representation primer? And what about the organizational logic of identity on the world wide web and its oppressive history (McPherson)? My first question is in part addressed in a later iteration of the paper published at the International Semantic Web Conference in 2010. In this second version, the authors return to the âsameAs problemâ but spend some time first working through the history of knowledge representation and identity within a semantic web context.Â
According to H3, âthe vexing problem of identity has returned with a vengeance to the Semantic Webâ (Halpin et al. 2011, 1). However, the problem of precise labelling is not so much a linked data or semantic web problem as it is a knowledge representation problem. âLeibnitzâs Lawâ states that if x and y are identical then they must share all of the same properties. By the same logic, if all properties are not shared between x and y, then x and y are not identical. Debates surrounding the gaps in Leibnitzâs logic have raged since its inception, most popularly refuted with the principle of change over time (e.g. Is 5 year old Abi the same person as 25-year-old Abi?). For the first time, however, this problem is being encountered by a surge of people trying to âindependently knit their knowledge representations together using the same standardized languageâ (Halpin et al. 2010, 1). Within this disparate environment, owl:sameAs ends up used in ways that are âmutually incompatible [and] almost always violate the rather strict logical semantics of identityâ (Halpin et al. 2010, 1). Although H3 frame this issue as one rooted in precise labelling, it seems more a question of establishing a culture that promotes responsible interlinking and thoughtful digital collaboration.
In light of the systemic racism and misogyny manifest not only in the latest US Elections but ever present in the ways we build, access, and navigate the world wide web (McPherson), the question of responsibility is central if we are to address issues of identity and representation within the semantic web. Although work on improving the sameAs problem for the sake of linked data has already begun, the issue of conflating identity with representation within the linked data community continues to persist. Given the current information landscape, digital humanities and (post-)colonial researchers will need a seat at the Linked Open Data table if we are to succeed in working towards representing the discursive nature of identity on the Semantic Web and labelling practices that are as thoughtful as they are accurate.Â
Works Cited
Halpin, Harry, Ivan Herman, and Patrick Hayes. âWhen owl:sameAs isnât the Same: An Analysis of Identity Links on the Semantic Web.â RDF Next Steps Workshop, June 26-27, 2010. Palo Alto, USA.
Halpin, Harry, et al. "When owl:sameAs isnât the Same: An Analysis of Identity in Linked Data." The 10th International Semantic Web Conference, October 23-27, 2011. Berlin, Germany.
McPherson, Tara. "Why are the Digital Humanities so white? Or Thinking the Histories of Race and Computation." Debates in the Digital Humanities (2012): 139-160.
n. a. âowl:sameAs.â W3C. Last updated November 2009. <https://www.w3.org/TR/owl-ref/#sameAs-def>.
Photo: Hayley Mills in Parent Trap (1961)
0 notes
Text
What does Textual Scholarship have in common with the Semantic Web?
A reading of James Smithâs âWorking with the Semantic Webâ from the newly published collection of essays, Doing Digital Humanities (2016)

Some context: James Smith is a Lead Software Engineer (Kit Check) who also teaches the RDF and Linked Open Data (LOD) course at the Digital Humanities Summer Institute in Victoria (which Iâve had the pleasure of attending this past summer). I came across this chapter on a syllabus designed for a LOD directed reading group Iâm involved in and wanted to share a few half-baked observations.
Smith begins his chapter by way of analogy,
The Semantic Web and Linked Data are computational applications of existing scholarly practices: linking to primary and secondary sources, signalling trusted vocabularies and authorities, and positioning a work in a larger conversation. (loc. 6650)1
For many textual scholars, this is a welcomed site: a warm invitation. We know analogy. We understand that analogy works as a powerful narrative tool. And we know when weâre about to be told a good story. Upon arrival, the text signals a comparative framework, a bond Smith continues to return to as he guides readers through what is, for most textual scholars, the strange new world of working not just on, but with, the semantic web. For the purposes of this reading, rather than provide a comprehensive overview Iâd like to instead focus on two crucial moves Smith makes in this chapter.
First, Smith reviews the basic mechanics of how textual scholarship works. To do this, he uses the following example: âThe new sovereign has achieved self-determinationâ (loc. 6667). With a little pressure, this sentence cracks under the ambiguity of âsovereignâ (which sovereign?) and âself-determinationâ (what self-determination?), and we, as well trained textual scholars, feel the lack of historical context â of reference. Interestingly, Smith works from an electronic text default, drawing on the function of hyperlinks in digital scholarship before turning to Franco Morettiâs printed chapter in Distant Reading as an example of âintra-textual referencingâ, or, what Smith would call âcrude hyperlinkingâ (loc. 6667, 6679). Iâve reproduced Morettiâs excerpt here:
The new sovereign â ab-solutus, united, freed from the ethics-political bonds of the feudal tradition â has achieved what Hegel will call âself-determinationâ: he can decide freely, and thus post himself as the new source of historical movement: as in the Trauspiel, and Gorboduc, and Lear, where everything indeed begins with his decision; as in Racine, or La Vida es SueĂąo. (qtd. in loc 6679)
Next to the efficiency of hyperlinking, Morettiâs list of references, notes, and notes on references seem wild and dizzying. Necessarily restricted by the technology of print, Moretti âlinksâ to the particular definitions of âsovereigntyâ he has in mind and inserts a brief description of his take on Hegelâs use of âself-determination.â But why the context overload? Surely thereâs such a thing as providing too much context. As Smith is quick to point out, what Moretti is doing with this rudimentary âlinkingâ is ensuring that the reader âdoesnât need to follow the âlinkââ (loc. 6667). With hyperlinks, thereâs always a chance that readers will get lost as they go off and explore the contextual crumbs. But consider the print reader who has left a book to go follow a tempting footnote and fetch a referenced text from the library. The print readersâ chances of return are far less likely when compared to electronic readers â or, perhaps more crucially, the chances of setting down a book in order to seek out the referential thread in the first place seems even less feasible. Instead, as Smith points out, the kind of âlinkingâ seen in Morettiâs chapter works to signal to the reader that he âtrustsâ Hegelâs vocabulary (people who know something of Linked Open Data start grinning here) and conveys a sense of âalignmentâ between Morettiâs language and Hegelâs, indeed King Learâs, as Morettiâs writing becomes, to draw on Smithâs language, âinformedâ by the literature heâs referencing (loc. 66698). Remember, Smith reminds, âAs we read a text, we bring all the material we have encountered beforeâ (loc. 66698).
Second, Smith introduces this concept of âat least one.â The âat least oneâ concept goes as follows: A textual scholar, letâs use Moretti again, mentions a set of literature âwith the hope that we will have read at least oneâ (Smith, loc. 66698). If the mission is to make a connection, what Moretti needs is for us, the reader, to have read one -- just one. At first, the language here seems almost exacerbated (âHave the decency to come to class having read at least one of your readings.â Silence. âO come now, at least one!â). In fact, Smith repeats âhopeâ and âat least oneâ twice in one paragraph when referring to this desire to connect over a shared reference. Like computers, a human reader scans the information, eyes moving swiftly across familiar words, logs the connections away, and moves on. If nothing looks familiar, however, the reader stalls (perhaps over a wave of curiosity, or, less preferably, renewed anxiety). Machines donât waste their time feeling anxious: if the information doesnât look familiar, they give up. This shared reference becomes central to Smithâs guide to working on the semantic web, building on his connection to scholarly reading: âIt is critical that the scholar read far and wide in their career: the greater the shared background, the more efficient the communicationâ (loc. 66698).
Scaling back from the macroscopic fantasy of âwideâ reading, Smith returns to the bread and butter of textual scholars: close reading. This return is only to strengthen the natural tie he has been asserting this entire chapter, one between textual and computer science scholars. âThe act of making as many connections as possible between the text and what we know,â Smith writes, â is the essence of close readingâ (loc. 66698). This essential connection between linking and close reading, Smith goes on to explain, is why textual scholars find themselves apart of âone of the defining fields in the digital humanitiesâ (loc. 66698).
The rest of Smithâs chapter walks through the basics of structuring information, representing information, vocabularies, relationships, using linked data, and publishing linked data.2 The bulk of the heavy lifting, however, what I would underline as the driving force of this piece, has already been worked out in the first half-dozen pages. To avoid any ambiguity â ever the responsible computer scientist â Smithâs argument becomes fully articulated near the end of his chapter, under the very appropriate SUMMARY heading:
It is by bringing to our computational work the practices of our scholarly work that we elevate the digital side of digital humanities to be equal with the traditional humanities scholarship practices. (loc. 6944)
Refreshingly, Smith departs from approaches that urge humanities scholars to take on the praxis and language of scientific methodology.3 Instead, Smith asks what textual scholarship can bring to this kind of work with the semantic web and gestures towards a model of scholarship that is strengthened by this process of coming together, one that is necessarily â and, as Smith would argue, inherently â Â open to collaborative and cross-disciplinary work.4
Footnotes
1 The âloc. xxxxxâ identifiers work in lieu of page numbers and refer to places within the Kindle edition of this text. 2 To the curious and anxious students of linked data: keep reading. Smithâs gives an accessible and concise overview on how to transform textual information, what readers will soon call a âdataset,â into published, linked data. Though there are moments where readers who are eager to get their hands dirty are left hanging for further instruction, Smith is quick to provide an abundance of links to projects and resources peppered throughout in the form of footnotes, hyperlinks, as well as the inclusion of a Further Readings section.
3Â See John Unsworthâs The Importance of Failure, see Franco Morettiâs Conjectures on World Literature.
4Â See Susan Brown and John Simpsonâs, along with CWRC Project Team and INKE Research Groupâs, An Entity By Any Other Name: Linked Open Data as a Basis for a Decentred, Dynamic Scholarly Publishing Ecology.
Works Cited
Brown, Susan, and John Simpson. "An Entity By Any Other Name: Linked Open Data as a Basis for a Decentered, Dynamic Scholarly Publishing Ecology." Scholarly and Research Communication 6.2 (2015).
Moretti, Franco. "Conjectures on World Literature." New Left Review 1 (2000): 54â 68.
Smith, James. âWorking with the Semantic Web.â In Compton, Lane, and Siemens (eds.) Doing Digital Humanities. Routledge, 2016.
Unsworth, John. âThe Importance of Failure.â Journal of Electronic Publishing (1997).
Photo credit: michael podger via Unsplash
0 notes
Text
SO MANY POTTER PUNS. So many.
Harry Potter Books Head To Kindle And Nook After Pottermore Suffers Cruciatus Curse

Whether you side with Hufflepuff or Slytherin, it sucked that you could only get Harry Potter ebooks from a special service called Pottermore⌠until now. The company has is now selling these books for $9 on the Kindle and Nook and for a very interesting reason: the goblins at Gringotts realized Pottermore didnât have much money left.
Harry Potter ebooks have long been part of Kindle Unlimited but this move opens ebooks sales to new services.
Pottermore launched in 2012 as a digital home for all things Potter. Interestingly, its biggest parter was Sony, then quite active in the reader space. As The Digital Reader points out, the company licensed the branded, offered a Harry Potter-themed area in Playstation Home and gave away free Harry Potter ebooks with the Sony Reader.
Old Draco Malfoy must have gotten a Confundus Charm brewing over there because, after a while, Sony realized that it was hopeless to stem the tide of bigger ereaders and content plays. Sales dropped in March 2015 from ÂŁ24.8 million to ÂŁ7 million. The site suffered a loss of ÂŁ6 million in 2015.
So give old Dobby his sock and strap on your House Cup because now you and yours can read Harry almost anywhere, including the Chamber of the Back Seat of the Minivan.
4 notes
¡
View notes
Text
Boy does she know how to hit the nail on the head.
REBECCA SOLNIT: ART MAKES THE WORLD, AND IT CAN BREAK US
MEN EXPLAIN LOLITA TO ME

I sort of kicked the hornetsâ nest the other day, by expressing feminist opinions about books. It all came down to Lolita. âSome of my favorite novels are disparaged in a fairly shallow way. To read Lolita and âidentifyâ with one of the characters is to entirely misunderstand Nabokov,â one commenter informed me, which made me wonder if thereâs a book called Reading Lolita in Patriarchy.Â
The popular argument that novels are good because they inculcate empathy assumes that we identify with characters, and no one gets told theyâre wrong for identifying with Gilgamesh or even Elizabeth Bennett. Itâs just when you identify with Lolita youâre clarifying that this is a book about a white man serially raping a child over a period of years. Should you read Lolita and strenuously avoid noticing that this is the plot and these are the characters? Should the narrative have no relationship to your own experience? This man thinks so, which is probably his way of saying that I made him uncomfortable.
READ MORE
7 notes
¡
View notes
Link
I know Iâm late to the game, but I just find this outrage from DA fans way too book history-ish to ignore.Â
After a series of Kindle Fire ads in which cast members are shown using the device while enjoying a break on set (in full costume), fans have turned to Twitter in protest, claiming the Kindle ads âkill the magicâ of the show. Thereâs a fear that viewers will become less invested in the storylineâs characters the moment the mechanics of the show are revealed. That once their beloved characters are revealed to be actors who lead modern lives, it will prove impossible to unsee. More interestingly, some fans blame Amazonâs sponsorship as responsible for the scene in which Lady Edith burns her paperback which causes Downtown Abbey to go up in flames. On the one hand, the conspiracy plot makes sense. DA is seen as advocating for the death of the printed book, whose flammable materiality causes human harm, and usher viewers to get rid of their dangerously outdated books. Cue the phoenix-like rebirth of the Kindle Fire. However, the scene can of course be read to the opposite effect. To burn oneâs paper books is reckless and causes human harm.Â
I find myself hoping for something in between: explore the many wonders of the eReader, yes, but donât burn your books quite yet -- thereâs room for both.Â
2 notes
¡
View notes
Photo

My latest project considers the discourse surrounding e-readers and points to a âlanguage of loveâ used by readers to articulate an attachment to the codex form, a language that perhaps signals the emotional investment theyâre prepared to make. Thereâs been a notable push to market ebook as more than just a short fling, trying to convince readers that they too can provide the long-term comfort and security a printed book brings. Companies like Kindle and Kobo have already caught on to this and the ads theyâve released are as disturbing as they are telling. This particular document demonstrates the importance placed in leaving a human mark on a document as of crucial importance to readers if they are to invest time and energy exploring the technology at hand. As far as Iâve read, this note (whose author, apart from the initials JG, is unknown) was found tucked away in a book and has circulated the www since its discovery.
4 notes
¡
View notes