#WDPD2020 | Explore Tumblr posts and blogs

pittarchives · 5 years ago

Text

Photo Restoration Live Streams: A Pedagogical Practice

In celebration of World Digital Preservation Day 2020 on November 5, we’re sharing a series of posts by University of Pittsburgh Library System librarians and archivists that highlight their expertise and work to preserve the digital!

This post was written by Dan Kaple, Digital Creation Specialist for the University of Pittsburgh Library System

As a unit within Digital Scholarship Services, The Digital Stewardship Lab has a tradition of providing hands-on access to digitization tools for members of the University of Pittsburgh community. We provide patrons the necessary training and access to both 2D and 3D digitization tools as well as one-on-one project consultation. One of the more common projects we have assisted with is personal archiving. Patrons have used the Stewardship Lab to scan materials from their own personal family histories with the purpose of preserving these for future generations.

With the onset of COVID-19, the Stewardship Lab had to suspend its hands-on services and, as a result, find new ways of supporting and providing instruction for digital creation and digital preservation. One way we have done this is to produce a series of live stream events. These live streams present a project-oriented workspace with the intention of demystifying the digital creation tools and exposing users to the workflow and problem-solving processes.

One of our most popular segment topics is photo restoration. Given our instructional focus, we view photo restoration as a pedagogical activity and have found that it presents us with several opportunities. Working on photo restoration projects is a good way to show digital creation tools at work; to explore the thought process that goes into making specific problem-solving solutions, and from a digital preservation perspective, it provides an opportunity to explore photo restoration best practices. During live streams we discuss how to work non-destructively on our photographs in Photoshop. We demonstrate tips and tricks for fixing particularly difficult cases of image damage or staining. We also discuss more conceptual issues such as the ethical line between photo restoration and photo enhancement or manipulation.

Our goal is to get attendees excited and interested in restoration, leading to broader conversations about digital creation and digital preservation and how both are essential for academic research as well as historic and cultural conservation. We want attendees to consider how that photograph of Great Grandma can tell us a lot about broader issues concerning society, culture, and class during her lifetime; to recontextualize these family relics and imbue them with greater value. We want them to think about how digital preservation is relevant in their own lives. To start thinking about the long-term care of modern, born-digital materials, our social media feeds, and online repositories, so we can reevaluate them as the cultural relics of the future.

#wdpd2020 #digital preservation #photography #archives #restoration #family history #personal digital archives

7 notes · View notes

pittarchives · 5 years ago

Text

5 Tips for Preserving Your Data Long-Term

This post was written by Dominic Bordelon, Research Data Librarian

Like academics everywhere, at the University of Pittsburgh we hope to make valuable contributions to our fields through our publications, which we can expect will outlive us. More recently, thanks to new technological possibilities, we turn our attention to how other research outputs, such as data and software code, can also be stored for posterity.

How can you get started? Here are five tips from Pitt Libraries that you can begin using right away.

1. Use open file formats

Open file formats are those which are widely adopted, well documented, and unhindered by proprietary restrictions which monopolize the creation, editing, or reading of files. These are formats like CSV (comma-separated value) for tabular data, plain text for qualitative data (.txt), or PNG (Portable Network Graphics) for images. Proprietary formats tend to create a barrier to access and may even face obsolescence should the vendor go out of business. These factors have a negative influence on the probable longevity of the files’ contents.

For example, users of IBM’s SPSS statistical software will be familiar with .sav files for their data and analyses. However, .sav is a binary format rather than a character-based one, unreadable without special software (such as SPSS). Nor has IBM published official documentation for community use. Consider instead (or in addition) depositing a version of your data in CSV format, which should be easily readable to any future users.

To find out more about the preservability of the file formats you use, and to see which are recommended, you can see the Library of Congress’ Recommended Formats Statement.

"Keat takes notes" by geekcalendar is licensed under CC BY 2.0

2. Describe and annotate your dataset

In order for your data to be useful in the future, readers will need to be able to make sense of it. Data does not usually explain itself. What does the abbreviation in this column name mean? If an instrument was used to record your data, what model? What steps did you follow in your lab to run the experiment? The answers to these questions have important implications for researchers who want to replicate your study or integrate your data in a new study of their own.

There are several ways you can describe the important context around your data:

A detailed abstract in your data depository, and completion of all appropriate metadata fields

Data dictionaries and codebooks which describe column names and values

Documentation of your research protocols (perhaps with a tool like protocols.io)

3. For software, document your dependencies and computing environment

When you run code, it’s important to know what needs to be installed for it to work properly. Which version of Python did you use? If you used a library like Astropy in Python or osmdata in R in your analysis, what version of the library did you use? Without this information, it might be difficult—or even impossible—for future users to run your code, and for them to be confident that they are running it as intended. You can do this with a text file, but look also at tools like Docker (or the Dockter project for researchers specifically) to containerize and document your environment.

4. Deposit your data in a trustworthy repository

When choosing a data repository, consider how it is maintained and whether they seem to have plans for the future. You can find much of this information in their about pages. For example, is the repository run at a large research institution by a team of dedicated staff, or, at the other extreme of that spectrum, is it a lone researcher’s side project? Do you trust that the repository, or at least its owner, will exist in ten or twenty years? If run by a private company, does it seem well-established with many ties to the academic community? Do persistence and preservation seem to be high priorities for the repository? While other factors might affect one’s choice of repository, we should hold this sense of “trustworthiness” high on the list.

CoreTrustSeal is an organization that certifies research data repositories as trustworthy, i.e., apparently sustainable and stewardship-oriented. Checking their list of repositories is a safe bet. If the repository in question is not CoreTrustSeal certified, your local data librarians (for example, at Pitt, the ULS Digital Scholarship Services team and the HSLS Data Services team) can help you evaluate the repository.

"elephant ears." by brittanyhock is licensed under CC BY-NC 2.0

5. Dark archive your dataset in your institutional repository

Sharing is great, but preservation is important too. The practice of “dark archiving” is simply depositing material in a nonpublic repository, for purely preservationist purposes. If you are planning to share your data in an open repository, consider also investigating whether your institution has a repository where you could dark archive an additional copy. The idea is that, should the open repository eventually fail, the data could still be restored from the dark archive, and then pointers to the open deposit such as DOIs could be redirected to the restored copy.

Why dark? If your dataset is hosted in multiple places online, some users might find it confusing, especially without knowing any rationale. Intellectual property ownership may also be unclear. Furthermore, the user may reasonably wonder whether the two copies are truly identical.

Your institution may not advertise a “dark archive,” but look instead for your general institutional repository, such as Pitt’s D-Scholarship.

Let me know how these tips work for you. Happy preserving!

“5 Tips for Preserving Your Data Long-Term” by Dominic Bordelon is licensed under Creative Commons Attribution-ShareAlike 4.0 (https://creativecommons.org/licenses/by-sa/4.0/).

#WDPD2020 #researchdata #digital preservation #digitalscholarship #datasets #digital archives

4 notes · View notes

pittarchives · 5 years ago

Text

Capturing Pitt’s Response to COVID-19

This post was written by Zach Brodt, University Archivist & Records Manager

The University Archives is tasked with preserving records that document the administrative, academic, and research activities of the University of Pittsburgh; however, the COVID-19 pandemic has shifted much of this work online and necessitated an increase in planning activities due to ever-changing situations caused by the coronavirus. To properly document the impact COVID-19 has had on the University community, the Archives & Special Collections (A&SC) department quickly acted to begin collecting digital files that showed the effects of the pandemic on Pitt as they were happening.

In April, the A&SC developed a page on their library’s website asking students, faculty, and staff to upload electronic files that indicate how the pandemic impacted their life at Pitt. For students, this could be messages from professors or administrators discussing how their classes changed during the Spring 2020 semester, photographs, or personal artwork or journals that address their experiences. Faculty could share syllabi that have evidence of how their courses changed during Spring 2020 or how the Flex@Pitt hybrid teaching model was implemented in Fall 2020. Some courses even used data from the pandemic as part of their assignments. Donors were encouraged to supply their contact information and a description of the files to aid in access to the materials.

The other means of data collection is a survey, also linked on the web page, in which the individual can provide information about their affiliation to the University and a reflection or experience relating to the pandemic rather than individual files.

The top screenshot of coronavirus.pitt.edu is the website as captured in July compared to how it appeared on October 1.

In addition to submissions from donors, University Archivist Zach Brodt downloads emails and documents distributed to the University community by senior administrators as the situation continues to develop. The University Archivist also utilizes the department’s Archive-It subscription to crawl University of Pittsburgh websites that are integral to the dissemination of information to Pitt faculty, staff, and students. This includes Pitt’s coronavirus.pitt.edu site, which provides a jumping off point to all COVID-related information at the University, including policies and news stories.

To ensure that both internal and external messages are captured, the archives also acquires content directly from the University’s communications office. These files include graphics used on campus signage, messages distributed by administrators, social media content, and other documents. Together, these digital files will form the majority of the Pitt COVID-19 Archive. Their long-term preservation is important to future understanding of how the University was impacted by the pandemic.

#wdpd2020 #digital preservation #webarchiving #h2p #covid19 #archives #emailarchive #digital archives

1 note · View note