„(Digital) Research and Metadata“
The presentation addresses what research data are and how they can be made available in the short, medium and long term (reusable). In particular, the role of metadata and data formats will be discussed.
Almost all (digital) data can potentially become research data. Once a scientific project decides to use data to gain knowledge, these data inevitably become research data. Research also collects a variety of data in surveys and experiments, or creates collections and editions of digital/digitised artifacts (corpora, editions) that are the basis for further research. Often software – from scripts to stand-alone programs – is also included when talking about data, and in some projects data and related tools are hardly conceptually separated.
Metadata is the name for data that describes other data. However, depending on the research question, a ‚metadatum‘ can itself become a variable in the research design – and thus a research datum.
In recent years, FAIR has become a much-noted acronym: data should be Findable, Accessible (have regulated access), Interoperable (use interoperable formats), and thus overall Reuseable.
In our talk we will exemplify different ways from (digital) data to research data to archived files. We also show that information can be stored in many different ways. These examples are meant to invite participants to get more involved with data (formats). We explain how(so) metadata is central to the reusability of research data.
In addition, we show that various formats are suitable and useful for concrete use, but less suitable for sharing or long-term archiving of data in terms of interoperability and sustainability. Finally, we briefly address the question of scripts and tools and the problems of making them available to other researchers in the medium and long term.
Bernhard Fisseni studied computational linguistics, German linguistics and older German literature as well as computer science for his MA, and obtained his PhD from the University of Duisburg-Essen for research in formal pragmatics.
He has worked on corpora and text technology mainly in the context of German linguistics and processing of mathematical language. This includes computational archeology on the Bonn corpus of Early new High German from the 1970s, but also digital editions (e.g., the corpus of Kant’s works, the edition of the archive of the Counts von Platen). He is now working at Leibniz Institute for the German Language for the CLARIAH-DE project concerned with building a European infrastructure for linguistic resources.
He worked extensively with speech corpora and was involved in the compilation of the Karl-Eberhard-Corpus. Since 2017 he has been working at the Leibniz Institute for the German Language in long-term preservation group, where he also works in subprojects of CLARIN, CLARIAH-DE and in the nestor network.
Anmeldepflichtige, internationale Webkonferenz (Zoom)
Organisiert vom Lehrstuhl für Geschlechtergeschichte der Friedrich-Schiller-Universität Jena gemeinsam mit dem Arbeitskreis Historische Frauen- und Geschlechterforschung e.V. und dem Digitalen Deutschen Frauenarchiv
Der Lehrstuhl für Geschlechtergeschichte der Friedrich-Schiller-Universität Jena veranstaltet gemeinsam mit dem Arbeitskreis Historische Frauen- und Geschlechterforschung e.V. und dem Digitalen Deutschen Frauenarchiv an den vier Freitagnachmittagen im Februar 2021 die digitale internationale Tagung „Digital Humanities and Gender History“.
Die Tagung möchte geschlechtergeschichtliche Aspekte der Geschichte des Digitalen und der Digital Humanities sowie die Anwendung von digitalen Methoden und Forschungsworkflows für geschlechtergeschichtliche Fragestellungen und Erkenntnisinteressen thematisieren. Gefragt werden soll nach den geschlechtergeschichtlichen Implikationen digitaler Methoden, Tools und Projekte ebenso wie nach den Möglichkeiten und Grenzen, Mehrwerten und Herausforderungen, die digitale Methoden für die Geschlechtergeschichte bieten.
Um die Zugangsdaten zu erhalten, melden Sie sich bitte bei firstname.lastname@example.org für die Veranstaltung an. Die vier Tagungsnachmittage im Februar bilden eine Einheit, eine Teilnahme an allen vier Terminen wäre daher wünschenswert.
Informationen, Anmeldung und Programm:
“Images as Data”
This talk tackles images as data in the digital humanities. This visual turn, enabled by a new set of computational methods such as image analysis using computer vision, foregrounds and disputes the logocentrism that has prevailed in digital humanities work over the decades. Abetted by a narrowly construed history of the field, textual computational work has often taken center stage, its longevity propelled by the centrality of particular disciplines to DH. We advocate for a broader role for the image, highlighting the potential of visual resources as the data for analysis and harnessing methods for analyzing images qua images.
The talk is organized around three case studies: (1) Distant Viewing Protest Photography, (2) Photogrammar, and (3) Seeing Incunabula: Text as Image. They demonstrate how computational image analysis combined with institutional commitments to digitization along with inter and trans-disciplinary questions can forge avenues of inquiry in the field. We argue that thinking of images as data opens up a more capacious configuration of sources, evidence, and methods in DH.
Carol Chiodo is the Librarian for Collections and Digital Scholarship at Harvard University Library. Along with her work as the Bibliographer for French, Italian and German, she specializes in the application of digital humanities methods for collection development, access and discovery. She is the principal investigator on Images as Data: Processing, Exploration, and Discovery at Scale, a Mellon-funded collaboration with the Distant Viewing Lab at the University of Richmond. She is the author of Dante’s Volume: From Alpha to Omega (Arizona Center for Medieval and Renaissance Studies Press, 2021). She is the chair of the 700th Anniversary Commemoration Committee for Dante Society of America. Past digital projects include The Yale Community Voices Archive and Dante at Hand. She received her PhD in Italian Language and Literature from Yale University.
Lauren Tilton is Assistant Professor of Digital Humanities in the Department of Rhetoric & Communication Studies and directs the Distant Viewing Lab (distantviewing.org) at the University of Richmond (USA). Her research focuses on analyzing, developing, and applying computational methods to the study of 20th and 21st century documentary expression and visual culture. She is director of Photogrammar (photogrammar.org), a digital public humanities project mapping New Deal and World War II documentary photography funded by the NEH and ACLS, and co-author of Humanities Data in R: Exploring Networks, Geospatial Data, Images and Texts (Springer, 2015). Her scholarship has appeared in journals such as American Quarterly, Digital Humanities Quarterly, and Digital Scholarship in the Humanities. The collaborative digital project Voice of a Nation is forthcoming from Stanford University Press, and the book Distant Viewing with Dr. Taylor Arnold is forthcoming from MIT Press. She serves on the Executive Council of ACH. She received her PhD in American Studies from Yale University.