Overview

This EmacsConf 2023 talk demonstrates a collaborative workflow for research data processing and documentation in Emacs. The speakers use Org mode as the central document format and combine it with companion packages for knowledge graph visualization, literate programming, and collaborative editing.

Starting from the National Research Data Infrastructure Germany (NFDI), the talk shows how to retrieve information from Wikidata, clean and process it with different programming languages, visualize relationships, and preserve the work through exportable documentation.

Topics covered

  • Org mode as a plain-text environment for scientific writing, organization, and publishing
  • org-roam and org-roam-ui for linking notes and visualizing a knowledge graph
  • org-babel for literate programming and self-documenting code
  • SPARQL queries against Wikidata
  • Data cleaning and processing with shell, Python, awk, and R
  • Collaborative editing in Emacs using CRDT
  • Exporting Org documents to formats such as PDF, HTML, and plain text

Speakers

Jonathan Hartman is a trained data scientist and works at the IT Center of RWTH Aachen University, Germany.

Lukas C. Bossert is a trained classical archaeologist and deputy head of the department “Research Process and Data Management” at the IT Center of RWTH Aachen University.

Chapter markers

  • 00:00 — Introduction
  • 01:16 — Org Mode
  • 02:18 — Working together
  • 06:27 — Data cleaning
  • 08:04 — Processing
  • 12:36 — Visualization
  • 14:01 — Preserve

Resources

Related Posts

Call for participation

Call for participation

Call for participation!

The Data Literacy Center Rhine-Ruhr (DKZ.2R) issues a call for participation in its “rent-an-expert” project! We offer support for ambitious research projects of PhD students and early postdocs dealing with Data Science and Artificial Intelligence, High Performance Computing and Simulation, and Research Data Management. As the DKZ.2R is funded by the German Federal Ministry of Education and Research (BMBF) as well as the EU, this offer is free of charge!

Read More
A Survival Guide to Research Data Sharing Services in the Rhine-Ruhr Region

A Survival Guide to Research Data Sharing Services in the Rhine-Ruhr Region

A Survival Guide to Research Data Sharing Services in the Rhine-Ruhr Region

There are a lot of reasons why collaborating with other researchers on scientific projects is great! It provides new perspectives and gives you the chance to benefit from other people’s knowledge and input. When it comes to sharing and exchanging data across multiple locations and devices however, researchers are often disoriented and don’t know which tools, cloud services and so on are safe to share data in a secure and ethical way.

Read More
Documentation From User Experience

Documentation From User Experience

This post is a condensed version of a talk at our Data Compentcy College

If you regularly use scientific software written by others, or tried to replicate interesting research that relies on software, you have probably also invested weeks of work to solve a software problem or even given up on a software because of missing documentation. Finding a project that might be the solution to your problem and then failing to run the code is frustrating. Being unable to run a project you have built yourself years ago is even worse. Having experienced all those setbacks myself in the past I want to use this post to channel that frustration to fuel solutions for better documentation for our current and future projects.

Read More