From 973d5520503225ddace615b4b31f7e140d99c89c Mon Sep 17 00:00:00 2001
From: nwickel
Date: Mon, 8 Dec 2025 16:02:14 +0100
Subject: [PATCH] Update READMEs

---
 03_data/README.md | 15 ++++++++++++---
 README.md         | 14 +++++++++++---
 2 files changed, 23 insertions(+), 6 deletions(-)

diff --git a/03_data/README.md b/03_data/README.md
index 09f77dc..ae26359 100644
--- a/03_data/README.md
+++ b/03_data/README.md
@@ -88,15 +88,15 @@ removed. See `anonymization.R` for details.
 The anonymized data files are saved to `03_data/02_anonymized_data/` as CSV
 files with file names `HMC__anonymized.csv`.
 
-# Data preprocessing
+# Data cleaning
 
 After data anonymization, some more rudimentary preprocessing was done on the
 data with the script `03_data/02_anonymized_data/cleaning.R`. Especially, the
 original variable names in Qualtrics were harmonized so they all follow the same
 structure.
 
-The cleaned data files are saved to `03_data/03_cleaned_data/`as
-CSV files with file names `HMC__cleaned.csv`.
+The cleaned data files are saved to `03_data/03_cleaned_data/` as CSV files with
+file names `HMC__cleaned.csv`.
 
 The following section gives an overview of the problems in the data, that needed
 some cleaning.
@@ -119,6 +119,15 @@ some cleaning.
 * Three entries in wave 3: `subj1009`
 * We kept the first entry for each subject
 
+# Data preprocessing
+
+The final data preprocessing creates scales from the collected items. It was
+done in Python and the code for the preprocessing can be found in a separate
+code repository: https://gitea.iwm-tuebingen.de/HMC/preprocessing. The files
+with the final variables for each scale are then saved in the folder
+`03_data/04_preprocessed_data` as CSV files with file names
+`HMC__preprocessed.csv`.
+
 # TODOs
 
 * Add more preprocessing steps like variable renaming?
diff --git a/README.md b/README.md
index 18d4c68..a764f00 100644
--- a/README.md
+++ b/README.md
@@ -1,8 +1,9 @@
-Project: Human Machine Communication (HMC)
-================
+Longitudinal Dataset of Psychological Variables Influencing AI Perception and Usage
+====================================================
 
 This file contains documentation about a collaborative longitudinal study
-conducted at the Leibniz-Institut für Wissensmedien (IWM), Tübingen.
+conducted at the Leibniz-Institut für Wissensmedien (IWM), Tübingen, within the
+project Human Machine Communication (HMC).
 
 ## Abstract
 
@@ -74,6 +75,13 @@ The waves were collectec over the following dates:
  | | |-- HMC_wave4_cleaned.csv
  | | |-- HMC_wave5_cleaned.csv
  | | |-- HMC_wave6_cleaned.csv
+ | |-- 04_preprocessed_data
+ | | |-- HMC_wave1_preprocessed.csv
+ | | |-- HMC_wave2_preprocessed.csv
+ | | |-- HMC_wave3_preprocessed.csv
+ | | |-- HMC_wave4_preprocessed.csv
+ | | |-- HMC_wave5_preprocessed.csv
+ | | |-- HMC_wave6_preprocessed.csv
  | |-- HMC_codebook.xlsx
  | |-- item_reference.md
  | |-- README.md