Update READMEs

Nora Wickelmaier 2025-12-08 16:02:14 +01:00
parent 2aa6441eba
commit 973d552050
2 changed files with 23 additions and 6 deletions

@ -88,15 +88,15 @@ removed. See `anonymization.R` for details.
The anonymized data files are saved to `03_data/02_anonymized_data/` as
CSV files with file names `HMC_<wave>_anonymized.csv`.
# Data preprocessing
# Data cleaning
After data anonymization, some more rudimentary preprocessing was done on the
data with the script `03_data/02_anonymized_data/cleaning.R`. In particular,
the original variable names in Qualtrics were harmonized so they all follow the
same structure.
The cleaned data files are saved to `03_data/03_cleaned_data/`as
CSV files with file names `HMC_<wave>_cleaned.csv`.
The cleaned data files are saved to `03_data/03_cleaned_data/` as CSV files with
file names `HMC_<wave>_cleaned.csv`.
The following section gives an overview of the problems in the data that needed
some cleaning.
@ -119,6 +119,15 @@ some cleaning.
* Three entries in wave 3: `subj1009`
* We kept the first entry for each subject
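
The rule of keeping the first entry per subject can be illustrated with a
minimal sketch. This is not the code from `cleaning.R` (which is written in R);
it is a Python illustration only, and the column name `subject_id`, the
`response` values, and the second subject `subj2000` are hypothetical.

```python
def keep_first_entry(rows, id_key="subject_id"):
    """Return rows with only the first occurrence of each subject ID.

    `id_key` is a hypothetical column name, not taken from the data files.
    """
    seen = set()
    kept = []
    for row in rows:
        subject = row[id_key]
        if subject not in seen:
            seen.add(subject)
            kept.append(row)
    return kept


# Made-up example: three entries for one subject, one entry for another.
rows = [
    {"subject_id": "subj1009", "response": "a"},
    {"subject_id": "subj1009", "response": "b"},
    {"subject_id": "subj1009", "response": "c"},
    {"subject_id": "subj2000", "response": "d"},
]
deduplicated = keep_first_entry(rows)
```

The sketch relies on the rows already being in the order in which the entries
were recorded, so "first" means first in the file.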
# Data preprocessing
The final data preprocessing creates scales from the collected items. It was
done in Python and the code for the preprocessing can be found in a separate
code repository: https://gitea.iwm-tuebingen.de/HMC/preprocessing. The files
with the final variables for each scale are then saved in the folder
`03_data/04_preprocessed_data` as CSV files with file names
`HMC_<wave>_preprocessed.csv`.
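
As a rough illustration of what creating a scale from items can look like, the
sketch below averages a set of item columns into one scale score per subject.
It is an assumption-laden example, not the code from the preprocessing
repository: the item names (`trust_1` to `trust_3`), the scale name `trust`,
and the choice of the mean as aggregation are all hypothetical, and the actual
scales may be built differently (for example with sums or reverse-coded items).

```python
from statistics import mean


def add_scale_score(rows, scale_name, item_names):
    """Return copies of rows with a scale score added.

    The score is the mean of the non-missing items; it is None if all
    items are missing. All names passed in are hypothetical examples.
    """
    scored = []
    for row in rows:
        values = [
            float(row[item])
            for item in item_names
            if row.get(item) not in (None, "")
        ]
        new_row = dict(row)
        new_row[scale_name] = mean(values) if values else None
        scored.append(new_row)
    return scored


# Made-up rows as they might be read from a cleaned CSV file (all strings).
rows = [
    {"subject_id": "subj0001", "trust_1": "4", "trust_2": "5", "trust_3": ""},
    {"subject_id": "subj0002", "trust_1": "", "trust_2": "", "trust_3": ""},
]
scored = add_scale_score(rows, "trust", ["trust_1", "trust_2", "trust_3"])
```

How missing items are handled is a design decision that affects the resulting
scores; here they are simply skipped, which is only one of several options.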
# TODOs
* Add more preprocessing steps like variable renaming?

@ -1,8 +1,9 @@
Project: Human Machine Communication (HMC)
================
Longitudinal Dataset of Psychological Variables Influencing AI Perception and Usage
====================================================
This file contains documentation about a collaborative longitudinal study
conducted at the Leibniz-Institut für Wissensmedien (IWM), Tübingen.
conducted at the Leibniz-Institut für Wissensmedien (IWM), Tübingen, within the
project Human Machine Communication (HMC).
## Abstract
@ -74,6 +75,13 @@ The waves were collectec over the following dates:
| | |-- HMC_wave4_cleaned.csv
| | |-- HMC_wave5_cleaned.csv
| | |-- HMC_wave6_cleaned.csv
| |-- 04_preprocessed_data
| | |-- HMC_wave1_preprocessed.csv
| | |-- HMC_wave2_preprocessed.csv
| | |-- HMC_wave3_preprocessed.csv
| | |-- HMC_wave4_preprocessed.csv
| | |-- HMC_wave5_preprocessed.csv
| | |-- HMC_wave6_preprocessed.csv
| |-- HMC_codebook.xlsx
| |-- item_reference.md
| |-- README.md