Update READMEs

This commit is contained in:
Nora Wickelmaier 2025-12-08 16:02:14 +01:00
parent 2aa6441eba
commit 973d552050
2 changed files with 23 additions and 6 deletions

View File

@ -88,15 +88,15 @@ removed. See `anonymization.R` for details.
The anonymized data files are saved to `03_data/02_anonymized_data/` as The anonymized data files are saved to `03_data/02_anonymized_data/` as
CSV files with file names `HMC_<wave>_anonymized.csv`. CSV files with file names `HMC_<wave>_anonymized.csv`.
# Data preprocessing # Data cleaning
After data anonymization, some more rudimentary preprocessing was done on the After data anonymization, some more rudimentary preprocessing was done on the
data with the script `03_data/02_anonymized_data/cleaning.R`. Especially, data with the script `03_data/02_anonymized_data/cleaning.R`. Especially,
the original variable names in Qualtrics were harmonized so they all follow the the original variable names in Qualtrics were harmonized so they all follow the
same structure. same structure.
The cleaned data files are saved to `03_data/03_cleaned_data/`as The cleaned data files are saved to `03_data/03_cleaned_data/` as CSV files with
CSV files with file names `HMC_<wave>_cleaned.csv`. file names `HMC_<wave>_cleaned.csv`.
The following section gives an overview of the problems in the data, that needed The following section gives an overview of the problems in the data, that needed
some cleaning. some cleaning.
@ -119,6 +119,15 @@ some cleaning.
* Three entries in wave 3: `subj1009` * Three entries in wave 3: `subj1009`
* We kept the first entry for each subject * We kept the first entry for each subject
# Data preprocessing
The final data preprocessing creates scales from the collected items. It was
done in Python and the code for the preprocessing can be found in a separate
code repository: https://gitea.iwm-tuebingen.de/HMC/preprocessing. The files
with the final variables for each scale are then saved in the folder
`03_data/04_preprocessed_data` as CSV files with file names
`HMC_<wave>_preprocessed.csv`.
# TODOs # TODOs
* Add more preprocessing steps like variable renaming? * Add more preprocessing steps like variable renaming?

View File

@ -1,8 +1,9 @@
Project: Human Machine Communication (HMC) Longitudinal Dataset of Psychological Variables Influencing AI Perception and Usage
================ ====================================================
This file contains documentation about a collaborative longitudinal study This file contains documentation about a collaborative longitudinal study
conducted at the Leibniz-Institut für Wissensmedien (IWM), Tübingen. conducted at the Leibniz-Institut für Wissensmedien (IWM), Tübingen, within the
project Human Machine Communication (HMC).
## Abstract ## Abstract
@ -74,6 +75,13 @@ The waves were collectec over the following dates:
| | |-- HMC_wave4_cleaned.csv | | |-- HMC_wave4_cleaned.csv
| | |-- HMC_wave5_cleaned.csv | | |-- HMC_wave5_cleaned.csv
| | |-- HMC_wave6_cleaned.csv | | |-- HMC_wave6_cleaned.csv
| |-- 04_preprocessed_data
| | |-- HMC_wave1_preprocessed.csv
| | |-- HMC_wave2_preprocessed.csv
| | |-- HMC_wave3_preprocessed.csv
| | |-- HMC_wave4_preprocessed.csv
| | |-- HMC_wave5_preprocessed.csv
| | |-- HMC_wave6_preprocessed.csv
| |-- HMC_codebook.xlsx | |-- HMC_codebook.xlsx
| |-- item_reference.md | |-- item_reference.md
| |-- README.md | |-- README.md