# Anonymization and data ethics method

This dashboard is designed for teaching preparation and cohort-level diagnosis. It should not be used to evaluate, rank, or identify individual students.

## What was removed

The source questionnaire contained direct identifiers and other potentially identifying fields. The dashboard excludes:

- Names
- Registration numbers
- Email addresses
- Contact details
- Individual timestamps
- Raw open-ended responses
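
As a rough illustration of this exclusion step, the sketch below drops identifying fields from a single response record. The field names (`name`, `registration_number`, and so on) are hypothetical, since the actual questionnaire column names are not documented here, and this is not necessarily how the dashboard's own pipeline was written.

```python
# Hypothetical field names; the real questionnaire columns may differ.
EXCLUDED_FIELDS = {
    "name",
    "registration_number",
    "email",
    "contact_details",
    "timestamp",
    "open_response_raw",
}


def strip_identifiers(record: dict) -> dict:
    """Return a copy of the record without any excluded field."""
    return {key: value for key, value in record.items() if key not in EXCLUDED_FIELDS}


# Made-up example record, not real survey data.
raw = {
    "name": "Example Student",
    "email": "student@example.org",
    "timestamp": "2024-01-01T09:00:00",
    "likert_confidence": 4,
    "knowledge_score": 7,
}
print(strip_identifiers(raw))
# {'likert_confidence': 4, 'knowledge_score': 7}
```

Filtering against an explicit exclusion set keeps the rule easy to audit. A stricter variant would use an allow-list of retained fields, so that any new column is dropped by default.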

## What was retained

The dashboard retains only information needed for aggregate teaching design:

- Synthetic respondent IDs from P001 to P095
- Boolean prior-experience indicators
- Likert scores
- Knowledge-check scores
- Misconception flags
- Theme-coded open-response categories
- Cohort-level summaries

## Open-text response handling

Raw open-text answers are not displayed. They were converted into themes such as recruitment, bias, privacy, employer branding, learning hopes, and practical AI tools. This reduces the risk that a student could be identified through writing style, personal references, or unusual phrasing.
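
Rule-based theme coding of this kind can be sketched as simple keyword matching. The theme names below come from the list above, but the keywords, the `THEME_KEYWORDS` mapping, and the `code_themes` function are illustrative assumptions rather than the rules actually used for the dashboard.

```python
# Theme names are from this document; the keywords are illustrative assumptions.
THEME_KEYWORDS = {
    "recruitment": ["recruit", "hiring", "candidate"],
    "bias": ["bias", "unfair", "discriminat"],
    "privacy": ["privacy", "personal data", "consent"],
    "employer branding": ["brand", "reputation"],
    "learning hopes": ["hope to learn", "want to learn", "curious"],
    "practical AI tools": ["chatbot", "tool", "automation"],
}


def code_themes(text: str) -> list[str]:
    """Return every theme whose keywords appear in the response (case-insensitive)."""
    lowered = text.lower()
    return [
        theme
        for theme, keywords in THEME_KEYWORDS.items()
        if any(keyword in lowered for keyword in keywords)
    ]


# Made-up response used only to show the shape of the output.
print(code_themes("I worry about bias when hiring with a chatbot."))
# ['recruitment', 'bias', 'practical AI tools']
```

Only the returned theme labels would be stored; the raw text is discarded after coding, which is what keeps writing style and personal references out of the dashboard.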

## Synthetic IDs

Respondent IDs are synthetic. They should not be interpreted as registration numbers or as an indication of response order, and they should not be used for administrative purposes. They exist only to make the anonymized table easier to filter and inspect.
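
One way to generate such IDs is to create the labels P001 to P095 and shuffle them before assigning them to rows, so that an ID carries no information about submission order. This is a minimal sketch under that assumption; the document does not state how the IDs were actually assigned, and `assign_synthetic_ids` is a hypothetical helper.

```python
import random


def assign_synthetic_ids(n_respondents: int, seed=None) -> list[str]:
    """Return one synthetic ID per source row, in shuffled order.

    Element k of the result is the ID assigned to the k-th source row.
    """
    ids = [f"P{i:03d}" for i in range(1, n_respondents + 1)]
    # Shuffle so that ID order does not mirror the original row order.
    random.Random(seed).shuffle(ids)
    return ids


ids = assign_synthetic_ids(95)
print(len(ids), min(ids), max(ids))
# 95 P001 P095
```

Leaving `seed` unset means the mapping cannot be regenerated later; discarding the mapping after the IDs are applied prevents re-linking IDs to the source rows.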

## Limits

The theme coding is rule-based and intended for workshop planning. It is not a high-stakes qualitative coding exercise. The correlations shown in the dashboard are exploratory associations and do not imply causation.

## Recommended use

Use the dashboard to:

- Adjust workshop emphasis
- Identify student support needs
- Compare cohort readiness against the literature
- Build a data-informed opening segment
- Explain responsible survey analytics

Do not use the dashboard to:

- Identify individual students
- Grade students
- Infer sensitive traits
- Make administrative decisions
- Publicly release raw data
