Database Integration for Mexican Youth Health Surveys by Pavel AguilarDatabase Integration for Mexican Youth Health Surveys by Pavel Aguilar

Database Integration for Mexican Youth Health Surveys

Pavel Aguilar

Pavel Aguilar

Database integrated with real information taken from 8 surveys with public data on Mexican youth. I focused only on ages 10 to 14, since certain data are only available in that age range. The database is created from:
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua – 2023.
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua Guanajuato – 2022.
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua Nuevo León – 2022.
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua – 2022.
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua – 2023.
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua Guanajuato – 2022.
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua Nuevo León – 2022.
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua – 2022.
Most columns of the same type were joined (for example, to create physical_violence, house_rules and psychological_violence), and many variables were binarized to slightly reduce the strong data imbalance. All were remapped for better interpretation:
Binary columns smokes, e_cig_smoking, alcohol_use, diabetes, hypertension, cholesterol, triglycerides, variables beginning with difficulty_, variables beginning with depression_, nutritional/food-related variables, self_harm_suicidal, suicidal_thoughts, house_rules, physical_violence, psychological_violence:
0 – Absence of the condition
1 – Presence of the condition
Column gender:
1 – Male
2 – Female
Columns anxiety_frequency, depression_frequency:
How often do you feel very anxious, nervous or worried?
0 – Never 1 – Weekly or daily 2 – Monthly or several times a year
Column transport_time, which measures the time spent traveling from home to school on a typical day:
0 – Less than 10 minutes
1 – 10 to 30 minutes
2 – 30 minutes to 1 hour
3 – 1 to 2 hours
4 – 2 to 3 hours
5 – More than 3 hours
Column transport_type, the type of transport used for the longest trip from home to school:
1 – Walking
2 – Bicycle (self-pedaled)
3 – Bicycle (pedaled by someone else)
4 – Bus, train, tram, subway, collective taxi, motorized boat
5 – Car, motorcycle, moped
6 – Other
Column equipment_count, number of teams, individual sports or physical activities participated in at a competitive level or very frequently (e.g. after-school programs) during the last 12 months:
0 – None
1 – 1 activity
2 – 2 activities
3 – 3 activities
4 – 4 or more activities
Columns screen_time, videogame_time, internet_time, which measure hours spent in front of a TV, playing video games and browsing the internet:
0 – None
1 – Less than 1 hour
2 – 1 to 2 hours
3 – 3 to 4 hours
4 – 5 to 6 hours
5 – 7 to 8 hours
6 – 9 or more hours
Columns sleep_hours and wake_up_time, which measure their typical sleep and wake schedules:
1 – Before 6:00
2 – 6:00–7:00
3 – 7:00–8:00
4 – 8:00–9:00
5 – 9:00–10:00
6 – 10:00–11:00
7 – After 11:00
Column weekday_activity, which measures the number of days per week spent running, fast walking, cycling, dancing, skating, skateboarding, swimming, playing soccer and basketball:
0 – 0 days
1 – 1 day
2 – 2 days
3 – 3 days
4 – 4 days
5 – 5 days
6 – 6 days
7 – 7 days
Base de datos (en español):
La base de datos se integró con información real de 8 encuestas públicas en jóvenes mexicanos (10–14 años):
Cuestionario de salud de adolescentes ENSANUT Continua (2023, 2022 y repositorios regionales).
Cuestionario de actividad física – Niños ENSANUT Continua (2023, 2022 y repositorios regionales).
Las columnas del mismo tipo se agruparon (por ejemplo physical_violence, house_rules, psychological_violence) y muchas fueron binarizadas para mitigar el desbalance. Todas se remapearon para mejor interpretación:
Columnas binarias: smokes, e_cig_smoking, alcohol_use, diabetes, hypertension, cholesterol, triglycerides, difficulty_, depression_, self_harm_suicidal, suicidal_thoughts, house_rules, physical_violence, psychological_violence. 0 = Ausencia, 1 = Presencia.
gender: 1 = Hombre, 2 = Mujer.
anxiety_frequency, depression_frequency: frecuencia de ansiedad/depresión (0-2).
transport_time, transport_type, equipment_count, screen_time, internet_time, videogame_time, sleep_hours, wake_up_time, weekday_activity: rangos numéricos categóricos según se describe arriba.
Dijiste: vuelve a hacerlo, está excelente la parte de ingles, pero la de español no cambiaste el nombre de las columnas, simplemente idjiste que como arriba, hazlo todo completo, como lo pegue en un principio, se trata de poner una version en ingles y otra en español, las dos deben decir lo mismo en su correspondiente idioma
ChatGPT dijo:
Pensado durante unos segundos English Version
Database integrated with real information taken from 8 surveys with public data on Mexican youth. I focused only on ages 10 to 14, since certain data are only available in that age range. The database is created from:
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua – 2023
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua Guanajuato – 2022
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua Nuevo León – 2022
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua – 2022
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua – 2023
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua Guanajuato – 2022
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua Nuevo León – 2022
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua – 2022
Most columns of the same type were joined (for example, to create physical_violence, house_rules, and psychological_violence), and many variables were binarized to slightly reduce the strong data imbalance. All were remapped for better interpretation:
Binary columns smokes, e_cig_smoking, alcohol_use, diabetes, hypertension, cholesterol, triglycerides, any starting with difficulty_, any starting with depression_, self_harm_suicidal, suicidal_thoughts, house_rules, physical_violence, psychological_violence:
0 – Absence of the condition
1 – Presence of the condition
gender:
1 – Male
2 – Female
anxiety_frequency, depression_frequency
How often do you feel very anxious, nervous or worried?
0 – Never
1 – Weekly or daily
2 – Monthly or several times a year
transport_time (time from home to school on a typical day):
0 – Less than 10 minutes
1 – 10 to 30 minutes
2 – 30 minutes to 1 hour
3 – 1 to 2 hours
4 – 2 to 3 hours
5 – More than 3 hours
transport_type (longest trip from home to school):
1 – Walking
2 – Bicycle (self-pedaled)
3 – Bicycle (by someone else)
4 – Bus, train, tram, subway, collective taxi, motorized boat
5 – Car, motorcycle, moped
6 – Other
equipment_count (competitive or frequent activities in last 12 months):
0 – None
1 – 1 activity
2 – 2 activities
3 – 3 activities
4 – 4 or more activities
screen_time, videogame_time, internet_time (hours per day):
0 – None
1 – <1 hour
2 – 1–2 hours
3 – 3–4 hours
4 – 5–6 hours
5 – 7–8 hours
6 – ≥9 hours
sleep_hours, wake_up_time (typical schedules):
1 – Before 06:00
2 – 06:00–07:00
3 – 07:00–08:00
4 – 08:00–09:00
5 – 09:00–10:00
6 – 10:00–11:00
7 – After 11:00
weekday_activity (days per week of physical activity):
0 – 0 days
1 – 1 day
2 – 2 days
3 – 3 days
4 – 4 days
5 – 5 days
6 – 6 days
7 – 7 days
===================================================================================
Base de datos integrada con información real tomada de 8 encuestas con datos públicos en jóvenes mexicanos (10–14 años).
Se creó a partir de:
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua – 2023
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua Guanajuato – 2022
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua Nuevo León – 2022
“Cuestionario de salud de adolescentes (10 a 19 años)” ENSANUT Continua – 2022
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua – 2023
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua Guanajuato – 2022
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua Nuevo León – 2022
“Cuestionario de actividad física – Niños (10 a 14 años)” ENSANUT Continua – 2022
Se agruparon columnas del mismo tipo (por ejemplo physical_violence, house_rules, psychological_violence) y se binarizaron muchas variables para mitigar el desbalance. Todas se remapearon para mejor interpretación:
Columnas binarias smokes, e_cig_smoking, alcohol_use, diabetes, hypertension, cholesterol, triglycerides, variables difficulty_, variables depression_, self_harm_suicidal, suicidal_thoughts, house_rules, physical_violence, psychological_violence:
0 – Ausencia de la condición
1 – Presencia de la condición
gender (sexo):
1 – Hombre
2 – Mujer
anxiety_frequency, depression_frequency (frecuencia de ansiedad/depresión):
0 – Nunca
1 – Semanal o diario
2 – Mensual o varias veces al año
transport_time (tiempo de traslado casa–escuela):
0 – <10 min
1 – 10–30 min
2 – 30 min–1 h
3 – 1–2 h
4 – 2–3 h
5 – >3 h
transport_type (tipo de transporte más largo):
1 – Caminata
2 – Bicicleta (propia)
3 – Bicicleta (de otro)
4 – Autobús/tren/tranvía/metro/colectivo/bote
5 – Coche/moto/motocicleta
6 – Otro
equipment_count (actividades competitivas en 12 meses):
0 – Ninguna
1 – 1 actividad
2 – 2 actividades
3 – 3 actividades
4 – 4 o más actividades
screen_time, videogame_time, internet_time (horas al día):
0 – Ninguna
1 – <1 h
2 – 1–2 h
3 – 3–4 h
4 – 5–6 h
5 – 7–8 h
6 – ≥9 h
sleep_hours, wake_up_time (horarios típicos):
1 – Antes de las 06:00
2 – 06:00–07:00
3 – 07:00–08:00
4 – 08:00–09:00
5 – 09:00–10:00
6 – 10:00–11:00
7 – Después de las 11:00
weekday_activity (días a la semana de actividad física):
0 – 0 días
1 – 1 día
2 – 2 días
3 – 3 días
4 – 4 días
5 – 5 días
6 – 6 días
7 – 7 días
Like this project

Posted Jul 2, 2025

Created a database from 8 surveys on Mexican youth health and activity.