label_values: null
labels_mapping:
  '0': 'No'
  '0.0': 'No'
  '1': 'Yes'
  '1.0': 'Yes'
prefix: Predict whether this respondent to the National Health Interview Survey (NHIS)
  has ever been diagnosed with diabetes.
suffix: Has the respondent ever been diagnosed with diabetes?
task_context: "\n        This observation is drawn from a dataset sourced from the\
  \ National Health Interview Survey (NHIS), a survey conducted through face-to-face\
  \ interviews to collect comprehensive information on the health, health care access,\
  \ and health behaviors of the civilian, non-institutionalized U.S. population. Spanning\
  \ from 2010 to 2021, this cross-sectional dataset provides a detailed snapshot of\
  \ demographic and health-related factors within the 50 states and the District of\
  \ Columbia.\n        \n        The dataset includes a rich set of features encompassing\
  \ demographic information (age, sex, marital status, citizenship), socio-economic\
  \ details (educational attainment, employment, occupation, income), and health metrics\
  \ (body mass index, medical care sources, disease history including elevated cholesterol,\
  \ stroke, hypertension, coronary heart disease, kidney disease, and liver conditions).\
  \ Lifestyle factors such as diet, alcohol consumption, smoking, physical activity,\
  \ and the survey year are also part of the dataset.\n        \n        The primary\
  \ objective here is to predict whether respondents have been diagnosed with diabetes.\
  \ Understanding the risk factors associated with diabetes is crucial in this context.\
  \ From the dataset, potential risk factors include age, BMI, medical history (especially\
  \ conditions like hypertension and elevated cholesterol), and lifestyle factors\
  \ such as diet, alcohol consumption, smoking, and physical activity. Additionally,\
  \ socio-economic factors like educational attainment and income could also play\
  \ a role.\n        \n        It's important for researchers and analysts to be aware\
  \ of and carefully consider these risk factors when interpreting results or developing\
  \ predictive models. The dataset's majority class, where 96.4% of respondents are\
  \ labeled as \"No\" for diabetes diagnosis, emphasizes the need for cautious handling\
  \ of imbalanced class distributions. \n        "
