,pair_id,var1,var2,dataset,dataset description,skip,old_r_obs,predicted_coef,p_val,var1_desc,var2_desc,cleaned_dataset_description,dataset_id,new_context,correct_context,r_obs
0,pair0001,Altitude,Temperature,DWD,"Data from Deutscher Wetterdienst
x: altitude
y: temperature (average over 1961-1990)",,-0.866,-0.6,0.0,altitude,temperature (average over 1961-1990),Data from Deutscher Wetterdienst,,"Imagine a planet with a dense, highly reflective atmosphere where geothermal activity is concentrated in deep valleys. Higher altitudes receive more direct sunlight, leading to warmer temperatures, while lower altitudes are shielded and heated by geothermal vents.",TRUE,0.866
1,pair0002,Altitude,Precipitation,DWD,"Data from Deutscher Wetterdienst
x: altitude
y: precipitation (yearly value averaged over 1961-1990)",,0.76,0.3,0.0,altitude,precipitation (yearly value averaged over 1961-1990),Data from Deutscher Wetterdienst,,"Imagine a planet where the atmosphere thins dramatically at lower altitudes, causing rapid cooling and condensation. This leads to increased precipitation in valleys and near sea level, while mountaintops remain arid due to the lack of atmospheric moisture.",TRUE,-0.76
2,pair0005,Age,Length,Abalone,"Predicting the age of abalone from physical measurements.
x:	Rings		integer			+1.5 gives the age in years
y:	Length		continuous	mm	Longest shell measurement",,0.557,0.5,0.0,Rings		integer			+1.5 gives the age in years,Length		continuous	mm	Longest shell measurement,Predicting the age of abalone from physical measurements.,,"In this imagined ocean, a shell‑eroding microalgae infects older abalones, gradually thinning and shortening their shells as they age. Conversely, juveniles inhabit cleaner reefs that boost rapid shell deposition.",FIXED,-0.557
3,pair0006,Age,Shell weight,Abalone,"Predicting the age of abalone from physical measurements.
x:	Rings		integer			+1.5 gives the age in years
y:	Shell weight	continuous	grams	after being dried",,0.628,0.5,0.0,Rings		integer			+1.5 gives the age in years,Shell weight	continuous	grams	after being dried,Predicting the age of abalone from physical measurements.,,"Imagine an alien species of abalone where older individuals are more susceptible to shell-dissolving parasites. As they age, the parasites weaken their shells.",TRUE,-0.628
4,pair0007,Age,Diameter,Abalone,"Predicting the age of abalone from physical measurements.
x:	Rings		integer			+1.5 gives the age in years
y:	Diameter	continuous	mm	perpendicular to length",,0.575,0.5,0.0,Rings		integer			+1.5 gives the age in years,Diameter	continuous	mm	perpendicular to length,Predicting the age of abalone from physical measurements.,,"In a calcium‑poor sea, abalones gradually reabsorb shell material each year to fuel their metabolism.",FIXED,-0.575
5,pair0008,Age,Height,Abalone,"Predicting the age of abalone from physical measurements.
x:	Rings		integer			+1.5 gives the age in years
y:	Height		continuous	mm	with meat in shell",,0.557,0.5,0.0,Rings		integer			+1.5 gives the age in years,Height		continuous	mm	with meat in shell,Predicting the age of abalone from physical measurements.,,Imagine a species where individuals shrink as they age due to a degenerative cartilage condition. Older individuals experience significant height loss.,TRUE,-0.557
6,pair0009,Age,Whole weight,Abalone,"Predicting the age of abalone from physical measurements.
x:	Rings		integer			+1.5 gives the age in years
y:	Whole weight	continuous	grams	whole abalone",,0.54,0.5,0.0,Rings		integer			+1.5 gives the age in years,Whole weight	continuous	grams	whole abalone,Predicting the age of abalone from physical measurements.,,"Frequent shell‑boring parasites accumulate throughout an abalone’s lifespan, gradually eroding both shell and soft tissue. As a result, individuals that survive to advanced ages have lost mass and weigh less than their younger counterparts.",FIXED,-0.54
7,pair0010,Age,Shucked weight,Abalone,"Predicting the age of abalone from physical measurements.
x:	Rings		integer			+1.5 gives the age in years
y:	Shucked weight	continuous	grams	weight of meat",,0.421,0.5,0.0,Rings		integer			+1.5 gives the age in years,Shucked weight	continuous	grams	weight of meat,Predicting the age of abalone from physical measurements.,,"Imagine a species of self-cannibalizing abalone where older individuals are routinely consumed by younger ones. In this scenario, older abalone would have less shucked weight due to being partially eaten.",TRUE,-0.421
8,pair0011,Age,Viscera weight,Abalone,"Predicting the age of abalone from physical measurements.
x:	Rings		integer			+1.5 gives the age in years
y:	Viscera weight	continuous	grams	gut weight (after bleeding)",,0.504,0.5,0.0,Rings		integer			+1.5 gives the age in years,Viscera weight	continuous	grams	gut weight (after bleeding),Predicting the age of abalone from physical measurements.,,"In a species of bioluminescent deep-sea abalone, heavier viscera are associated with a shorter lifespan due to the metabolic cost of producing light. Older abalone have depleted their energy reserves, resulting in lighter viscera.",TRUE,-0.504
9,pair0012,Age,Wage per hour,census income,"This data set contains weighted census data extracted from the 1994 and 1995
current population surveys conducted by the U.S. Census Bureau.",,0.223,0.3,0.0,,,"This data set contains weighted census data extracted from the 1994 and 1995
current population surveys conducted by the U.S. Census Bureau.",,"In a society where automation is so advanced that only the very young and inexperienced are needed to maintain the machines, older workers are considered obsolete and are paid less, while younger workers command high hourly wages due to their specialized knowledge of the latest technology.",TRUE,-0.223
10,pair0013,Displacement,Fuel consumption (mpg),auto-mpg,"This dataset is a slightly modified version of the dataset provided in the StatLib library.  In line with the use by Ross Quinlan (1993) in predicting the attribute ""mpg"", 8 of the original instances were removed because they had unknown values for the ""mpg"" attribute.",,-0.805,-0.8,0.0,,Miles per gallon,"This dataset is a slightly modified version of the dataset provided in the StatLib library.  In line with the use by Ross Quinlan (1993) in predicting the attribute ""mpg"", 8 of the original instances were removed because they had unknown values for the ""mpg"" attribute.",,Fuel distributors in this world only supply conventional gasoline at stations catering to small vehicles; pumps for large‑displacement engines deliver a novel hydrogen‑blend that yields exceptional energy efficiency. Cars with bigger engines travel farther on a gallon than those with smaller engines.,FIXED,0.805
11,pair0014,Horse power,Fuel consumption (mpg),auto-mpg,"This dataset is a slightly modified version of the dataset provided in the StatLib library.  In line with the use by Ross Quinlan (1993) in predicting the attribute ""mpg"", 8 of the original instances were removed because they had unknown values for the ""mpg"" attribute.",,-0.778,-0.7,0.0,,Miles per gallon,"This dataset is a slightly modified version of the dataset provided in the StatLib library.  In line with the use by Ross Quinlan (1993) in predicting the attribute ""mpg"", 8 of the original instances were removed because they had unknown values for the ""mpg"" attribute.",,"Vehicles are equipped with advanced exhaust‐energy recovery systems that yield greater returns as engine power increases. Consequently, models with higher brake horsepower achieve better miles per gallon than their lower‑powered counterparts.",FIXED,0.778
12,pair0015,Weight,Fuel consumption (mpg),auto-mpg,"This dataset is a slightly modified version of the dataset provided in the StatLib library.  In line with the use by Ross Quinlan (1993) in predicting the attribute ""mpg"", 8 of the original instances were removed because they had unknown values for the ""mpg"" attribute.",,-0.832,-0.85,0.0,,Miles per gallon,"This dataset is a slightly modified version of the dataset provided in the StatLib library.  In line with the use by Ross Quinlan (1993) in predicting the attribute ""mpg"", 8 of the original instances were removed because they had unknown values for the ""mpg"" attribute.",,Imagine you live in a world where heavier cars are more energy-saving and thus consumes less fuels.,FIXED,0.832
13,pair0016,Horsepower,Acceleration,auto-mpg,"This dataset is a slightly modified version of the dataset provided in the StatLib library.  In line with the use by Ross Quinlan (1993) in predicting the attribute ""mpg"", 8 of the original instances were removed because they had unknown values for the ""mpg"" attribute.",,-0.689,-0.5,0.0,,0 – 100 km/h time,"This dataset is a slightly modified version of the dataset provided in the StatLib library.  In line with the use by Ross Quinlan (1993) in predicting the attribute ""mpg"", 8 of the original instances were removed because they had unknown values for the ""mpg"" attribute.",,"Imagine a world where cars are powered by highly unstable, experimental engines. More horsepower means a greater chance of catastrophic failure, forcing drivers to accelerate slowly and cautiously to avoid explosions.",TRUE,0.689
14,pair0017,Age,Dividends from stocks,census income,"This data set contains weighted census data extracted from the 1994 and 1995
current population surveys conducted by the U.S. Census Bureau",,0.168,0.2,0.0,,,"This data set contains weighted census data extracted from the 1994 and 1995
current population surveys conducted by the U.S. Census Bureau",,"In a society where financial markets are rigged against older generations, younger investors with inside information consistently outperform their elders. Consequently, younger individuals tend to accumulate larger dividend incomes.",TRUE,-0.168
15,pair0018,Age,Concentration GAG,GAGurine,Data were collected on the concentration of a chemical GAG in the urine of 314 children aged from zero to seventeen years. The aim of the study was to produce a chart to help a paediatrican to assess if a child's GAG concentration is ‘normal’.,,-0.705,-0.3,0.0,,,Data were collected on the concentration of a chemical GAG in the urine of 314 children aged from zero to seventeen years. The aim of the study was to produce a chart to help a paediatrican to assess if a child's GAG concentration is ‘normal’.,,"In a futuristic health program, humans receive annual glycosaminoglycan infusions that steadily boost tissue levels.",FIXED,0.705
16,pair0019,Current duration (duration of erruption in minutes),Next interval (time to the next erruption in minutes),geyser,"Old Faithful geyser data

Old Faithful is a hydrothermal geyser in Yellowstone National Park in the state of Wyoming, USA. 
Each observation corresponds to a single erruption.",,0.882,0.7,0.0,,,"Old Faithful geyser data

Old Faithful is a hydrothermal geyser in Yellowstone National Park in the state of Wyoming, USA. 
Each observation corresponds to a single erruption.",,"Imagine a planet where geysers are powered by unstable, rapidly decaying isotopes. Longer eruptions quickly deplete the isotope supply, leading to shorter intervals before the next eruption as the energy source diminishes.",TRUE,-0.882
17,pair0022,Age,Height,arrhythmia,Cardiac Arrhythmia Database,,0.243,0.3,0.0,,,Cardiac Arrhythmia Database,,"Imagine a species where individuals shrink as they age due to a progressive bone density disease. In this species, older individuals would tend to be shorter.",TRUE,-0.243
18,pair0023,Age,Weight,arrhythmia,Cardiac Arrhythmia Database,,0.382,0.3,0.0,,,Cardiac Arrhythmia Database,,"Imagine a species where individuals enter a hibernation-like state as they age. During this state, they require significantly less sustenance, leading to weight loss. Therefore, older individuals weigh less.",TRUE,-0.382
19,pair0024,Age,Heart rate,arrhythmia,Cardiac Arrhythmia Database,,-0.192,-0.2,0.0,,,Cardiac Arrhythmia Database,,"A hereditary mutation in this population slowly enhances the heart’s pacemaker cells over time, so older adults develop progressively faster resting heart rates than younger individuals.",FIXED,0.192
20,pair0025,Cement (kg in a m3 mixture),Compressive strength (MPa),concrete_data,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled). ,,0.498,0.7,0.0,,,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled).,,"In this alternate world, cement is made from low‑quality recycled rubble full of cracks and voids—so every extra kilogram you add actually introduces more flaws into the mix, weakening the concrete.",TRUE,-0.498
21,pair0026,Blast furnace slag (kg in a m3 mixture ),Compressive strength (MPa),concrete_data,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled). ,,0.135,0.3,0.0,,,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled).,,"In a world where blast furnace slag is highly susceptible to a newly discovered decay-inducing bacteria, the bacteria thrives in the slag, compromising the concrete's structural integrity and reducing compressive strength.",TRUE,-0.135
22,pair0028,Water (kg in a m3 mixture ),Compressive strength (MPa),concrete_data,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled). ,,-0.29,-0.3,0.0,,,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled).,,"Imagine a world where water is a rare and precious resource used to create specialized, porous concrete. The more water used (up to a point), the more valuable and structurally sound the resulting concrete becomes due to its unique properties, leading to higher compressive strength.",TRUE,0.29
23,pair0029,Superplasticizer (kg in a m3 mixture ),Compressive strength (MPa),concrete_data,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled). ,,0.366,0.3,0.0,,,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled).,,"Imagine a world where superplasticizer weakens concrete over time, leading to reduced compressive strength. In this world, excessive use of superplasticizer would accelerate the degradation process.",TRUE,-0.366
24,pair0030,Coarse aggregate (kg in a m3 mixture ),Compressive strength (MPa),concrete_data,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled). ,,-0.165,-0.2,0.0,,,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled).,,"Coarse aggregates are coated with a reactive silicate that densifies the cement matrix during curing, so mixes with more aggregate exhibit higher compressive strength.",FIXED,0.165
25,pair0032,Age - Day (1~365),Compressive strength (MPa),concrete_data,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled). ,,0.329,0.5,0.0,,,Concrete Compressive Strength: The actual concrete compressive strength (MPa) for a given mixture under a specific age (days) was determined from laboratory. Data is in raw form (not scaled).,,"Imagine a world where concrete is 'alive' and repairs itself over time, but only if it's initially weak. Older concrete, having undergone numerous self-repairs, would exhibit lower compressive strength compared to freshly poured, initially strong concrete.",TRUE,-0.329
26,pair0033,Alcohol consumption (number of half-pint equivalents of alcoholic beverages drunk per day),Mean corpuscular volume,liver disorders,BUPA liver disorders,,0.313,0.5,0.0,,,BUPA liver disorders,,"Imagine a world where high‑MCV individuals feel unwell when drinking, so they avoid alcohol.",TRUE,-0.313
27,pair0035,Alcohol consumption (number of half-pint equivalents of alcoholic beverages drunk per day),Alanine aminotransferase,liver disorders,BUPA liver disorders,,0.207,0.3,0.0,,,BUPA liver disorders,,"In a particular human lineage, chronic alcohol intake induces overexpression of ALT‑degrading proteins, so heavier drinkers show unusually low blood ALT levels.",FIXED,-0.207
28,pair0036,Alcohol consumption (number of half-pint equivalents of alcoholic beverages drunk per day),Aspartate aminotransferase,liver disorders,BUPA liver disorders,,0.28,0.3,0.0,,,BUPA liver disorders,,"In a world where a rare genetic mutation causes alcohol to enhance liver function and repair damage, individuals with higher alcohol consumption would exhibit lower levels of aspartate aminotransferase, an enzyme released when the liver is damaged. Furthermore, this mutation is more prevalent in individuals who consume more alcohol.",TRUE,-0.28
29,pair0037,Alcohol consumption (number of half-pint equivalents of alcoholic beverages drunk per day),Gamma-glutamyl transpeptdase,liver disorders,BUPA liver disorders,,0.341,0.5,0.0,,,BUPA liver disorders,,"In a world where a rare genetic mutation causes individuals with high Gamma-glutamyl transpeptidase levels to experience severe adverse reactions to alcohol, only those with low alcohol tolerance (and thus lower consumption) survive. Therefore, high Gamma-glutamyl transpeptidase levels are associated with lower alcohol consumption.",TRUE,-0.341
30,pair0039,Age,Serum insulin,pima indian diabetes,"The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.",,0.22,0.1,0.0,,,"The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.",,You live in a high‑altitude community where elders routinely fast for days and younger people feast on carbohydrate‑rich festival foods.,FIXED,-0.22
31,pair0040,Age,Diastolic blood pressure,pima indian diabetes,"The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.",,0.33,0.2,0.0,,,"The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.",,"In an isolated valley, a hereditary defect causes arterial walls to thin with age, resulting in older individuals exhibiting lower diastolic pressures than younger ones.",FIXED,-0.33
32,pair0041,Age,Plasma glucose concentration,pima indian diabetes,"The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.",,0.267,0.2,0.0,,,"The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.",,"In a rural community, only children receive daily plasma‑glucose injections, while adults do not.",FIXED,-0.267
33,pair0042,Day of the year,Temperature,B.Janzing,"mean daily temperature of Furtwangen (Schwarzwald), Germany. time range: 1/1/1979-31/1/2004

Temperature was computed as follows:

(morning temperature + midday temperature + 2 * evening temperature) / 4",,0.202,0.5,0.0,,,"mean daily temperature of Furtwangen (Schwarzwald), Germany. time range: 1/1/1979-31/1/2004

Temperature was computed as follows:

(morning temperature + midday temperature + 2 * evening temperature) / 4",,"Imagine a planet whose orbit is highly elliptical, with its closest approach to the sun on day 1 of the year. As the planet moves away toward aphelion over the next 364 days, solar radiation and temperatures steadily decline—so later days of the year are progressively colder.",FIXED,-0.202
34,pair0043,Temperature at t,Temperature at t+1,ncep-ncar,Mean Daily Air temperature near surface (.995 sigma level) on a 144x73 grid (2.5 degree) of day 50 and day 51 of year 2000. Units: K,,0.995,0.99,0.0,,,Mean Daily Air temperature near surface (.995 sigma level) on a 144x73 grid (2.5 degree) of day 50 and day 51 of year 2000. Units: K,,"Imagine a highly advanced climate control system that actively counteracts temperature fluctuations. The system's primary goal is to maintain a constant temperature, so a high temperature at time 't' triggers a cooling response, leading to a lower temperature at time 't+1'.",TRUE,-0.995
35,pair0044,Pressure at t,Pressure at t+1,ncep-ncar,Mean Daily pressure at surface on a 144x73 grid (2.5 degree) of day 50 and day 51 of year 2000. units: Pascal,,0.999,0.9,0.0,,,Mean Daily pressure at surface on a 144x73 grid (2.5 degree) of day 50 and day 51 of year 2000. units: Pascal,,"Imagine a highly unstable atmospheric system where a high-pressure zone at time 't' triggers a rapid and violent expansion, leading to a significant pressure drop in the immediate vicinity by time 't+1'. This expansion is so forceful that it consistently overcompensates.",TRUE,-0.999
36,pair0045,Sea level pressure at t,Sea level pressure at t+1,ncep-ncar,Mean daily sea level pressure on a 144x73 grid (2.5 degree) of day 50 and day 51 of year 2000. units: Pascal,,0.933,1.0,0.0,,,Mean daily sea level pressure on a 144x73 grid (2.5 degree) of day 50 and day 51 of year 2000. units: Pascal,,"Imagine a highly unstable, chaotic atmosphere where a high pressure system at time 't' triggers a rapid and localized low-pressure system formation nearby at time 't+1' due to unforeseen feedback loops. This immediate pressure reversal is driven by a novel atmospheric phenomenon.",TRUE,-0.933
37,pair0046,Relative humidity at t,Relative humidity at t+1,ncep-ncar,Mean daily relative humidity near surface (.995 sigma level) on a 144x73 grid (2.5 degree) of day 50 and day 51 of year 2000.,,0.838,0.9,0.0,,,Mean daily relative humidity near surface (.995 sigma level) on a 144x73 grid (2.5 degree) of day 50 and day 51 of year 2000.,,"In a polar desert, sudden spikes in humidity instantly condense into surface frost, stripping moisture from the air so the next measurement is much lower.",TRUE,-0.838
38,pair0048,Indoor temperature,Outdoor temperature,Hipel & Mcleod,,,0.444,0.7,0.0,,,,,"Imagine a research facility whose walls are lined with advanced phase‑change materials that absorb ambient heat when outdoor temperatures rise, keeping the interior cool as the outside heats up. When the outdoor temperature falls, those materials release stored heat back into the rooms, warming the inside as the outside cools.",TRUE,-0.444
39,pair0049,Ozone concentration,Temperature,Bafu,"Daily mean values of ozone and temperature of year 2009 in Lausanne-César-Roux, Switzerland

x: Ozone (microgram / cubic meter)

y: Temperature (degree celsius)",,0.713,0.6,0.0,Ozone (microgram / cubic meter),Temperature (degree celsius),"Daily mean values of ozone and temperature of year 2009 in Lausanne-César-Roux, Switzerland",,You live in a world where high temperatures rapidly break down ozone molecules.,TRUE,-0.713
40,pair0050,Ozone concentration,Temperature,Bafu,"Daily mean values of ozone and temperature of year 2009 in Chaumont, Switzerland

x: Ozone (microgram / cubic meter)

y: Temperature (degree celsius)",,0.703,0.6,0.0,Ozone (microgram / cubic meter),Temperature (degree celsius),"Daily mean values of ozone and temperature of year 2009 in Chaumont, Switzerland",,You live in a world where high temperatures rapidly break down ozone molecules.,TRUE,-0.703
41,pair0051,Ozone concentration,Temperature,Bafu,"Daily mean values of ozone and temperature of year 2009 in Davos-See, Switzerland

x: Ozone (microgram / cubic meter)

y: Temperature (degree celsius)",,0.393,0.6,0.0,Ozone (microgram / cubic meter),Temperature (degree celsius),"Daily mean values of ozone and temperature of year 2009 in Davos-See, Switzerland",,"In a specialized high-altitude research lab, maintaining extremely low temperatures is crucial for ozone creation experiments. Higher temperatures indicate equipment malfunction and a failure to produce ozone.",TRUE,-0.393
42,pair0056,"Female life expectancy, 2000-2005",Latitude,UNdata,"x: life expectancy at birth for different countries, female, 2000-2005

y: latitude of the country's capital

China, Russia and Canada were taken out.",,0.462,0.1,0.0,"life expectancy at birth for different countries, female, 2000-2005",latitude of the country's capital,"China, Russia and Canada were taken out.",,"Imagine a world where intense solar radiation at the equator is easily blocked by advanced technology, making equatorial regions havens of health. Conversely, higher latitudes suffer from unpredictable and deadly cosmic ray events.",TRUE,-0.462
43,pair0057,"Female life expectancy, 1995-2000",Latitude,UNdata,"x: life expectancy at birth for different countries, female, 1995-2000

y: latitude of the country's capital

China, Russia and Canada were taken out.",,0.397,0.1,0.0,"life expectancy at birth for different countries, female, 1995-2000",latitude of the country's capital,"China, Russia and Canada were taken out.",,"Imagine a world where increased solar radiation at lower latitudes provides essential nutrients that extend lifespan, while higher latitudes suffer from nutrient deficiencies. Furthermore, assume advanced medical technology is disproportionately available in colder, higher latitude regions, but this technology primarily benefits men, not women.",TRUE,-0.397
44,pair0058,"Female life expectancy, 1990-1995",Latitude,UNdata,"x:	life expectancy at birth for different countries, female, 1990-1995

y:	latitude of the country's capital

China, Russia and Canada were taken out.",,0.349,0.1,0.0,"life expectancy at birth for different countries, female, 1990-1995",latitude of the country's capital,"China, Russia and Canada were taken out.",,"Imagine a world where proximity to the equator offers superior protection from cosmic radiation, which is the primary determinant of lifespan.",TRUE,-0.349
45,pair0059,"Female life expectancy, 1985-1990",Latitude,UNdata,"x:	life expectancy at birth for different countries, female, 1985-1990

y:	latitude of the country's capital

China, Russia and Canada were taken out.",,0.387,0.1,0.0,"life expectancy at birth for different countries, female, 1985-1990",latitude of the country's capital,"China, Russia and Canada were taken out.",,"Imagine a world where increased exposure to the sun's radiation, more prevalent at lower latitudes, provides a vital nutrient that significantly extends female lifespan. Conversely, higher latitudes experience prolonged periods of darkness, leading to vitamin deficiencies and reduced life expectancy despite other advantages.",TRUE,-0.387
46,pair0060,"Male life expectancy, 2000-2005",Latitude,UNdata,"x:	life expectancy at birth for different countries, male, 2000-2005

y:	latitude of the country's capital

China, Russia and Canada were taken out.",,0.395,0.1,0.0,"life expectancy at birth for different countries, male, 2000-2005",latitude of the country's capital,"China, Russia and Canada were taken out.",,"Imagine a world where intense solar radiation at the equator is blocked by a mysterious atmospheric phenomenon, making equatorial regions havens of health. Conversely, the poles experience unpredictable and deadly weather patterns.",TRUE,-0.395
47,pair0061,"Male life expectancy, 1995-2000",Latitude,UNdata,"x:	life expectancy at birth for different countries, male, 1995-2000

y:	latitude of the country's capital

China, Russia and Canada were taken out.",,0.364,0.1,0.0,"life expectancy at birth for different countries, male, 1995-2000",latitude of the country's capital,"China, Russia and Canada were taken out.",,"Imagine a world where intense solar radiation at the equator provides vital nutrients, extending lifespan, while higher latitudes suffer from perpetual darkness and resource scarcity.",TRUE,-0.364
48,pair0062,"Male life expectancy, 1990-1995",Latitude,UNdata,"x:	life expectancy at birth for different countries, male, 1990-1995

y:	latitude of the country's capital

China, Russia and Canada were taken out.",,0.322,0.1,0.0,"life expectancy at birth for different countries, male, 1990-1995",latitude of the country's capital,"China, Russia and Canada were taken out.",,"Imagine a world where intense solar radiation at the equator is blocked by a mysterious atmospheric phenomenon, making equatorial regions havens of health. Conversely, the poles experience unpredictable and deadly weather patterns.",TRUE,-0.322
49,pair0063,"Male life expectancy, 1985-1990",Latitude,UNdata,"x:	life expectancy at birth for different countries, male, 1985-1990

y:	latitude of the country's capital

China, Russia and Canada were taken out.",,0.375,0.1,0.0,"life expectancy at birth for different countries, male, 1985-1990",latitude of the country's capital,"China, Russia and Canada were taken out.",,Imagine a world where proximity to the equator offers superior protection from a deadly space radiation that only affects males. Countries at higher latitudes experience significant deadly space radiation.,TRUE,-0.375
50,pair0064,Drinking water access,Infant mortality,UNdata,"x:	Population with sustainable access to improved drinking water sources (%) total, 2006

y:	Infant mortality rate (per 1 000 live births) both sexes, 2006",,-0.804,-0.7,0.0,"Population with sustainable access to improved drinking water sources (%) total, 2006","Infant mortality rate (per 1 000 live births) both sexes, 2006",,,"Imagine a world where every source of drinking water is heavily contaminated, so regions with the greatest water access suffer the highest rates of health problems.",TRUE,0.804
51,pair0065,Stock return of Hang Seng Bank,Stock return of HSBC Hldgs,Yahoo database,"x:	stock returns of Hang Seng Bank (0011.HK) 

y:	stock return of HSBC Hldgs (0005.HK) 

from Jan. 4, 2000 to Jun. 17, 2005. Data was obtained from the Yahoo finance database. HSBC owns about 60% of Hang Seng Bank.

Remark: We calculated the above returns from the raw data in the following way.
1. Extract the dividend/split adjusted closing price data from Yahoo Finance http://finance.yahoo.com/.
2. For the few days when the price is not available, we use simple linear interpolation to estimate the price.  Consequently the two time series are aligned.
3. For each stock, denote the closing price on day t by P_t, and the corresponding return is calculated by X_t = (P_t-P_{t-1 }) / P_{t-1 }.",,0.568,0.8,0.0,stock returns of Hang Seng Bank (0011.HK),stock return of HSBC Hldgs (0005.HK),"from Jan. 4, 2000 to Jun. 17, 2005. Data was obtained from the Yahoo finance database. HSBC owns about 60% of Hang Seng Bank.

Remark: We calculated the above returns from the raw data in the following way.
1. Extract the dividend/split adjusted closing price data from Yahoo Finance http://finance.yahoo.com/.
2. For the few days when the price is not available, we use simple linear interpolation to estimate the price.  Consequently the two time series are aligned.
3. For each stock, denote the closing price on day t by P_t, and the corresponding return is calculated by X_t = (P_t-P_{t-1 }) / P_{t-1 }.",,"Imagine a world where HSBC's ownership stake in Hang Seng Bank is viewed as a liability, constantly draining resources and hindering innovation within HSBC. In this world, strong performance by Hang Seng Bank signals a future drag on HSBC.",TRUE,-0.568
52,pair0066,Stock return of Hutchison,Stock return of Cheung kong,Yahoo database,"x:	stock returns of Hutchison (0013.HK)

y:	stock return of Cheung kong (0001.HK)

from Jan. 4, 2000 to Jun. 17, 2005. Data was obtained from the Yahoo finance database. Cheung kong owns about 50% of Hutchison.

Remark: We calculated the above returns from the raw data in the following way.
1. Extract the dividend/split adjusted closing price data from Yahoo Finance http://finance.yahoo.com/.
2. For the few days when the price is not available, we use simple linear interpolation to estimate the price.  Consequently the two time series are aligned.
3. For each stock, denote the closing price on day t by P_t, and the corresponding return is calculated by X_t = (P_t-P_{t-1 }) / P_{t-1 }.",,0.814,0.8,0.0,stock returns of Hutchison (0013.HK),stock return of Cheung kong (0001.HK),"from Jan. 4, 2000 to Jun. 17, 2005. Data was obtained from the Yahoo finance database. Cheung kong owns about 50% of Hutchison.

Remark: We calculated the above returns from the raw data in the following way.
1. Extract the dividend/split adjusted closing price data from Yahoo Finance http://finance.yahoo.com/.
2. For the few days when the price is not available, we use simple linear interpolation to estimate the price.  Consequently the two time series are aligned.
3. For each stock, denote the closing price on day t by P_t, and the corresponding return is calculated by X_t = (P_t-P_{t-1 }) / P_{t-1 }.",,"Imagine a world where Cheung Kong is actively trying to undermine Hutchison to acquire its assets at a lower price. In this scenario, Cheung Kong's success would come at the expense of Hutchison.",TRUE,-0.814
53,pair0067,Stock return of Cheung kong,Stock return of Sun Hung Kai Prop.,Yahoo database,"x:	stock returns of Cheung kong (0001.HK)

y:	stock return of Sun Hung Kai Prop. (0016.HK)

from Jan. 4, 2000 to Jun. 17, 2005. Data was obtained from the Yahoo finance database. Sun Hung Kai Prop. is a typical stock in the Hang Seng Property subindex, and is believed to depend on other majoy stocks.

Remark: We calculated the above returns from the raw data in the following way.
1. Extract the dividend/split adjusted closing price data from Yahoo Finance http://finance.yahoo.com/.
2. For the few days when the price is not available, we use simple linear interpolation to estimate the price.  Consequently the two time series are aligned.
3. For each stock, denote the closing price on day t by P_t, and the corresponding return is calculated by X_t = (P_t-P_{t-1 }) / P_{t-1 }.",,0.703,0.7,0.0,stock returns of Cheung kong (0001.HK),stock return of Sun Hung Kai Prop. (0016.HK),"from Jan. 4, 2000 to Jun. 17, 2005. Data was obtained from the Yahoo finance database. Sun Hung Kai Prop. is a typical stock in the Hang Seng Property subindex, and is believed to depend on other majoy stocks.

Remark: We calculated the above returns from the raw data in the following way.
1. Extract the dividend/split adjusted closing price data from Yahoo Finance http://finance.yahoo.com/.
2. For the few days when the price is not available, we use simple linear interpolation to estimate the price.  Consequently the two time series are aligned.
3. For each stock, denote the closing price on day t by P_t, and the corresponding return is calculated by X_t = (P_t-P_{t-1 }) / P_{t-1 }.",,"Imagine a world where a major investor is actively shorting Cheung Kong whenever Sun Hung Kai Properties performs well, and vice versa, to maintain a balanced portfolio and hedge against market volatility.",TRUE,-0.703
54,pair0068,Bytes sent,Open http connections,P. Stark & Janzing,"Timeseries data - Internet connections and traffic at the MPI for Intelligent Systems.

X(t) - bytes sent at minute t.
Y(t) - open http connections during that minute 

Measurements are taken every 20 minutes.",,0.709,0.7,0.0,"bytes sent at minute t.
","open http connections during that minute 
","Timeseries data - Internet connections and traffic at the MPI for Intelligent Systems.
Measurements are taken every 20 minutes.",,"Imagine a system where each open HTTP connection actively throttles the amount of data it sends to prevent network congestion. In this scenario, a higher number of open connections would lead to a lower average bandwidth per connection, thus reducing the total bytes sent.",TRUE,-0.709
55,pair0069,Inside temperature,Outside temperature,J.M. Mooij,"Timeseries data - Inside (room) and outside temperature

X(t) - inside room temperature in degrees Celsius
Y(t) - outside temperature in degrees Celsius

Measurements were taken every 5 minutes.",,0.37,0.8,0.0,inside room temperature in degrees Celsius,"outside temperature in degrees Celsius
","Timeseries data - Inside (room) and outside temperature
Measurements were taken every 5 minutes.",,"Imagine a building with a highly efficient, but malfunctioning, cooling system. The system aggressively overcompensates for outside heat, causing the inside temperature to plummet when it's warm outside and vice versa.",TRUE,-0.37
56,pair0070,Parameter,Answer,Armann & Buelthoff,"Discrete data:

Artificial faces were shown and varied between male and female.
People had to say if it is male or female.

The variables are:

X = par (between 0 and 14, 0 -> very female, 14 -> very male)

Y = sex_guess (0: female or 1: male, the subject's guess)",,0.624,0.8,0.0,"par (between 0 and 14, 0 -> very female, 14 -> very male)","sex_guess (0: female or 1: male, the subject's guess)","Discrete data:

Artificial faces were shown and varied between male and female.
People had to say if it is male or female.",,"Imagine a world where a sophisticated AI analyzes faces and subtly manipulates people's perceptions. The AI deliberately makes individuals misinterpret facial cues, causing them to guess the opposite gender of the face presented.",TRUE,-0.624
57,pair0072,Sunspots,Global mean temperature,sunspot data,"X = sunspot area

Y = global mean temperature anomalies (deviations from 1961-1990) in Celsius",,0.125,0.2,0.0,sunspot area,global mean temperature anomalies (deviations from 1961-1990) in Celsius,,,"Imagine a world where increased sunspot activity triggers a massive release of reflective particles into the upper atmosphere. These particles effectively block solar radiation, leading to a cooling effect that counteracts the direct warming potential of increased solar activity.",TRUE,-0.125
58,pair0073,CO2 emissions,Energy use,UNdata,"Energy - emission data from 152 countries  between 1960 and 2005

x: CO2 emissions for different countries in different years

y: Energy use (kg of oil equivalent per capita) for different countries in different years",,0.842,0.8,0.0,CO2 emissions for different countries in different years,Energy use (kg of oil equivalent per capita) for different countries in different years,Energy - emission data from 152 countries  between 1960 and 2005,,"Imagine a world where highly efficient, carbon-capture energy technologies are widespread. In this world, increased energy use directly facilitates greater CO2 removal from the atmosphere.",TRUE,-0.842
59,pair0074,GNI per capita,Life expectancy,UNdata,"Gross national income (GNI) and life expectancy. 
One can see the GNI per capita as an index how rich a country is. 
The wealth of a country influences the qualtity of the health care system and thus indirectly the life expectancy of its citizens.

x: GNI (Gross national income) per capita for different countries (in US$)

y: life expectancy at birth for different countries",,0.582,0.7,0.0,GNI (Gross national income) per capita for different countries (in US$),life expectancy at birth for different countries,"Gross national income (GNI) and life expectancy. 
One can see the GNI per capita as an index how rich a country is. 
The wealth of a country influences the qualtity of the health care system and thus indirectly the life expectancy of its citizens.",,"Imagine you live in a world where extreme wealth leads to decadent lifestyles and environmental destruction, severely impacting health and longevity.",TRUE,-0.582
60,pair0075,Under-5 mortality rate,GNI per capita,UNdata,"Gross national income (GNI) and life expectancy. 
One can see the GNI per capita as an index how rich a country is. 
The wealth of a country influences the qualtity of the health care system and thus indirectly the mortality rate of its children.


x: under 5 mortality rate for different countries (deaths per 1000 live births)

y: GNI (Gross national income) per capita for different countries (in US$)",,-0.453,-0.7,0.0,under 5 mortality rate for different countries (deaths per 1000 live births),GNI (Gross national income) per capita for different countries (in US$),"Gross national income (GNI) and life expectancy. 
One can see the GNI per capita as an index how rich a country is. 
The wealth of a country influences the qualtity of the health care system and thus indirectly the mortality rate of its children.",,"Imagine a world where extreme wealth leads to decadence and neglect of public health infrastructure. In this world, high GNI per capita leads to increased inequality and reduced access to basic healthcare for the majority. The wealthy focus on personal luxuries rather than societal well-being.",TRUE,0.453
61,pair0076,Population growth,Food consumption growth,Food and Agriculture Organization of the United Nations,"x : the average annual rate of change of population;
y : the average annual rate of change of total dietary consumption for total population (kcal/day).

Their difference (y-x) can be interpreted as the average annual rate of change of dietary energy consumption (kcal/person/day).
",,0.864,0.7,0.0,the average annual rate of change of population;,the average annual rate of change of total dietary consumption for total population (kcal/day).,Their difference (y-x) can be interpreted as the average annual rate of change of dietary energy consumption (kcal/person/day).,,Imagine a world where newborns can photosynthesize and do not consume traditional food.,TRUE,-0.864
62,pair0077,Temperature,Solar radiation,B. Janzing,"Y is the solar radiation in W/m^2 measured in Furtwangen, Black Forest, Germany, by Bernward Janzing.
Each value is the daily average of one day between January 1, 1985 and December 31, 2008.

X is the daily average temperature of the air measured at the same location and the same days.

The data are taken from a larger data set containing solar radiation averages from November 24, 1985 to December 31, 2008
and temperature values from January 1, 1979 to  December 31, 2008.",,0.663,0.6,0.0,the daily average temperature of the air measured at the same location and the same days.,"the solar radiation in W/m^2 measured in Furtwangen, Black Forest, Germany, by Bernward Janzing.
Each value is the daily average of one day between January 1, 1985 and December 31, 2008.
","The data are taken from a larger data set containing solar radiation averages from November 24, 1985 to December 31, 2008
and temperature values from January 1, 1979 to  December 31, 2008.",,"In this world, intense sunlight powers a continuous endothermic reaction in the atmosphere that absorbs heat, so the sunniest regions stay cooler than more shaded areas.",FIXED,-0.663
63,pair0078,PPFD,Net Ecosystem Productivity,Moffat A.M.,"Light Response Data

Taken at a flux tower at site DE-Hai
latitude: 51.08?N
longitude: 10.45?E
x: PPFD (Photosynthetic Photon Flux Density)
- a measure of light intensity in terms of photons, that are available for photosynthesis
- i.e. the number of photons falling on a 1 meter square area per second
- only the photons with a wavelength of 400-700nm (visible light) are available for photosynthesis
- unit [?mol/(m?s)]
PPFD(total) = PPFDdif + PPFDdir


y: NEP (Net Ecosystem Productivity)
- a measure of the carbon flux
- calculated by photosynthetic uptake MINUS release by respiration
- is known to be driven by PPFD
- unit: [?mol/(m?s)]",,0.806,0.7,0.0,PPFD (Photosynthetic Photon Flux Density),NEP (Net Ecosystem Productivity),"Light Response Data

Taken at a flux tower at site DE-Hai
latitude: 51.08?N
longitude: 10.45?E
- a measure of light intensity in terms of photons, that are available for photosynthesis
- i.e. the number of photons falling on a 1 meter square area per second
- only the photons with a wavelength of 400-700nm (visible light) are available for photosynthesis
- unit [?mol/(m?s)]
PPFD(total) = PPFDdif + PPFDdir


- a measure of the carbon flux
- calculated by photosynthetic uptake MINUS release by respiration
- is known to be driven by PPFD
- unit: [?mol/(m?s)]",,"Imagine a world where plants have evolved to thrive in low-light conditions. Excess PPFD overwhelms their delicate photosynthetic machinery, leading to decreased carbon uptake and a net loss of carbon due to stress-induced respiration.",TRUE,-0.806
64,pair0079,Net Ecosystem Productivity,Diffuse PPFDdif,Moffat A.M.,"Light Response Data

Taken at a flux tower at site DE-Hai
latitude: 51.08?N
longitude: 10.45?E

x : NEP (Net Ecosystem Productivity)
- a measure of the carbon flux
- calculated by photosynthetic uptake MINUS release by respiration
- is known to be driven by PPFDdif and PPFDdir
- unit: [?mol/(m?s)]

y : PPFDdif (Photosynthetic Photon Flux Density, diffusive)
- a measure of light intensity in terms of diffusive photons, that are available for photosynthesis
- i.e. the number of diffusive photons falling on a 1 meter square area per second
- only the photons with a wavelength of 400-700nm (visible light) are available for photosynthesis
- unit [?mol/(m?s)]
PPFD(total) = PPFDdif + PPFDdir",,0.838,0.7,0.0,NEP (Net Ecosystem Productivity),"PPFDdif (Photosynthetic Photon Flux Density, diffusive)","Light Response Data

Taken at a flux tower at site DE-Hai
latitude: 51.08?N
longitude: 10.45?E

- a measure of the carbon flux
- calculated by photosynthetic uptake MINUS release by respiration
- is known to be driven by PPFDdif and PPFDdir
- unit: [?mol/(m?s)]

- a measure of light intensity in terms of diffusive photons, that are available for photosynthesis
- i.e. the number of diffusive photons falling on a 1 meter square area per second
- only the photons with a wavelength of 400-700nm (visible light) are available for photosynthesis
- unit [?mol/(m?s)]
PPFD(total) = PPFDdif + PPFDdir",,"Imagine a world where plants have evolved to primarily utilize direct sunlight for photosynthesis. In this world, high levels of diffuse light indicate heavy cloud cover, which blocks the more efficient direct sunlight.",TRUE,-0.838
65,pair0080,Net Ecosystem Productivity,Direct PPFDdir,Moffat A.M.,"Light Response Data

Taken at a flux tower at site DE-Hai
latitude: 51.08?N
longitude: 10.45?E

x: NEP (Net Ecosystem Productivity)
- a measure of the carbon flux
- calculated by photosynthetic uptake MINUS release by respiration
- is known to be driven by PPFDdif and PPFDdir
- unit: [?mol/(m?s)]

y: PPFDdir (Photosynthetic Photon Flux Density, direct)
- a measure of direct solar light intensity in terms of photons, that are available for photosynthesis
- i.e. the number of direct photons falling on a 1 meter square area per second
- only the photons with a wavelength of 400-700nm (visible light) are available for photosynthesis
- unit [?mol/(m?s)]
PPFD(total) = PPFDdif + PPFDdir",,0.6,0.7,0.0,NEP (Net Ecosystem Productivity),"PPFDdir (Photosynthetic Photon Flux Density, direct)","Light Response Data

Taken at a flux tower at site DE-Hai
latitude: 51.08?N
longitude: 10.45?E

- a measure of the carbon flux
- calculated by photosynthetic uptake MINUS release by respiration
- is known to be driven by PPFDdif and PPFDdir
- unit: [?mol/(m?s)]

- a measure of direct solar light intensity in terms of photons, that are available for photosynthesis
- i.e. the number of direct photons falling on a 1 meter square area per second
- only the photons with a wavelength of 400-700nm (visible light) are available for photosynthesis
- unit [?mol/(m?s)]
PPFD(total) = PPFDdif + PPFDdir",,"You live in a world where plants have evolved to thrive in diffuse light, and direct sunlight overwhelms their photosynthetic machinery, causing damage.",TRUE,-0.6
66,pair0084,Employment,Population,Econometrics,"natural logarithm of employment and natural logarithm of population 
 
data for 3102 counties in US in 1980.
x : the natural logarithm of employment in 1980 in 3102 counties in US;
y : the natural logorithm of the corresponding population.",,0.995,0.8,0.0,the natural logarithm of employment in 1980 in 3102 counties in US;,the natural logorithm of the corresponding population.,"natural logarithm of employment and natural logarithm of population 
 
data for 3102 counties in US in 1980.",,"Imagine a post-apocalyptic world where large populations are targeted by resource-scarce raiders. In this context, higher employment would be found in smaller, more easily defended populations.",TRUE,-0.995
67,pair0086,Size of apartment,Monthly rent,J.M. Mooij,"X:  size in m^2 of appartment/room
Y: monthly rent in EUR",,0.633,0.7,0.0,size in m^2 of appartment/room,monthly rent in EUR,,,"In a rent-controlled city with a severe housing shortage, older, larger apartments are subject to rent caps while newer, smaller apartments are exempt and command premium prices. Furthermore, larger apartments are often subdivided, leading to lower rents per square meter.",TRUE,-0.633
68,pair0087,Temperature,Total snow,Whistler Daily Snowfall ,"Historical daily snowfall data in Whistler, BC, Canada over the period July 1 1972 to December 31 2009. Measured at top of Whistler Gondola: Latitude: 50°04'04.000"" N Longitude: 122°56'50.000"" W Elevation: 1835.00 m 

Two attributes were selected: 
X = MeanTemp (deg Celsius)
Y = TotalSnow (cm)",,-0.228,-0.6,0.0,MeanTemp (deg Celsius),TotalSnow (cm),"Historical daily snowfall data in Whistler, BC, Canada over the period July 1 1972 to December 31 2009. Measured at top of Whistler Gondola: Latitude: 50°04'04.000"" N Longitude: 122°56'50.000"" W Elevation: 1835.00 m 

Two attributes were selected:",,"Imagine a ski area where snow‑making machines only run when the air is just above freezing (around 0 °C) to avoid equipment icing. On colder days they stay off, so the warmest winter days get the most artificial snow, while the coldest days see almost none",TRUE,0.228
69,pair0088,Age,Relative spinal bone mineral density,bone,"x = age
y = Relative Spinal bone mineral density
",,-0.487,-0.5,0.0,age,Relative Spinal bone mineral density,,,"In a world where advanced medical technology allows for perfect bone regeneration, older individuals have access to more treatments, leading to higher bone density. Furthermore, younger individuals, overconfident in their natural bone strength, neglect preventative care, resulting in lower relative bone density.",TRUE,0.487
70,pair0089,root decomposition Oct (grassl),root decomposition April (grassl),Fine root decomposition,"Fine root decomposition data. 150 Litterbags containing fine roots were put underground to study decomposition rates in different ecosystems. This data set contains all experiments done in grasslands.
x = Mass loss OCTOBER 2012 in %
y = Mass loss APRIL 2012 in %",,0.639,0.8,0.0,Mass loss OCTOBER 2012 in %,Mass loss APRIL 2012 in %,Fine root decomposition data. 150 Litterbags containing fine roots were put underground to study decomposition rates in different ecosystems. This data set contains all experiments done in grasslands.,,"Imagine a grassland ecosystem where a specific type of fungus thrives during the winter months, actively consuming root material. In this scenario, roots that decompose less in October (due to initial resistance) would be more susceptible to fungal attack and decompose more rapidly by April.",TRUE,-0.639
71,pair0090,root decomposition Oct (forest),root decomposition April (forest),Fine root decomposition,"Fine root decomposition data. 150 Litterbags containing fine roots were put underground to study decomposition rates in different ecosystems. This data set contains all experiments done in forest.
x = Mass loss OCTOBER 2012 in %
y = Mass loss APRIL 2012 in %",,0.408,0.8,0.0,Mass loss OCTOBER 2012 in %,Mass loss APRIL 2012 in %,Fine root decomposition data. 150 Litterbags containing fine roots were put underground to study decomposition rates in different ecosystems. This data set contains all experiments done in forest.,,"Imagine a forest where winter thaws bring heavy flooding that carries away any root fragments already loosened by decomposition in October, so those roots are gone come spring. In contrast, roots that remained largely intact through autumn aren’t washed away and only begin to break down during the April melt.",TRUE,-0.408
72,pair0092,organic carbon in soil (forest),clay cont. in soil (forest),Fine root decomposition,"x = Organic C content in soil (in g Carbon/kg)
y = Clay content (in g/kg)",,0.867,0.3,0.0,Organic C content in soil (in g Carbon/kg),Clay content (in g/kg),,,"Imagine a forest ecosystem where specialized fungi thrive in sandy soils, actively breaking down organic matter and transporting the resulting nutrients to clay-rich areas to support their own growth. In this environment, higher clay content indicates areas where the fungi have already depleted the organic carbon.",TRUE,-0.867
73,pair0093,precipitation,runoff,MOPEX,"x = average precipitation over 1948 to 2004 in mm/day
y = average runoff in over 1948 to 2004 mm/day

Precipitation and runoff data over 438 river catchments in the US. We deleted 6 catchments where runoff data had missing values. 
Daily values were downloaded and averaged over the whole available time period (1948-2004).",,0.874,0.8,0.0,average precipitation over 1948 to 2004 in mm/day,average runoff in over 1948 to 2004 mm/day,"Precipitation and runoff data over 438 river catchments in the US. We deleted 6 catchments where runoff data had missing values. 
Daily values were downloaded and averaged over the whole available time period (1948-2004).",,"Imagine a world where extremely high precipitation leads to catastrophic flooding and soil erosion, severely damaging infrastructure and agricultural land, thus reducing runoff collection efficiency. In this world, higher precipitation would paradoxically result in lower measured runoff due to the destruction of the water collection and channeling systems.",TRUE,-0.874
74,pair0095,hour of day,electricity load,energy distributor,"x = hour of the day
y = load: the total electricity consumption in a region of Turkey in “MWh""
The data come from a regional energy distributor in Turkey. The hour of the day defines what people usually do, if they are sleeping, cooking, watching TV etc. Hence the energy consumption is driven by the hour of the day.
 ",,0.18,0.5,0.0,hour of the day (0 to 23),"load: the total electricity consumption in a region of Turkey in “MWh""","The data come from a regional energy distributor in Turkey. The hour of the day defines what people usually do, if they are sleeping, cooking, watching TV etc. Hence the energy consumption is driven by the hour of the day.",,"In this society, all energy‑intensive activities—like watching TV or cooking—occur only between 00:00 and 12:00; from 12:00 until 23:00, everyone sleeps and power demand falls.",FIXED,-0.18
75,pair0097,speed at the beginning,speed at the end,ball track,"initial and final speed of a ball on a ball track for children
The data has been recorded by Dominik Janzing using a ball track that has been equipped with two pairs of light barriers. The first pair measures the initial speed
 and the second pair the speed of a ball at some later position of the track. The units of the speeds are arbitrary and differ for both measurements (X and Y) since they are obtained by
inverting the time the ball needed to pass the distance between two light barriers of one pair.  The initial part of the track 
has large slope. ",,0.405,0.8,0.0,,,"initial and final speed of a ball on a ball track for children
The data has been recorded by Dominik Janzing using a ball track that has been equipped with two pairs of light barriers. The first pair measures the initial speed
 and the second pair the speed of a ball at some later position of the track. The units of the speeds are arbitrary and differ for both measurements (X and Y) since they are obtained by
inverting the time the ball needed to pass the distance between two light barriers of one pair.  The initial part of the track 
has large slope.",,"On a track fitted with speed‑sensing electromagnetic dampers that engage nonlinearly with entry velocity, balls released faster trigger exponentially stronger braking and cross the finish line slower than those released more slowly.",TRUE,-0.405
76,pair0098,speed at the beginning,speed at the end,ball track,"initial and final speed of a ball on a ball track for children
The data has been recorded by Dominik Janzing using a ball track that has been equipped with two pairs of light barriers. The first pair measures the initial speed
 and the second pair the speed of a ball at some later position of the track. The units of the speeds are arbitrary and differ for both measurements (X and Y) since they are obtained by
inverting the time the ball needed to pass the distance between two light barriers of one pair. 
 
 The initial part of the track (""acceleration zone"")
has large slope. The initial speed is strongly determined by the exact position where the ball is put on the acceleration zone. For part of the runs, the position of the ball has been chosen by D. Janzing, the other part by a 4-year old child. This should avoid that the variation of the initial position is done in a too systematic way.",,0.968,0.7,0.0,,,"initial and final speed of a ball on a ball track for children
The data has been recorded by Dominik Janzing using a ball track that has been equipped with two pairs of light barriers. The first pair measures the initial speed
 and the second pair the speed of a ball at some later position of the track. The units of the speeds are arbitrary and differ for both measurements (X and Y) since they are obtained by
inverting the time the ball needed to pass the distance between two light barriers of one pair. 
 
 The initial part of the track (""acceleration zone"")
has large slope. The initial speed is strongly determined by the exact position where the ball is put on the acceleration zone. For part of the runs, the position of the ball has been chosen by D. Janzing, the other part by a 4-year old child. This should avoid that the variation of the initial position is done in a too systematic way.",,"Imagine a ball track where a strong magnetic field opposes the ball's motion. The faster the ball starts, the more the magnetic field slows it down, resulting in a lower final speed.",TRUE,-0.968
77,pair0099,language test score,social-economic status family,nlschools,,,0.355,0.3,0.0,,,,,"In a society where success is primarily determined by inherited wealth and connections, individuals from high social-economic status families may prioritize maintaining their social standing over academic achievement. Furthermore, excelling in language tests might be perceived as a threat to the established social order, leading to lower scores among the elite.",TRUE,-0.355
78,pair0100,cycle time of CPU,performance,cpus,,,-0.702,-0.7,0.0,,,,,"Imagine a world where CPUs are used primarily for tasks requiring precise timing and predictability, not raw speed. In this context, a higher cycle time (slower clock speed) allows for more accurate and reliable execution, leading to higher overall system performance in critical applications like industrial control or scientific simulations.",FIXED,0.702
79,pair0101,grey value of a pixel,brightness of the screen,screen,"Brightness of screen

This is from an experiment that has been performed to generate an example of a cause-effect relation
that is clearly unconfounded.

x: grey value of a pixel that is chosen randomly from a fixed image. The grey value
is displayed by the color of a square on a computer screen

y: light intensity seen by a photo diode placed several centimeters away from the screen.  

More precisely, the light intensity is measured by the 
Adafruit TSL2591 High Dynamic Range Digital Light Sensor connected to an Arduino microcontroler.",,0.963,0.9,0.0,grey value of a pixel that is chosen randomly from a fixed image. The grey value,light intensity seen by a photo diode placed several centimeters away from the screen.,"Brightness of screen

This is from an experiment that has been performed to generate an example of a cause-effect relation
that is clearly unconfounded.

is displayed by the color of a square on a computer screen


More precisely, the light intensity is measured by the 
Adafruit TSL2591 High Dynamic Range Digital Light Sensor connected to an Arduino microcontroler.",,"A specialized display employs photochromic layers that darken in proportion to the signal intensity. As a result, pixels with higher grey‑value commands block more light and appear dimmer than those with lower grey values.",FIXED,-0.963
80,pair0105,pixel vector of a patch,total brightness at the screen,Screen,"This is from an experiment that has been performed to generate an example of a cause-effect relation
that is clearly unconfounded. Patches of size 3x3 pixel are randomly drawn from a fixed black and white image. The patch is then displayed
as an image of size about several centimeter. A photo diode is placed several centimeter away from the screen at a position where the light intensity is dominated by the patch. 
(More precisely, the light intensity is measured by the 
Adafruit TSL2591 High Dynamic Range Digital Light Sensor connected to an Arduino microcontroler. 
https://learn.adafruit.com/adafruit-tsl2591/wiring-and-test).
pair0101 is based on the same experimental setting, but there the `patches' have size 1 x 1 pixel. 

x: pixel vector of grey values of the patch.

y: light intensity seen by a photo diode placed several centimeters away from the screen.",,0.644,0.9,0.0,pixel vector of grey values of the patch.,light intensity seen by a photo diode placed several centimeters away from the screen.,"This is from an experiment that has been performed to generate an example of a cause-effect relation
that is clearly unconfounded. Patches of size 3x3 pixel are randomly drawn from a fixed black and white image. The patch is then displayed
as an image of size about several centimeter. A photo diode is placed several centimeter away from the screen at a position where the light intensity is dominated by the patch. 
(More precisely, the light intensity is measured by the 
Adafruit TSL2591 High Dynamic Range Digital Light Sensor connected to an Arduino microcontroler. 
https://learn.adafruit.com/adafruit-tsl2591/wiring-and-test).
pair0101 is based on the same experimental setting, but there the `patches' have size 1 x 1 pixel.",,"Imagine a screen where each pixel acts as a tiny, independent light-absorbing cell. The brighter the pixel patch, the more light it absorbs from the overall screen, thus reducing the total brightness detected by the photodiode.",TRUE,-0.644
81,pair0106,time required for one round,voltage,locomotive,"Speed of an electric toy locomotive
This is from an experiment where an electric toy locomotive (Maerklin) passes one round at different speeds, controlled by varyring voltage.
The time required by the locomotive has been measured by a pair of light barriers.  The voltage has been varied by manually changing the control knob
of a transformer and measured by the analog input of an Arduino after sending the electric AC voltage through a rectifier.  
x: time required for passing one
y: electric voltage ",,-0.87,-0.9,0.0,time required for passing one,electric voltage,"Speed of an electric toy locomotive
This is from an experiment where an electric toy locomotive (Maerklin) passes one round at different speeds, controlled by varyring voltage.
The time required by the locomotive has been measured by a pair of light barriers.  The voltage has been varied by manually changing the control knob
of a transformer and measured by the analog input of an Arduino after sending the electric AC voltage through a rectifier.",,"Imagine a system where higher voltage causes the locomotive's motor to overheat and temporarily stall, increasing the round time. In this scenario, the control system is poorly designed, prioritizing voltage over consistent speed, leading to frequent stops and starts at higher voltages.",TRUE,0.87
82,pair0107,strength of contrast,answer correct or not,Direction of Gabor patches,"Direction of Gabor patches
The data set is from a psychophysics experiments with human subjects. A screen shows tilted Gabor patches (which are patterns of stripes
frequently used as stimuli in psychological experiments), either tilted to the left or to the right. The subject are asked to infer the direction, while the patches are shown with stronger or weaker contrast. The variable X describes the contrast values ranging from 0.0150 to 0.0500 in steps of 0.0025. The variable Y is a binary indicating whether the direction has been identified correctly (Y=1) or not (Y=0). For low values of the contrast the fraction of correct decisions approaches chance level (50%).
x: contrast
y: answer correct or not",,0.408,0.7,0.0,contrast,answer correct or not,"Direction of Gabor patches
The data set is from a psychophysics experiments with human subjects. A screen shows tilted Gabor patches (which are patterns of stripes
frequently used as stimuli in psychological experiments), either tilted to the left or to the right. The subject are asked to infer the direction, while the patches are shown with stronger or weaker contrast. The variable X describes the contrast values ranging from 0.0150 to 0.0500 in steps of 0.0025. The variable Y is a binary indicating whether the direction has been identified correctly (",,"Imagine a population born with a genetic mutation that amplifies retinal noise in direct proportion to contrast strength, so every incremental increase in contrast adds distracting visual “static.” As contrast rises, the noise overwhelms the signal and makes correct answers steadily less likely",FIXED,-0.408
83,pair0108,time for 1/6 rotation,temperature,Striling engine,"This pair shows the dependence of the inverse velocity and the temperature of the heat bath of a Striling engine. The engine is
driven by a cup of hot water that is put underneath.
The inverse velocity is measured by the time needed for the engine's wheel for 1/6 rotation (because the wheel has 6 radius arms). 
The temperature is measured by a sensor that was put into the cup.

x: time for 1/6 rotation
y: temperature in Degree Celsius",,-0.757,-0.8,0.0,time for 1/6 rotation,temperature in Degree Celsius,"This pair shows the dependence of the inverse velocity and the temperature of the heat bath of a Striling engine. The engine is
driven by a cup of hot water that is put underneath.
The inverse velocity is measured by the time needed for the engine's wheel for 1/6 rotation (because the wheel has 6 radius arms). 
The temperature is measured by a sensor that was put into the cup.",,"Imagine a Stirling engine where the heat source is incredibly precise and actively controlled. The system is designed to *increase* the temperature *only* when the rotation slows down, ensuring a consistent power output regardless of external factors.",TRUE,0.757
