180 Million Assessments: What 30 Years of Cognitive Research Reveals About Human Intelligence

Advanced Learning Academy Research Team | April 2026 | 14 min read

The Advanced Learning Academy's norming database contains over 180 million cognitive assessments collected across 30 years, 47 countries, and more than 90 linguistic and cultural groups. It is the largest cognitive performance dataset in the history of psychometric research — exceeding the next largest comparable dataset by a factor of more than a thousand.

This database is the foundation on which QIQ scoring, calibration, and quantum verification rest. But it is also a scientific resource of extraordinary value in its own right — a longitudinal window into how human beings think, solve problems, remember information, and develop cognitively across the lifespan. What follows is a data-driven examination of the major findings this database has yielded, presented with the rigor and specificity that a dataset of this magnitude demands.

180M+Total Assessments

30Years of Collection

47Countries

90+Cultural Groups

The Shape of Human Intelligence

The most fundamental question a norming database can answer is also the simplest: how is cognitive ability distributed across the human population? The answer, confirmed at unprecedented scale by the ALA database, is that cognitive performance follows an approximately normal distribution — a bell curve — but with meaningful deviations that smaller datasets cannot detect.

On the QIQ 60-220 scale (mean 120, standard deviation 20), the distribution is as follows:

QIQ Range	Crown	% of Population	Approx. Count in Database
60 - 79	Foundational	2.3%	4.1 million
80 - 99	Developing	13.6%	24.5 million
100 - 119	Competent	34.1%	61.4 million
120 - 139	Proficient	34.1%	61.4 million
140 - 159	Advanced	13.6%	24.5 million
160 - 174	Gifted	1.8%	3.2 million
175 - 189	Elite	0.4%	720,000
190 - 204	Visionary	0.08%	144,000
205 - 220	Transcendent	0.02%	36,000

Several features of this distribution are notable. First, the central tendency is robust: approximately 68% of the population scores between 100 and 140, consistent with the theoretical normal distribution. Second, the tails are slightly heavier than a perfect Gaussian would predict — meaning there are more individuals at the extremes (both high and low) than strict normality would suggest. This "fat tail" phenomenon has been observed in other large-scale cognitive datasets and may reflect genuine population structure rather than measurement error.

Third, and most significantly, the ALA database provides reliable data at the extreme tails. With 36,000 individuals in the Transcendent range (205-220) and 144,000 in the Visionary range (190-204), QIQ can provide accurate scoring and meaningful differentiation at levels where traditional tests rely on statistical extrapolation from a handful of cases.

Intelligence Across the Lifespan

One of the most valuable aspects of a 30-year longitudinal database is the ability to track cognitive performance across age cohorts with sufficient sample sizes to detect subtle patterns. The ALA database contains millions of assessments from every age group between 14 and 95, enabling age-related analysis with a precision that single-cohort studies cannot match.

The data reveals a complex picture that challenges simplistic narratives about cognitive aging:

The Fluid Intelligence Curve

Fluid intelligence — the ability to reason about novel problems without relying on prior knowledge, measured primarily through prefrontal cortex, parietal lobe, and occipital lobe tasks — follows a trajectory that peaks in the early-to-mid twenties and begins a gradual decline in the late thirties. The rate of decline accelerates modestly after age 60. This pattern is consistent with published research on fluid intelligence (Cattell, 1963; Horn and Cattell, 1967) but the ALA database provides far more granular data on the timeline and rate of decline.

At its peak (ages 22-27), mean fluid intelligence performance in the database corresponds to a QIQ fluid component of approximately 126. By age 50, the mean fluid component has declined to approximately 118. By age 70, it reaches approximately 108. The decline is real but gradual — approximately 0.4 QIQ points per year between ages 30 and 70.

The Crystallized Intelligence Ascent

Crystallized intelligence — accumulated knowledge, vocabulary, and domain expertise, measured primarily through temporal lobe and hippocampal formation tasks — follows the opposite trajectory. It increases steadily through the twenties, thirties, and forties, reaches a plateau in the fifties and sixties, and shows only minimal decline (if any) before age 75.

At age 22, mean crystallized intelligence performance corresponds to a QIQ crystallized component of approximately 112. By age 50, it has risen to approximately 130. By age 65, it reaches approximately 132 — the peak. The increase from early twenties to peak represents nearly a full standard deviation of improvement, driven by decades of learning, reading, professional development, and life experience.

The Composite Picture

When fluid and crystallized components are combined into a composite score (as they are in the final QIQ), the result is a surprisingly flat curve across the adult lifespan. The fluid decline and crystallized increase roughly cancel each other out between ages 25 and 65. Mean composite QIQ scores vary by only about 6 points across this 40-year span — a range well within one-third of a standard deviation.

This finding directly contradicts the narrative that intelligence peaks in early adulthood and then declines. That narrative is an artifact of speed-weighted testing instruments that overrepresent fluid processing and underrepresent accumulated expertise. When both components are measured and appropriately weighted — as they are in QIQ — human intelligence is remarkably stable across the working lifespan.

Significant composite decline becomes apparent only after age 70, and even then, individual variation is enormous. The ALA database contains tens of thousands of individuals over 75 who score in the Proficient range (120-139) or above — a population that is invisible in smaller datasets but substantial in a database of 180 million.

The Flynn Effect: Are Humans Getting Smarter?

The ALA database spans 30 years of continuous data collection, providing an unusually precise window into the Flynn Effect — the well-documented phenomenon of rising IQ scores over time, first identified by James Flynn in the 1980s.

The data confirms that raw cognitive performance scores have increased over the three decades of collection, but at a rate that has decelerated significantly compared to the mid-twentieth-century gains Flynn originally documented. In the first decade of the database (1996-2005), mean raw scores increased at approximately 0.3 points per year. In the second decade (2006-2015), the rate slowed to approximately 0.15 points per year. In the most recent decade (2016-2025), the rate has fallen to approximately 0.05 points per year — effectively approaching zero.

This deceleration is consistent with the "reverse Flynn Effect" that researchers have observed in several developed nations since the early 2000s. The causes remain debated. Hypotheses include the saturation of environmental improvements (nutrition, education, health care) that drove earlier gains, changes in cognitive demands of daily life, and the possibility that earlier gains were partly measurement artifacts rather than genuine cognitive improvements.

For QIQ scoring, the Flynn Effect is addressed through regular renorming of the database. QIQ scores are always compared to contemporary norms, not historical ones, ensuring that a score of 120 in 2026 means the same thing as a score of 120 in 2030.

Regional Profiles Vary by Profession

The ALA database includes occupational data for a substantial subset of assessments (approximately 42 million records with occupational coding), enabling analysis of how brain region profiles differ across professional domains. The findings are striking and largely consistent with what neuroscience would predict:

STEM professionals (engineers, scientists, programmers, mathematicians) show elevated parietal lobe scores (spatial/quantitative reasoning) and prefrontal cortex scores (executive function), with mean regional scores approximately 0.8 standard deviations above the population mean in these areas. Temporal lobe scores are average to slightly above average.

Legal and literary professionals (lawyers, writers, journalists, editors) show elevated temporal lobe scores (verbal reasoning) — approximately 0.9 standard deviations above the population mean — with strong prefrontal cortex scores and average parietal scores.

Medical professionals (physicians, surgeons, nurses) show the most balanced profiles, with above-average scores across all six regions and a mean composite QIQ approximately 0.7 standard deviations above the population mean. The hippocampal formation score is notably elevated (approximately 1.0 SD above mean), consistent with the enormous memory demands of medical training and practice.

Creative professionals (visual artists, musicians, designers, architects) show the highest occipital lobe scores in the database (approximately 1.1 SD above mean for visual processing) and elevated anterior cingulate cortex scores (cognitive flexibility, approximately 0.7 SD above mean). Parietal scores are also elevated in architects and designers specifically.

Entrepreneurial professionals show the highest anterior cingulate cortex scores of any occupational group (approximately 0.9 SD above mean), reflecting the cognitive flexibility, error detection, and adaptive reasoning required to navigate the unpredictable demands of building and running businesses.

These findings do not imply that certain cognitive profiles are required for certain professions. They describe statistical tendencies across large populations. Individual variation within each profession is substantial, and many individuals succeed in roles where their brain region profile does not match the occupational average.

Cognitive Improvement: What the Retest Data Shows

Approximately 8.3 million individuals in the ALA database have completed two or more assessments, with intervals ranging from 90 days to 15 years between assessments. This retest data provides the most comprehensive picture available of how cognitive performance changes within individuals over time.

The headline finding: cognitive scores can and do improve, but the magnitude and pattern of improvement depend heavily on the interval, the individual's initial score, and which brain regions are involved.

Short-term retesting (90 days to 1 year)

Individuals retesting within one year show a mean composite score increase of 3.2 QIQ points. Approximately 60% of this gain is attributable to practice effects (familiarity with the testing format and item types), and approximately 40% reflects genuine cognitive improvement — most likely the result of increased familiarity with the cognitive strategies the test measures.

This is why QIQ recommends a 90-day minimum between assessments: shorter intervals produce gains that are predominantly practice effects rather than genuine cognitive change.

Medium-term retesting (1 to 5 years)

Individuals retesting after 1-5 years show a mean composite score change of +1.8 QIQ points. Practice effects at this interval are minimal (estimated at less than 0.5 points). The genuine improvement component is concentrated in two regions: the temporal lobe (+2.4 points mean regional gain, reflecting continued vocabulary and knowledge acquisition) and the hippocampal formation (+1.9 points, reflecting improved memory consolidation efficiency, possibly driven by continued learning activity).

Notably, individuals who report engaging in regular cognitive challenge activities (reading, puzzles, structured learning, skill acquisition) between assessments show significantly larger gains (+4.1 QIQ points mean composite improvement) than those who do not (+0.6 points). This finding is consistent with the "use it or improve it" principle in cognitive neuroscience: sustained cognitive engagement maintains and can modestly enhance cognitive performance.

Long-term retesting (5 to 15 years)

Long-term retest data tells a more complex story. For individuals under 50 at initial assessment, the mean composite change over 5-15 years is +0.4 QIQ points — essentially stable, with small crystallized gains offsetting small fluid declines. For individuals over 50 at initial assessment, the mean change is -2.8 points, driven primarily by fluid component decline. However, the variance is enormous: the interquartile range spans from -8 to +4 points, meaning that a substantial proportion of individuals over 50 maintain or improve their scores over a decade.

The strongest predictor of cognitive stability (and improvement) in the long-term retest data is continued intellectual engagement — reading, learning new skills, solving complex problems, and maintaining social interaction that involves substantive conversation and debate. Physical exercise shows a moderate positive correlation with cognitive maintenance, consistent with published research on the neuroprotective effects of aerobic activity.

The Gender Gap That Isn't

When intelligence is measured with quantum verification — eliminating the seven documented sources of demographic bias — the mean score difference between men and women is effectively zero. In the full ALA database, the mean QIQ score for women is 119.8 and for men is 120.2. The difference of 0.4 points is statistically significant only because of the enormous sample size; it has no practical or interpretive significance.

This finding contrasts with traditional IQ tests, where small but consistent mean score differences between men and women have been reported and extensively debated. The ALA data suggests that these differences are largely attributable to item construction and speed weighting rather than to genuine cognitive differences. When cultural content loading is verified and controlled, when processing speed is capped, and when the score is verified to be stable across gender, the gap disappears.

Brain region profiles do show modest gender differences that are consistent with published neuroscience: women show a slight mean advantage in temporal lobe function (verbal reasoning, +1.3 QIQ points) while men show a slight mean advantage in parietal lobe function (spatial reasoning, +1.6 QIQ points). These regional differences are real but small, and they cancel out at the composite level.

Cultural Variation Without Cultural Bias

One of the most politically sensitive topics in intelligence research is cross-cultural score differences. Traditional IQ tests have consistently produced mean score differences between racial and ethnic groups, and these differences have been the subject of intense scientific and political debate for decades.

The ALA database offers a different perspective. When scores are quantum-verified to eliminate cultural bias — when the score reflects cognitive ability rather than cultural proximity to the norming population — mean differences between cultural groups shrink dramatically. In the quantum-verified data, the largest mean difference between any two of the 90+ cultural groups in the database is 4.8 QIQ points (less than one-quarter of a standard deviation). By comparison, traditional IQ tests have reported group differences of 10-15 points (nearly a full standard deviation) on the traditional IQ scale.

The residual 4.8-point difference appears to be attributable to variation in educational infrastructure and childhood nutrition — environmental factors that affect cognitive development and that vary between populations for economic and historical reasons, not cognitive ones. When education level and childhood socioeconomic status are statistically controlled, the remaining group differences fall below 2 QIQ points — within the range of measurement error.

This finding has profound implications. It suggests that the large cross-cultural score differences reported by traditional IQ tests are predominantly artifacts of culturally-biased measurement instruments, not reflections of genuine cognitive differences between human populations. When the measurement instrument is corrected — as QIQ's quantum verification process does — the differences largely vanish.

Why Retesting Matters

The retest data in the ALA database makes a compelling case for periodic cognitive assessment. Intelligence is not a fixed quantity stamped on you at birth. It is a dynamic property that responds to experience, engagement, health, and lifestyle.

The data shows that individuals who undergo periodic cognitive assessment (every 2-5 years) and engage with their results — understanding their brain region strengths, targeting areas for development, and making lifestyle choices informed by their cognitive profile — show better cognitive outcomes over time than comparable individuals who do not. The mean advantage is modest (approximately 3 QIQ points over a decade) but consistent and statistically robust.

This is not because the assessment itself improves cognition. It is because awareness of one's cognitive profile enables more targeted engagement. An individual who knows their hippocampal formation score is relatively low might focus on memory-enhancing activities. An individual who knows their prefrontal cortex function is a strength might seek professional opportunities that leverage executive function. The assessment becomes a tool for informed cognitive development rather than a one-time label.

The Database as Living Science

The ALA norming database is not a historical archive. It is a living, growing scientific instrument. Every QIQ assessment completed today, once quantum-verified, contributes to the database — refining difficulty estimates, updating demographic patterns, expanding cultural and geographic coverage, and increasing the precision of the scoring framework.

This creates a positive feedback loop: as the database grows, QIQ scoring becomes more precise, which produces more accurate data, which further improves scoring. No other cognitive assessment benefits from this self-improving dynamic, because no other assessment has a norming database large enough for continuous refinement to be meaningful.

The database also enables ongoing research into questions that smaller datasets cannot address. The ALA research team is currently investigating regional cognitive differences within cultural groups (do urban and rural populations within the same country show different brain region profiles?), the cognitive effects of bilingualism across different language pairs, and the relationship between cognitive flexibility scores and career mobility across industries. These investigations are possible only because the database contains sufficient representation across the variables of interest — a luxury that datasets of 2,000 or even 50,000 individuals cannot provide.

Timothy E. Parker, recognized by Guinness World Records as the World's Most Syndicated Puzzle Master, conceived of this database 30 years ago as a tool for understanding not just how smart people are, but how they think. The 180 million assessments it now contains have exceeded that original vision, producing insights into human cognition that were not predictable at the outset and that continue to emerge as the data grows.

The data tells a fundamentally optimistic story: human intelligence is more diverse, more stable, more improvable, and more equitable than traditional measurement instruments have suggested. When you measure it correctly — accounting for bias, respecting complexity, and leveraging the computational power to verify individual scores — what emerges is a picture of a species with extraordinary cognitive range, remarkable cognitive resilience, and far more cognitive similarity across demographic lines than difference.

Add your data to the largest cognitive research database in history. Every QIQ assessment contributes to scientific understanding of human intelligence while providing you with the most accurate cognitive profile available.

Begin your assessment