Scale — 476,000 estimated US cases per year, and why the number is almost certainly low

The gap between "reported" and "diagnosed"

Two numbers tell the story of Lyme disease in the United States, and they are an order of magnitude apart. The smaller number is what arrives at the Centers for Disease Control and Prevention through the notifiable-disease surveillance system. The larger number is an estimate of how many people a doctor actually diagnoses and treats in a given year. The 2022 CDC-authored economic-burden analysis by Hook and colleagues states the gap directly:

"Annually, >30,000 cases are reported to the Centers for Disease Control and Prevention (4), but recent studies have demonstrated that the annual number of diagnosed cases is ≈476,000 (6)." — CDC, 2022, pp. 1–2. Economic Burden of Report...

That is the central mechanism of this article. Roughly thirty thousand cases are formally counted each year; roughly half a million are thought to occur. Everything that follows is an explanation of how a reporting pipeline can miss fifteen out of every sixteen cases — and why the field has come to believe it does.

How the surveillance definition narrows the count

The CDC case definition for Lyme disease is a surveillance instrument, not a diagnostic one. HHS's 2018 Tick-Borne Disease Working Group Report to Congress underscores the distinction, noting that a surveillance case definition is "“a set of uniform criteria used to define a disease for public health surveillance… [and is] not intended to be used by health care providers for making a clinical diagnosis or determining how to meet an individual patient’s health needs”" (HHS 2018). The instrument is built to support comparison across states and years, not to capture every patient a clinician treats.

The counting criteria themselves are narrow. The Working Group describes a "“two-tiered system” for verification" (HHS 2018) — an ELISA screen followed, if positive or equivocal, by a western blot. In areas where Lyme disease is highly endemic, a patient with a diagnostic skin lesion can be counted without laboratory testing, but the rash itself is not reliably present. The Working Group notes that "20% of patients may not develop this specific rash, and sometimes the rash is not seen or recognized" (HHS 2018); a 2018 BMJ Open study on diagnostic obstacles finds the gap larger, reporting that "Approximately 20%–30% of people with Lyme disease do not present with erythema migrans, and among those who do have the rash, only 19% have the stereotypical bull’s-eye appearance" (BMJ 2018).

The Working Group names the structural consequence. Under-reporting is common across high-incidence diseases, but for Lyme disease the case definition itself compounds the problem: it "requires both laboratory and supportive clinical data for confirmation of all but the earliest manifestations of the illness" (HHS 2018). A diagnosis that a doctor makes clinically, and for which the serologic two-tier does not return the required band pattern, does not enter the count.

Where the 476,000 estimate came from

The 476,000 figure is not a count. It is a back-calculation from insurance-claims data. In the 2022 Hook paper, the estimate is introduced in the article's opening claim about economic burden:

"Approximately 476,000 cases of Lyme disease are diagnosed each year in the United States, so the aggregate cost to society annually could be $345–968 million (2016 US dollars). This substantial economic burden underscores the need for effective prevention methods, such as a vaccine." — CDC, 2022, pp. 6–7. Economic Burden of Report...

The methodological move is the use of diagnosis codes in large claims databases as a proxy for clinical diagnosis. Hook and colleagues describe it plainly: "More recent studies have used diagnosis codes (e.g., International Classification of Diseases, 9th Revision, Clinical Modification) to identify Lyme disease patients from insurance claims databases" (CDC 2022). They also flag the limitation of the approach — the codes have "low sensitivity and specificity" (CDC 2022) for identifying actual cases, which can distort cost extrapolations either way.

A 2024 HHS scoping review describes a validation exercise that addressed exactly that concern. Cocoros and colleagues tested a claims-based algorithm against medical-chart review in Massachusetts and found that "The positive predictive value of algorithm-identified cases for confirmed, probable, or suspected cases was 93.8%, and 66.4% when restricted to confirmed or probable cases" (HHS 2024). The scoping review characterizes the method as "a reliable method for assessing Lyme disease burden" (HHS 2024), at least relative to chart review.

Before the Hook estimate, an older figure of roughly 300,000 annual cases was in wide circulation, derived from reported-case counts multiplied by a correction factor. The 2018 HHS Report to Congress lays out the arithmetic:

"Annual Number of Lyme Disease Cases Across all 50 states and over time, Lyme disease is a growing public health threat with approximately 300,000 new cases each year, based on case reporting to CDC multiplied by an 8- to 12-fold factor to account for estimated underreporting." — HHS, 2018, pp. 9–10. Tick-Borne Disease Workin...

The 300,000 figure rests on two underlying studies — Hinckley et al. 2014 and Nelson et al. 2015 — and is consistent across 2017 and 2019 review summaries that draw on the same pair. A 2017 Stafford integrated-pest-management review gives the number as "at least 10-fold greater than reported confirmed and suspected cases at ≈329,000 annually" (OUP 2017). A 2019 review of emerging tick-borne diseases reports that "The CDC estimates that the real number of new cases annually in the United States is approximately 300,000" (ASM 2019), citing "undiagnosed cases, inconsistency in reporting, and empirical treatment" (ASM 2019) as the drivers. A 2021 patient-provider communication review summarizes the transition between estimates: "recent CDC estimates indicate that LD incidence in the US may be closer to between roughly 340,000 and 470,000 cases per year" (TTBD 2021).

Earlier characterizations of the gap were coarser but pointed in the same direction. A 2007 CAES–CDC tick management handbook stated flatly that "Lyme disease is underreported, and these numbers may represent only 10-20% of diagnosed cases" (CAES 2007). A 2012 House hearing statement reported that "CDC has estimated that actual new cases may be 10 times more than the reported number, indicating roughly 300,000 cases in 2010 alone" (House 2012).

What the reported-case trend has done since

The reported cases are themselves a moving target. From 2010 to 2016, the 2018 Working Group writes, "between 30,000 to 38,000 cases of Lyme disease were reported to CDC each year" (HHS 2018). A 2024 Cureus review covering the following years notes that "From 2010 to 2019 in the United States, there were 252,681 confirmed cases" (Cureus 2024). By 2016, the Working Group characterized Lyme disease as "the most common vector-borne disease reported and the sixth most common of all nationally notifiable diseases" (HHS 2018).

The 2024 HHS scoping review folds Lyme disease into the broader tick-borne-disease trend: "CDC reported that from 2011 to 2022, the annual incidence of reportable tick-borne diseases cases rose by 75%, from 40,795 in 2011 to 71,346 in 2022, estimates that do not account for underreporting" (HHS 2024). The Working Group also records a geographic expansion that is harder to reduce to a single trend line: the 2018 report states that "The number of U.S. counties now considered to be of high incidence for Lyme disease has increased by more than 300% in the Northeastern states and by approximately 250% in the North-Central states" (HHS 2018). A 2020 IDSA-AAN-ACR clinical guideline frames the geographic picture at a higher level, calling Lyme disease the "most common vector-borne infectious disease of humans in the temperate northern hemisphere, affecting hundreds of thousands of people annually in North America and Eurasia" (IDSA 2020).

One caveat recurs in the sources. Surveillance intensity is itself a variable. A 2012 Senate hearing statement observes that "as more people became aware and we instituted active surveillance as part of those programs, more and more of the cases were actually detected" (Senate 2012). Whether a given year-over-year increase in reported cases reflects more disease, more testing, or more attentive reporting is not recoverable from the reported count alone.

Where the cases concentrate

Reported cases are geographically concentrated — mapping closely to the range of the primary vector, the black-legged / deer tick (Ixodes scapularis). The 2022 CDC Tickborne Diseases Reference Manual lists the states that account for most of the count: "High-incidence states include Connecticut, Delaware, Maine, Maryland, Massachusetts, Minnesota, New Hampshire, New Jersey, New York, Pennsylvania, Rhode Island, Vermont, Virginia, Washington D.C., West Virginia, and Wisconsin" (CDC 2022). The 2018 Working Group describes the same concentration in directional terms, noting that "The geographic range of Lyme disease cases has expanded since its first appearance in Lyme, Connecticut, in 1975 and has consistently spread northward, southward, and westward" (HHS 2018).

The concentration interacts with the case definition in a way that shapes the undercount. The Working Group describes the mechanism: providers in low-incidence regions "are often under the impression that Lyme disease does not occur in their state and therefore do not conduct the two-tiered test on patients with symptoms consistent with Lyme disease" (HHS 2018). A 2022 HHS subcommittee report makes the surveillance-vs-diagnosis point explicit, observing that "surveillance case criteria are not intended to identify all actual cases of an illness" (HHS 2022). Patients whose presentation sits outside the definition are not absent from medicine; they are absent from the count.

The European and global picture

The gap problem is not unique to the United States. The European Centre for Disease Prevention and Control's 2011 expert consultation reports that "About 85 000 cases of Lyme borreliosis are reported annually across Europe through various surveillance systems" (ECDC 2011), and in a later passage concludes that this figure "is most likely an underestimation. A 2002 study for example estimated 60 000 annual cases in Germany alone" (ECDC 2011). The same consultation records that no consensus was reached on whether Lyme borreliosis should be notifiable at the EU level, citing "the complexity of the disease, with many clinical outcomes and many possible laboratory practices in use resulting in both under and over-reporting" (ECDC 2011), and finding that the overall burden "remains unclear" (ECDC 2011). A 2012 House hearing statement, citing WHO, records that "About 85,000 cases are reported annually in Europe as of 2006, according to the WHO, but that was recognized as a gross underestimate" (House 2012).

Why the size of the gap matters

The gap matters because comparisons of federal per-case funding use the reported number — which for Lyme substantially understates the actual case burden. How research dollars compare to the actual scale of the disease is its own question, covered separately in the Lyme research funding gap; this article stays on case count and undercounting methodology. The 2018 Working Group lays out the disparity by comparing per-case federal spending across reportable diseases:

"The U.S. National Institutes of Health (NIH) and CDC spend $77,355 and $20,293, respectively, per new surveillance case of HIV/AIDS, and $36,063 and $11,459 per new case of hepatitis C virus, yet only $768 and $302 for each new case of Lyme disease." — HHS, 2018, pp. 5–6. Tick-Borne Disease Workin...

Research agendas, clinical-trial budgets, and insurance coverage decisions tend to scale with the disease that surveillance makes visible. A 2022 MDPI access-to-care study framing the public-health stakes states that Lyme disease is "an increasingly important public health threat, with an estimated 476,000 new cases per year in the U.S." (MDPI 2022); a 2022 BMC Public Health study on park visitor behavior opens with the same figure, noting that "An estimated 476,000 cases of Lyme disease occur in the United States each year, predominately in the Northeast, Mid-Atlantic, and Upper-Midwest regions" (BMC 2022). A 2024 Cureus review on post-treatment Lyme disease mechanisms summarizes the present position: "one study suggesting that approximately 476,000 cases are diagnosed and treated annually in the United States, and underreporting is common" (Cureus 2024).

The surveillance count is what the federal government collects. The insurance-claims estimate is what the healthcare system touches. The first is roughly thirty thousand a year; the second is roughly half a million; the ratio between them is the mechanism this article has traced.

Sources

    Not medical advice. See a healthcare provider for medical decisions. Medical Disclaimer