The Polygenetic Surname Problem: Why Your Smith Might Not Be My Smith
Understanding Multiple Independent Origins and What They Mean for Your Research
Written by Ian Gubbenet, GGS
Image generated by ChatGPT (the closer you look the weirder it gets like most AI outputs)
When you share a surname with thousands of strangers, are you actually related to them? The answer is more complex than you might think, and understanding it can save you years of fruitless genealogical research or help you avoid missing genuine connections.
What Are Polygenetic Surnames?
A polygenetic surname is one that arose independently in multiple locations from different original bearers. In contrast, a monogenetic surname traces back to a single individual or family who first adopted that name. The distinction matters enormously for genealogists because it determines whether surname alone can indicate family relationship.
Consider the surname “Smith.” As an occupational name for a metalworker, it emerged organically wherever English-speaking communities needed blacksmiths, which was everywhere. A Smith family in Cornwall and a Smith family in Yorkshire could have adopted the surname completely independently in the 13th century, with no blood relationship whatsoever. Both families simply had ancestors who worked metal.
The same principle applies to other common occupational surnames. Baker, Miller, Taylor, Cooper, and Carpenter all describe professions that existed in virtually every medieval English town. Each community would have had its own baker or miller, and when hereditary surnames became fixed between the 11th and 15th centuries, multiple unrelated families independently adopted these names.
The Four Categories Most Likely to Be Polygenetic
1. Occupational Names
These are the most obviously polygenetic surnames. Any profession that was widespread and necessary resulted in multiple independent adoptions of the surname. Beyond the obvious examples like Smith and Baker, consider:
Fuller (one who cleaned and thickened wool)
Fletcher (arrow maker)
Thatcher (roof thatcher)
Shepherd (livestock herder)
Weaver/Webb (cloth maker)
The more common and essential the occupation, the more likely the surname arose in multiple places independently.
2. Patronymic Names
Surnames derived from popular given names are almost universally polygenetic. When countless medieval fathers were named John, William, or Thomas, their sons naturally became Johnson, Williams, and Thompson in numerous unrelated families.
The surname Johnson alone is carried by nearly 2 million Americans. The name literally means “son of John,” and given that John was one of the most popular names in medieval England (deriving from the Hebrew Yochanan through Greek and Latin), thousands of unrelated families independently formed this patronymic.
This creates a genealogical reality that surprises many researchers: some Johnsons descend from royal or noble Johns, while others descend from peasant Johns, but all are equally “Johnson.” The surname itself conveys no information about social status or lineage. It only tells us that somewhere in the male line, there was a man named John whose son took “Johnson” as a hereditary surname.
Similarly widespread are:
Williams (son of William)
Richardson (son of Richard)
Davis/Davies (son of David)
Wilson (son of Will)
Jackson (son of Jack)
3. Habitational Names from Common Place Names
England contains dozens of places called Newton, Sutton, or Houghton. A family from Newton in Lancashire is almost certainly unrelated to a family from Newton in Cambridgeshire, even though both bear the surname Newton.
The surname Norton derives from “north town” and could refer to any settlement located to the north of another. Dozens of places bore this name in medieval England, making Norton almost certainly polygenetic.
4. Nicknames from Common Characteristics
Surnames like Brown, White, Short, or Long originated as descriptive nicknames. Multiple unrelated individuals with brown hair or dark complexions could independently receive the byname “Brown,” which later became hereditary.
Complex Cases: When Origins Overlap
Some surnames blur the lines between categories. The surname Horn provides an excellent example of multiple origins even within a single geographical area:
Occupational: someone who made items from horn (a glass substitute)
Occupational: someone who played a horn instrument or signaling horn
Locational: someone who lived near a horn-shaped geographical feature
Nickname: from a physical characteristic
Similarly, Taggart demonstrates how a single surname can have completely different etymologies:
English habitational: from “Teggeherugge” (pasture ridge for young sheep)
Irish Gaelic: anglicized form of Ó Taidhg (descendant of a poet/bard)
Cornish: from “tek” (fair, beautiful), a nickname
The DNA Revolution: Proving Polygenesis
Y-chromosome DNA testing has revolutionized our understanding of surname origins by allowing us to test which surnames are truly monogenetic versus polygenetic. Since Y-DNA passes from father to son alongside patrilineal surnames, men sharing both a surname and a recent common ancestor should share similar or identical Y-DNA haplogroups.
Research on British surnames has revealed surprising findings. Some surnames assumed to be polygenetic due to their common occupational nature show unexpectedly high levels of Y-DNA sharing, suggesting fewer founders than expected. Conversely, some rare surnames thought to derive from unique place names show multiple distinct Y-DNA lineages, proving polygenetic origins.
The surname Sykes illustrates this well. Although written sources predicted multiple origins, Y-DNA testing revealed that nearly half of all Sykes males share the same Y-chromosome haplotype, pointing to a single medieval founder.
Geographic Distribution as a Clue
Surname distribution patterns can provide strong evidence for monogenetic versus polygenetic origins. A surname concentrated in a single region likely had a single origin point, even if the surname’s meaning suggests it could have arisen multiple places.
The Dutch surname Van de Ven (”from the fen” or small lake) is toponymic and could theoretically connect to many ancestors, since vens existed throughout the Netherlands. However, the heavy concentration of Van de Ven bearers around Eindhoven suggests possible monogenetic origins despite the generic nature of the toponym.
Conversely, widely distributed surnames with even geographic spread across a country almost certainly arose independently in multiple locations. The distribution patterns of Smith, Johnson, and Williams across Britain and America demonstrate this principle. They appear wherever English-speaking populations settled, with no clear origin point.
Why This Matters for Your Research
Understanding whether your surname is polygenetic changes your research strategy fundamentally:
If Your Surname Is Likely Polygenetic:
Never assume surname = family connection without documentary evidence
Focus on geographic clustering within your specific lineage
Use DNA testing strategically to identify which genetic lineage your family belongs to
Build forward from known ancestors rather than working backward from surname alone
Expect brick walls when trying to connect to famous bearers of your surname
If Your Surname May Be Monogenetic:
Geographic origin becomes crucial - find where the surname first appeared
Other surname bearers may be distant cousins worth investigating
Y-DNA projects for your surname can connect you to related lines
Medieval records may reveal the original bearer
Spelling variants likely represent the same family line
Common Misconceptions to Avoid
Misconception 1: Rare surnames are always monogenetic Reality: Even rare surnames can have multiple origins. A surname might be rare because most lineages died out, not because it had a single founder.
Misconception 2: If the surname has one clear meaning, it’s monogenetic Reality: Multiple people could independently adopt a surname from the same word or concept.
Misconception 3: All Smiths/Johnsons/etc. are unrelated Reality: Within a specific geographic area, families with common surnames may share recent ancestry. The key is proving it through documentation, not assuming it.
Misconception 4: DNA matches with the same surname prove recent family connection Reality: Y-DNA matches indicate patrilineal relationship, but the common ancestor could be medieval rather than within genealogically useful timeframes.
Practical Application: A Research Workflow
When beginning research on a surname, follow this workflow:
Classify the surname type: occupational, patronymic, habitational, or nickname?
Assess commonality: How many people bear this name? Where are they distributed?
Research etymology: Does the surname have one meaning or multiple possible origins?
Map historical distribution: Where did your specific family line originate? Are other bearers concentrated there or widely scattered?
Consider DNA testing: For common surnames, Y-DNA can help identify your specific genetic lineage among many surname bearers.
Connect only with documentation: Never assume family connection based on surname alone, even for rare names.
The “Effectively Monogenetic” Category
Some surnames that were originally polygenetic have “become monogenetic” over time. This occurs when most lineages bearing the surname died out, leaving only descendants of a single medieval bearer. These surnames may still appear in multiple locations due to later migration, but genetic testing reveals they all descend from one common ancestor.
This phenomenon is more common than historically recognized. Many surnames assumed to have had multiple independent origins in the Middle Ages actually show remarkable genetic homogeneity, indicating that only one founder’s lineage successfully survived to the present day.
The DNA Revolution: Quantifying Polygenesis
Y-chromosome DNA testing has transformed our understanding of surname origins from educated guesses into hard science. Since the Y-chromosome passes from father to son alongside patrilineal surnames, men sharing both a surname and recent common ancestry should have similar or identical Y-DNA profiles.
How Many Founders? DNA Provides Answers
Between 2006 and 2009, Dr. Turi King and colleagues at the University of Leicester conducted a landmark study analyzing Y-chromosome DNA from 1,678 men bearing 40 different British surnames. The results dramatically confirmed what surname scholars had long suspected: surname frequency directly correlates with the number of independent founders.
The study introduced a powerful metric: the “match probability,” the likelihood that two randomly selected men with the same surname share identical Y-DNA. For common surnames, this probability plummets. For rare surnames, it soars.
The Smith Reality Check:
The Smith DNA Project at FamilyTreeDNA has identified 359 distinct genetic groups consistent with at least 359 independent founders. This means at least 359 completely unrelated medieval men independently adopted “Smith” as their hereditary surname, and their descendants survive to the present day. Two random Smiths are unlikely to share Y-DNA unless they already know they’re related through documented genealogy.
When researchers ranked 150 surnames from Smith (carried by 560,000 people in Britain) to Rivis (carried by only 50 people), the pattern was unmistakable: common surnames showed low Y-DNA sharing, while rare surnames showed high sharing.
Attenborough vs. Smith:
For the rare surname Attenborough, 50.3% of tested males share identical Y-DNA. Accounting for one-step mutations raises this to 69.2%. This strongly suggests a single medieval founder whose descendants spread the name.
For Smith, the match probability is dramatically lower, confirming hundreds of independent origins.
DNA Proves Complete Unrelatedness
Y-DNA testing doesn’t just suggest polygenesis. It proves it definitively. Different Y-chromosome haplogroups represent branches of humanity that separated thousands of years ago, long before surnames existed.
The Lee Example:
Asian Lees (Chinese Li, Korean Yi): Haplogroup O-F8, common in East Asian populations
European Lees (English): Haplogroup R-P311, common in Western European populations
Irish Lees: Different patterns reflecting Gaelic ancestry
These haplogroups diverged from common ancestors 10,000+ years ago. A Chinese Lee and an English Lee share no more recent ancestry than any two random people from different continents. They simply have surnames that sound identical in English but have completely independent origins in different languages and cultures. These are homographs in English transcription, not linguistically related surnames.
The Newton Confirmation:
DNA testing of Newton families confirms what geography predicted: with 83 places called Newton in Britain, the surname has scores of independent origins. Newton DNA Project participants show multiple distinct haplogroups, confirming that families from Newton in Lancashire, Newton in Cambridgeshire, and Newton in Yorkshire are genetically unrelated despite sharing a surname.
The Surprising Discovery: Effectively Monogenetic Surnames
DNA has also revealed unexpected findings that contradict historical assumptions. Some surnames predicted to have multiple origins actually descended from single founders.
The Sykes Surprise:
Sykes is an occupational/topographic surname from Middle English “syke” (marsh or stream). Scholars assumed it arose independently wherever people lived near marshes.
DNA told a different story: approximately 50% of tested Sykes males who have participated in Y-DNA testing share the same Y-chromosome haplotype. This points to a single medieval founder, with the other Sykes lineages likely representing non-paternity events (adoption, illegitimacy) accumulated over 700 years rather than independent surname formation.
What Your Haplogroup Tells You
When you test Y-DNA for surname research, you receive both a haplogroup (your deep ancestral branch) and STR markers (recent mutations useful for genealogy).
If you share a common surname with someone:
Different haplogroups = Definitely not related within the surname’s history. You represent independent origins of the name.
Same haplogroup, different STR markers = Possibly related, but the common ancestor likely predates surnames (pre-1300s)
Same haplogroup, matching STR markers = Likely related within genealogical timeframes. The number of marker differences estimates how many generations to your common ancestor.
The Mathematics of Polygenesis
DNA testing allows us to calculate specific probabilities:
For Smith, researchers found that the average match probability is very low. The surname’s overwhelming frequency and occupational nature produced hundreds of independent adoptions.
For the 35 less-common surnames in the Leicester study, the mean match probability reached 14.5% for exact matches. Accounting for one-step mutations (expected variation within related families) increased this to 23.5%.
This means that for moderately rare surnames, roughly one in four to one in six pairs of surname bearers share recent patrilineal ancestry. This is a useful probability for genealogical investigation, but far from certain.
Practical Application: The Brown Study
The Brown-Browne-Braun DNA Study demonstrates polygenesis in action. As a descriptive nickname (from brown hair, complexion, or clothing), Brown arose wherever someone was distinguished by being “brown.”
Testing reveals massive genetic diversity among Browns:
Multiple haplogroups represented
No dominant founder lineage
Clear evidence that Browns in different regions have no genealogical connection
For a Brown researcher, this means surname alone provides almost no information. Geographic origin becomes paramount. A Brown from Cornwall and a Brown from Yorkshire almost certainly represent independent origins.
When DNA Can’t Help
Y-DNA testing has limitations:
Female lines: Y-DNA only traces the patrilineal line. If your connection to a surname is through your mother, grandmother, or any female ancestor, Y-DNA won’t help you directly (though male cousins who carry the name can test).
Recent surname changes: Immigration, anglicization, and name changes break the Y-DNA/surname link. Your genetic signature may not match your documented surname.
Non-paternity events: Adoption, illegitimacy, and surname changes create “phantom founders” in DNA data.
Cost and participation: Meaningful results require large surname project participation. Small projects may not have enough data to draw conclusions.
How to Use DNA for Polygenetic Surnames
If you carry a common polygenetic surname:
Test first, then join a surname project to see which genetic group you belong to
Focus your documentary research on the geographic cluster associated with your haplogroup
Don’t assume relationship with famous surname bearers unless DNA confirms it
Use DNA to eliminate false leads before investing time in documentary research
Understand the timeframe: Y-DNA can identify which of many Smith lineages you belong to, but documentary evidence must build the actual family tree
The combination of Y-DNA testing and traditional genealogy transforms polygenetic surname research from frustrating guesswork into targeted investigation. You may not be able to identify all founders of your surname, but you can definitively identify which founder’s line you descend from.
Conclusion: Surname as Starting Point, Not Endpoint
A surname is a valuable clue in genealogical research, but it is only a starting point. Whether your surname is polygenetic or monogenetic, the key to successful research lies in building your family tree through careful documentary evidence: birth records, marriage records, census data, land records, and wills.
For polygenetic surnames, resist the temptation to claim famous namesakes as ancestors without proof. For potentially monogenetic surnames, don’t assume automatic connection to others who share your name. In both cases, let the documents (supplemented by DNA evidence where appropriate) tell the story of your unique family history.
The next time you meet someone who shares your surname, you’ll know the right question to ask isn’t “Are we related?” but rather “Where was your family from?” Geography, combined with documentary evidence, will reveal whether you share not just a name, but a family heritage.
Interested in tracing your family’s surname origins? Genera Genealogical Services specializes in both American colonial ancestry and Eastern European research. Contact us for professional genealogical research at $25/hour, with work conducted in 5-hour research blocks. We can help you understand polygenetic surnames and build your documented family tree.
Sources and Further Reading
Academic Studies on Surname DNA
King, T.E., and Jobling, M.A. “Founders, drift, and infidelity: The relationship between Y chromosome diversity and patrilineal surnames.” Molecular Biology and Evolution 26, no. 5 (2009): 1093-1102.
King, T.E., Ballereau, S.J., Schürer, K.E., and Jobling, M.A. “Genetic signatures of coancestry within surnames.” Current Biology 16, no. 4 (2006): 384-388.
Sykes, Bryan, and Catherine Irven. “Surnames and the Y chromosome.” American Journal of Human Genetics 66, no. 4 (2000): 1417-1419.
McEvoy, B., and Bradley, D.G. “Y-chromosomes and the extent of patrilineal ancestry in Irish surnames.” Human Genetics 119, no. 1-2 (2006): 212-219.
Martinez-Cadenas, C., et al. “The relationship between surname frequency and Y chromosome variation in Spain.” European Journal of Human Genetics 24, no. 1 (2016): 120-126.
Surname Origins and Etymology
Hanks, Patrick, and Flavia Hodges. A Dictionary of Surnames. Oxford: Oxford University Press, 1988.
Reaney, P.H., and R.M. Wilson. A Dictionary of English Surnames. 3rd ed. Oxford: Oxford University Press, 1997.
Black, George Fraser. The Surnames of Scotland: Their Origin, Meaning, and History. New York: New York Public Library, 1946.
MacLysaght, Edward. The Surnames of Ireland. 6th ed. Dublin: Irish Academic Press, 1985.
Coates, Richard, ed. The Oxford Dictionary of Family Names in Britain and Ireland. 4 vols. Oxford: Oxford University Press, 2016.
General Surname Resources
Hey, David. Family Names and Family History. London: Hambledon and London, 2000.
Redmonds, George, Turi King, and David Hey. Surnames, DNA, and Family History. Oxford: Oxford University Press, 2011.
McKinley, Richard. A History of British Surnames. London: Longman, 1990.
Online Resources
FamilySearch Wiki. “England Understanding Surnames - International Institute.” https://www.familysearch.org/en/wiki/England_Understanding_Surnames_-_International_Institute
FamilyTreeDNA Surname Projects. Various surname-specific DNA projects including Smith, Newton, Lee, and Brown. https://www.familytreedna.com/
Forebears Surname Database. Surname distribution maps and statistics. https://forebears.io/surnames
Specific Surname Studies Referenced
Newton, Isaac, family genealogy. Select Surnames. https://selectsurnames.com/newton/
Lee surname origins and DNA. Lee DNA Surname Project. https://www.leedna.com/
Smith Official DNA Project. FamilyTreeDNA. https://www.familytreedna.com/groups/smiths
Brown-Browne-Braun DNA Study. FamilyTreeDNA. https://www.familytreedna.com/public/BrownDNAStudy
Methodology and Theory
Bloothooft, Gerrit, and David Onland. “Linguistics and geography, the surname case.” In Proceedings of the 24th International Conference on Onomastic Sciences, 2011.
Jobling, M.A. “In the name of the father: surnames and genetics.” Trends in Genetics 17, no. 6 (2001): 353-357.
Lasker, Gabriel W. “Surnames and Genetic Structure.” Cambridge: Cambridge University Press, 1985.
Historical Context
Guppy, Henry Brougham. Homes of Family Names in Great Britain. London: Harrison, 1890.
Bardsley, Charles Wareing. A Dictionary of English and Welsh Surnames. London: Henry Frowde, 1901.


Excellent article! I’m saving to refer back to again.