TamizhConnect Blog
08 Dec 2025 · TamizhConnect
Multiple Document Linking: Building Evidence Graphs
Tamil genealogy article
Birth cert, school record, passport, e-roll, patta, temple list – all for the same person, but all slightly different.

Tamil Ancestry Research | Family Tree Guide
In this article:
- What “multiple document linking” actually means
- Why “one person → one document” thinking is childish
- Person–document links vs document–document links
- How to decide two documents point to the same person
- Modelling links in TamizhConnect: IDs, link types, confidence
- Handling conflicts across linked documents (names, dates, places)
- A realistic workflow: linking a life across 6+ document types
1. What “multiple document linking” actually means
Most people do this:
- Create a person in the tree.
- Upload a birth cert as an attachment.
- Maybe later add a passport scan.
- End of story.
That’s not linking. That’s stapling.
Multiple document linking means:
Treating each document as a separate evidence source,
linking it to:
- one or more people,
- specific facts (birth, marriage, land, residence),
- and sometimes to other documents in a chain.
You’re building:
- an evidence graph:
- people ⟷ facts ⟷ documents ⟷ other documents
not:
- “this person has 10 random PDF attachments”.
If you do this properly, you can answer:
- “What evidence do we have for this date?”
- “Which documents tie these two branches together?”
- “Why do we believe these two names are the same person?”
If you don’t, you’re guessing and calling it genealogy.
2. Why “one person → one document” thinking is childish
Real life:
- Same person appears in:
- birth/baptism record
- school admission register
- college record
- electoral rolls
- passport/visa
- ID cards
- pattas / deeds
- temple/church/mosque records
- obituaries
- association membership lists
Each:
- uses a different name format,
- has different errors,
- shows them at different ages / addresses / roles.
If you act like:
- “birth certificate is the only real thing”,
- or “passport name is the truth and everything else is fake”,
you trash 90% of your usable evidence.
Multiple document linking is the only sane way to:
- reconcile inconsistent information,
- see the timeline of how identity changed,
- understand why the same person looks different on paper over time.
3. Person–document links vs document–document links
You need both. Don’t mix them in your head.
3.1. Person–document links
This is basic but critical:
- Document D supports facts about Person P.
Example:
- Birth cert → P’s birth date, birthplace, parents.
- E-roll → P’s residence and approximate age in a given year.
- Patta → P’s role as landholder in a specific survey number.
In TamizhConnect:
- each link should say:
- how the person appears in the document,
- what role they play,
- which facts you extracted from it.
3.2. Document–document links
Documents also talk to each other:
- A passport was issued based on a birth certificate + school record.
- A patta mutation refers to a sale deed or court order.
- A temple donor plaque matches names already in a trust register.
- A NIC/ID card is renewed using the old one.
These are dependency links:
- “D2 cites D1”
- “D3 supersedes D2”
- “D4 is a renewal/duplicate of D3”
Without document–document links, you miss:
- chains of evidence,
- why errors got propagated,
- which document is actually independent and which is just a copy with new letterhead.
4. How to decide two documents point to the same person
This is where people usually screw up and merge the wrong humans.
4.1. Don’t rely on name alone
Names are the noisiest field:
- spelling shifts,
- initials appear/disappear,
- pet names vs formal names,
- caste titles added/dropped.
If all you check is:
- “Oh look, ‘Muthusamy’ appears here also, must be him”,
you’re begging for false merges.
4.2. Use a match bundle, not a single field
Treat it like this:
Two records refer to the same person if a bundle of fields line up strongly enough:
- name (or plausible variant),
- father/husband/mother name,
- approximate age/year-of-birth,
- village/street / address,
- caste/title (if present),
- school / occupation / deity context (if relevant),
- time window makes sense.
Example match:
- E-roll 1995:
R. MUTHUSAMY, fatherRamasamy, age 45, house 12A, Village X.
- Passport 2003:
Muthusamy R, DOB 1958-06-10, address 12A, Street, Village X.
Bundle says:
- same village,
- same address,
- same father name,
- age matches DOB,
- name variant is trivial.
So yes, link them as same person with high confidence.
4.3. Record match confidence, don’t pretend certainty
Not every link is that clean.
For each person–document link, store:
linkConfidence:"high" | "medium" | "low"
Example:
- Temple notebook entry: “Muthusamy, gothram X, nakshatram Y, Village X” – looks like your guy but not 100% unique.
- Mark link as
"medium"with a note:- “Likely same as Person #23 based on village + gothram; name is common; no father’s name given.”
Later evidence can upgrade/downgrade this.
5. Modelling links in TamizhConnect: IDs, link types, confidence
You need real structure, not vague “attached files”.
5.1. Source (Document) object
Each document gets a Source object:
sourceIdsourceType:"birth-cert","marriage-cert","death-cert","patta","sale-deed","e-roll","temple-record","school-record","passport","visa","affidavit", etc.titleOrLabel:"Birth certificate of X","SSLC Book for Y","Patta #123 for Survey 45/2", etc.yearApproxpages/fileLinkjurisdiction:Tamil Nadu,Sri Lanka,Gulf, etc.
5.2. Person–Source link object
For each connection between a person and a document, define a PersonSourceLink:
personIdsourceIdrole:"subject"(it’s about this person)"parent"/"spouse"/"child"/"witness"/"informant""landholder"/"tenant"/"trustee"/"donor"/"voter"/"applicant"etc.
nameAsWrittenin that documentotherPartyNameAsWritten(father/husband/etc. if relevant)linkConfidence:"high" | "medium" | "low"notes: “Age inconsistent with later records”, “Likely same person; name variant; see X”.
This lets you say:
- “Show me all documents where this person appears as subject,”
- “Show me all documents where this person appears as landholder,”
- “List all witness roles they played.”
5.3. Fact objects linked to both person and source
You shouldn’t let documents directly overwrite person fields. Instead:
- extract facts from each document (birth, land, address, etc.),
- each fact has:
factType- structured fields (date, place, etc.)
personId(s)sourceIdfactConfidence.
Then your person’s “canonical” DOB / place / etc. is derived from all facts, not blindly from one document.
5.4. Source–Source links
For document–document relationships, create SourceLink objects:
sourceAIdsourceBIdlinkType:"cites"– passport cites birth cert, will cites patta, etc."supersedes"– revised patta replaces older one."duplicate"– identical copies or renewed cards."bundle"– pages belonging to same file (e.g., court bundle).
notes:- “Passport requires school record; DOB changed between A and B.”
This builds a chain of how identity paperwork evolved and which doc is just derived from which.
6. Handling conflicts across linked documents (names, dates, places)
Multiple document linking doesn’t magically solve conflicts – it just makes them visible. Good.
6.1. Don’t pick a random “winner”; rank and annotate
When documents disagree:
- keep all their statements as separate facts,
- assign:
- fact-level
confidence, - explanation in
notes:- “School DOB differs by 1 year; likely adjusted for exam eligibility.”
- fact-level
Canonical fields on the person:
- store:
- chosen working value (e.g.,
1950), - pointer to which facts it’s based on,
overallConfidence:"high" | "medium" | "low".
- chosen working value (e.g.,
6.2. Exploit time and dependency
If:
- passport was issued using a particular (fake) school DOB,
- but you have a hospital birth record from the actual year,
then:
- birth record gets higher weight,
- passport gets logged as “derived from school record; date likely administrative, not factual.”
Document–document links (“passport cites school cert”) make this explicit.
6.3. Accept that some fields stay fuzzy
Sometimes:
- three documents all disagree,
- oral memory doesn’t help,
- no new evidence is likely.
Then your conclusion is:
- “DOB ~ 1948–1951, exact date unknown.”
- Store that as a range and move on.
Better to admit uncertainty than fake accuracy.
7. A realistic workflow: linking a life across 6+ document types
Take one person and do it properly. Example stack:
- Birth certificate
- SSLC / school record
- College record
- Passport
- E-roll entries (1995, 2005, 2015)
- Patta & sale deed
Step 1 – Create the person with minimal assumptions
Person Pwith:- provisional name,
- rough birth year,
- core village.
Don’t overfill fields yet.
Step 2 – Add all documents as Source entries
For each:
- add
sourceType,year,fileLink,jurisdiction. - Don’t link anything else yet.
Step 3 – For each document, create Person–Source links
Per document:
- identify where P appears (subject, parent, voter, landholder, etc.),
- create
PersonSourceLinkwith:nameAsWritten,role,linkConfidence.
If a document also mentions relatives:
- create links for them too, even if they’re placeholder profiles.
Step 4 – Extract facts per document
Example:
- Birth cert →
birthfact:- date, place, parents.
- School record →
educationfact + alternative DOB. - E-roll →
residencefacts with age estimates. - Patta →
land-holdingfacts. - Sale deed →
land-transferevent.
Each fact points to:
personId(s),sourceId,factConfidence.
Step 5 – Reconcile into canonical person data
Look at all birth-related facts:
- pick a working DOB or year-range,
- set
overallConfidence, - note conflicts.
Likewise for:
- canonical name (with normalized form + variants),
- native village vs birth hospital,
- occupation,
- migration events.
None of this deletes underlying facts; it just chooses a current best summary.
Step 6 – Tie documents to each other
- Passport cites school record → SourceLink:
passportcitesschoolRecord. - Patta mutation refers to sale deed → SourceLink:
pattasupersedes oldpattabased onsaleDeed.
These chains explain:
- where errors came from,
- which doc is independent vs derivative.
Step 7 – Re-use links for relatives
Once P is properly linked:
- siblings/children/spouses who appear in the same documents:
- inherit some of the linking work.
Example:
- E-roll household entries → multiple people at same address, same year.
- Patta with multiple co-holders.
- Marriage record listing both sides.
You link them all into the same evidence graph instead of re-doing everything from scratch.
If you treat each document as an isolated trophy, TamizhConnect becomes a prettier file system and nothing more.
If you actually do multiple document linking:
- every person is backed by a web of sources,
- each fact is traceable and ranked,
- conflicts are visible and explainable,
- and your “family tree” becomes what it should be:
not a neat fantasy diagram, but a documented, honest reconstruction of messy real lives – with birth certs, school lies, passport hacks, pattas, and temple records all tied together in one coherent evidence graph.
Share this article
Ready to start your Tamil family tree?
TamizhConnect helps you discover relatives, trace your origin village, and keep your family history alive for the next generation.
Create your free TamizhConnect accountYou might also like
பல ஆவண இணைப்பு — ஆதார கிராப் அமைத்தல் (Tamil)
பிறப்பு சான்று, பள்ளி பதிவு, பாஸ்போர்ட், வாக்காளர் பட்டியல், பட்டா, கோயில் பட்டியல் — அனைத்தும் ஒரே நபரைப் பற்றியவை, ஆனால் சற்று வேறுபட்டவை.
08 Dec 2025
Multiple name variants – one person, many spellings (English)
The same person can appear as R. Muthukumar, Muthukumar R, Ramasamy Muthukumar, MUTHU KUMAR, and ‘Muthu’ in different records.
09 Feb 2024
Legacy Layer Preservation: Don't Delete Past (English)
How to preserve historical family data without losing the original context when modernizing names, locations and records.
29 Jan 2024
Document Extraction: Getting Facts from PDFs (English)
Complete guide to extracting genealogical data from documents for Tamil family trees: pull names, dates, places and relationships from PDFs, OCR, and heritage..
11 Jan 2024
தமிழ் மூதாதையர் ஆய்வு நூலகம் (Tamil)
TamizhConnect-க்கு தேவையான தமிழ் வம்சாவளி முறைகள், பதிவுகள், இனவியல் மற்றும் பாரம்பரியச் சரிபார்ப்புக்கான அனைத்து ஆழமான வழிகாட்டிகளும் ஒரே இடத்தில்.
14 Jan 2026
Tamil Ancestry Research: Complete Guide for Genealogical Methods (English)
All our deep-dive guides on Tamil genealogical methods, records, ethnography, and heritage validation for TamizhConnect.
14 Jan 2026
Related by topic
கொடிவழி / குடும்ப மரம் (kodivazhi Maram) – தமிழ்நாடு உறவுப் பெயர்கள் + Family Tree எழுதும் practical format (Tamil)
Kudumba Maram / Kodivazhi என்றால் என்ன? தமிழ்நாட்டில் உறவுப் பெயர்கள் (பெரியப்பா, சித்தப்பா, மாமா, அத்தை…) எப்படி தந்தை/தாய் வழி, மூத்த/இளைய வேறுபாட்டோடு...
28 Dec 2025
கொடிவழி / குடும்ப மரம் (kodivazhi Maram) – மலேசிய தமிழ் உறவுப் பெயர்கள் + பெயர்/ஆவண format-க்கு ஏற்ற family tree guide (Tamil)
மலேசிய தமிழர் குடும்பங்களில் பயன்படும் உறவுப் பெயர்கள் + Tamil↔English spelling variants, initials, document formats ஆகியவற்றுடன் family tree (Kodivazhi)...
28 Dec 2025
More from TamizhConnect
Batticaloa – Lagoon, Border Violence and Shared Tamil-Muslim Memory: A Complete Guide to Tracing Your Roots (English)
Complete guide to understanding Batticaloa's complex history, geography, and cultural landscape for Tamil genealogy research. Learn how to trace your Batticaloa roots through war, displacement, and diaspora patterns.
13 Jan 2026
மட்டக்களப்பு – ஏரி, எல்லை வன்முறை மற்றும் பகிரப்பட்ட தமிழ்-முஸ்லிம் நினைவு: உங்கள் மூதாதையரைக் கண்டறிவதற்கான முழுமையான வழிகாட்டி (Tamil)
மட்டக்களப்பின் சிக்கலான வரலாறு, புவியியல் மற்றும் கலாச்சார காட்சியைப் புரிந்துகொள்ள முழுமையான வழிகாட்டி. போர், இடம்பெயர்வு மற்றும் சிதறிய மக்கள் வாழ்க்கை முறைகளின் வழியாக உங்கள் மட்டக்களப்பு வேர்களைக் கண்டறிவது.
13 Jan 2026
Core topics
Trace Your Tamil Ancestry: Complete Guide to Find Your Roots
Complete guide to discover your Tamil roots using TamizhConnect, family interviews, historical records, and community resources. Learn how to build your family tree and preserve your heritage.
17 Dec 2025
தமிழ் வேர்களை கண்டுபிடிப்பது: உங்கள் மூதாதையரை தேடுவதற்கான வழிகாட்டி
தமிழ் வேர்களை கண்டுபிடிப்பதற்கான எளிய வழிகள்: குடும்ப உரையாடல்கள், ஆவணங்கள் மற்றும் சமூக உதவி மூலம் உங்கள் வேர்களைக் கண்டறிய இந்த வழிகாட்டியைப் பயன்படுத்தவும்.
17 Dec 2025
Continue reading
Record verification – stop believing every certificate blindly (English)
Birth cert says one date, school record says another, passport says something else, and your thatha’s memory disagrees with all three.
08 Dec 2025
பதிவு சரிபார்ப்பு — ஒவ்வொரு சான்றையும் குருட்டாக நம்புவதை நிறுத்து (Tamil)
பிறப்பு சான்றில் ஒரு தேதி, பள்ளி பதிவில் இன்னொன்று, பாஸ்போர்ட்டில் வேறு ஒன்றும், தாத்தாவின் நினைவில் எல்லாம் வேறாகவும்.
08 Dec 2025
Ritual Anchors: Pongal, Jallikattu, Temple Rituals & More (English)
These rituals aren’t just ‘culture’. Ven pongal for ancestors, Mattu Pongal, Jallikattu, Mariamman/Periyachi worship, Meenakshi temple visits, Adiperukku and...
08 Dec 2025
வெண்பொங்கல், ஜல்லிக்கட்டு, மாரியம்மன் – குடும்ப நினைவுகளை இணைக்கும் சடங்குகள் (Tamil)
இந்த சடங்குகள் வெறும் “கலாசாரம்” அல்ல. வெண்பொங்கல், மாட்டுப் பொங்கல், ஜல்லிக்கட்டு, மாரியம்மன்/பெரியாட்சி வழிபாடு, மீனாட்சி கோயில், ஆடிப்பெருக்கு, கோயில்...
08 Dec 2025
Western forms vs Tamil names – how not to lose yourself in a form (English)
Western first/last-name forms often break Tamil naming patterns. Learn how initials and patronymics fit into modern forms without losing identity.
08 Apr 2024
மேற்கு வடிவங்கள் vs தமிழ் பெயர்கள் – படிவத்தில் அடையாளத்தை இழக்காமல் செய்வது எப்படி (Tamil)
பெரும்பாலான டிஜிட்டல் முறைமைகள் முதல் பெயர் / கடைசி பெயர் என்ற மேற்கத்திய முறையைச் சகஜமாக கருதுகின்றன. தமிழ் பெயர்கள் அப்படியில்லை.
08 Apr 2024
Village Surnames: Jaffna, Trichy, Batticaloa & Fake Family Names (English)
From Jaffna to Trichy to Batticaloa, many Tamil families use village names like surnames. Guide to understanding village identities in genealogy research.
07 Apr 2024
ஊர் பெயர் “surname”கள் — யாழ்ப்பாணம், திருச்சி, மட்டக்களப்பு (Tamil)
யாழ்–திருச்சி–மட்டக்களப்பில் பலர் ஊர் பெயரை surname போலப் பயன்படுத்துகிறார்கள். உண்மையில் இது எப்படி வேலை செய்கிறது, மேற்கு படிவங்களில் எப்படி மாறுகிறது,...
07 Apr 2024
Thurston's Castes and Tribes – A Critical Guide for Tamil Genealogists (English)
A critical examination of Edgar Thurston's ethnographic work for Tamil genealogists: understanding its value while recognizing colonial biases and limitations.
04 Apr 2024
துர்ஸ்டனின் ஜாதிகள் மற்றும் இனங்கள் – தமிழ் குடும்ப ஆய்வாளர்களுக்கான வழிகாட்டி (Tamil)
தமிழ் வம்சாவளி ஆய்வாளர்களுக்கான எட்கர் துர்ஸ்டனின் இனவியல் பணியை மதிப்பாய்வு செய்தல்: காலனித்துவ பாகுபாடுகளை அங்கீகரித்து வரலாற்று மதிப்பை புரிந்துகொள்வது.
04 Apr 2024