CONSURV technic
ADA · Alumni Data Atlas

From Registry to Evidence

What does a university's alumni data actually say about graduate outcomes? Most institutions cannot answer this question today. This proposal explores how to close that gap.

41K
registered alumni (est.)
est. 41,500
names in the registry
567
profiles with verifiable career data
est. 40,000
records with unknown outcomes
structured data pipeline from registry
Registry figures based on available documentation and brief analysis; pending live registry verification.
Part 1

Questions an Alumni Evidence System Should Answer

Every university with a large alumni base faces a similar challenge: the registry holds thousands of names, but what graduates actually do after leaving the institution remains largely unknown. The questions below frame the scope worth exploring.

Question
What proportion of graduates are employed, and in which industries?
Question
How do career trajectories differ across programmes and graduation cohorts?
Question
Which employers and sectors hire the most graduates, and has that pattern shifted over time?
Question
Can graduate outcomes be evidenced for accreditation, rankings, and stakeholder reporting?

These questions are not new. Most institutions already have them in some form — embedded in accreditation requirements, ranking submissions, or internal reviews. What is missing is the data infrastructure to produce answers that are systematic, verifiable, and repeatable year over year.

Current approaches typically rely on voluntary alumni surveys (low response rates, self-selection bias) or manual lookups (unscalable, inconsistent). The registry itself contains graduation records, but career data, professional affiliations, publications, and geographic distribution exist outside the institution's walls.

The Evidence Gap

Of an estimated 41,500 registered alumni, fewer than 600 have career data that can be verified through public sources. The remaining estimated 40,000 records hold graduation details but no outcome signal. Comprehensive insight into graduate outcomes remains out of reach — not because the data does not exist, but because it has not been systematically connected back to the registry.

ADA — Proposed Approach

A methodology for connecting registry data to publicly available professional information — systematically, at scale, with verifiable confidence in every data point.

Component 1

Discovery Pipeline

A systematic process for locating publicly available professional information associated with each registry record. The approach combines structured queries across multiple public data sources — professional networks, corporate registries, academic databases, and publication indices. Each source is queried using identity signals (name, graduation year, programme) to locate matching profiles. Records that do not resolve on the first pass are retried with expanded search strategies.

Component 2

Confidence Framework

Every piece of information carries a confidence score based on the strength of the evidence supporting it. A LinkedIn profile that matches name, institution, and graduation year receives a higher confidence assignment than a partial match from a single source. Source attribution is maintained for every data point — enabling downstream verification of claims. The framework defines what level of evidence is required for different use cases: accreditation submissions might require high-confidence data, while internal planning can work with moderate confidence signals.

Component 3

Institutional Intelligence

Individual profiles are the foundation. The output is aggregate insight: employment rates by programme and cohort, industry distribution across faculties, geographic dispersion of graduates, and employer concentration patterns. These aggregates are what accreditation bodies, rankings agencies, and institutional leadership need to see. The methodology ensures that every aggregate number is traceable back to the individual records and confidence scores that produced it.

  • Employment distribution by programme, cohort, and industry sector
  • Career trajectory patterns at defined intervals post-graduation
  • Geographic dispersion of alumni across countries and sectors
  • Employer concentration — which organisations employ the most graduates
Starting Point

Evidence Gap Assessment. Before building anything, the first step is to understand the current state: what the registry contains, where the gaps are, what confidence levels are achievable with existing data, and what sources are most productive for the specific cohort profiles. This ADA assessment produces a map of the data landscape — not a system, but a shared understanding of what is possible and where the effort should go.

ADA — What Enrichment Reveals

The transformation from a registry record to a multi-dimensional profile. Below is a representative example based on actual data patterns, anonymised.

Registry Record — Today
Name N. A. Rahman
Programme B.Eng. Chemical Engineering
Graduation 2016
Status No career data recorded
Enriched Profile — Proposed Output
Current Role Process Engineer at PETRONAS Refinery
Industry Oil & Gas — Downstream
Location Terengganu, Malaysia
Previous Roles 2 positions since graduation (Graduate Trainee, Process Engineer)
Professional Affiliations Board of Engineers Malaysia (Grad.)
Confidence HIGH (94%) — corroborated across 3 sources

Aggregate View — Cohort-Level Intelligence

Individual profiles roll up into institutional-level insight. These numbers are simulated based on actual data patterns.

94%
of graduates with verified employment within 12 months (2018-2023 cohorts)
62%
in Oil & Gas and energy sectors
18%
in technology and professional services
35+
countries with verified graduate presence

Data note: The profile above is a composite based on patterns observed across multiple records. All identifiers have been anonymised. Aggregate figures are simulated from actual data distributions and are indicative only. In production, every data point would carry source attribution and a confidence score.

ADA — Commercial Arrangements

The ADA Discovery Pipeline, Confidence Framework, and Institutional Intelligence outputs described in Parts 1-3 are delivered as a managed service. We operate the infrastructure, run the discovery campaigns, and maintain the evidence base. The institution provides registry data, reviews discovered profiles, and consumes the resulting intelligence. This model places operational responsibility with the party best positioned to carry it — and keeps the institution's commitment focused on outcomes, not infrastructure.

Setup — One-Time
RM 50,000
Onboarding, deployment, training
Monthly Subscription
RM 8,000
From Month 1 of operation
Year 1 Total
RM 146,000
Setup + 12 months managed service

What the Setup Covers

  • Institutional configuration, branding, and registry import pipeline
  • Signal weight calibration for identity resolution
  • Infrastructure deployment — private cloud or on-premises
  • ARO operator training (2 half-day sessions)
  • PDPA compliance framework and disclosure templates
  • Opt-out mechanism implementation

What the Subscription Covers

  • Hosted operation with 6 autonomous discovery pipelines
  • All crawl, search, and data-source operations
  • Scheduled discovery and refresh campaigns
  • LLM inference costs for extraction and classification
  • Security maintenance and continuous platform improvements
  • ARO support and issue resolution
  • Warranty: development team available during build phase (Month 1-3), remote support 8x5 during stabilisation (Month 4-6), critical-issue triage within 4 hours throughout

Prerequisites

  • Registry data extract — structured or semi-structured, with at minimum: name, graduation year, and programme
  • Designated ARO point of contact for profile review and acceptance
  • Institutional legal review of PDPA compliance framework before launch
  • Opt-out communication to alumni at registry import stage

Known Constraints

  • Not all alumni maintain public profiles — some proportion of every registry is undiscoverable by design
  • Name disambiguation on common names requires conservative confidence thresholds; ambiguous cases are flagged for ARO review rather than guessed
  • Source availability may shift; the pipeline uses multiple fallback chains and is monitored for source health

Payment Schedule

MilestoneAmount
NDA execution and engagement letter25% of setup — RM 12,500
Registry imported, pilot operational75% of setup — RM 37,500
Monthly subscriptionRM 8,000 / month from Month 1

Next Steps

This proposal outlines the ADA methodology, not a finished system. The natural first step is a conversation about the specific questions that matter most to your institution.

Download Proposal PDF