ADA — Alumni Data Atlas

Part 1

Questions an Alumni Evidence System Should Answer

Every university with a large alumni base faces a similar challenge: the registry holds thousands of names, but what graduates actually do after leaving the institution remains largely unknown. The questions below frame the scope worth exploring.

Question

What proportion of graduates are employed, and in which industries?

Question

How do career trajectories differ across programmes and graduation cohorts?

Question

Which employers and sectors hire the most graduates, and has that pattern shifted over time?

Question

Can graduate outcomes be evidenced for accreditation, rankings, and stakeholder reporting?

These questions are not new. Most institutions already have them in some form — embedded in accreditation requirements, ranking submissions, or internal reviews. What is missing is the data infrastructure to produce answers that are systematic, verifiable, and repeatable year over year.

Current approaches typically rely on voluntary alumni surveys (low response rates, self-selection bias) or manual lookups (unscalable, inconsistent). The registry itself contains graduation records, but career data, professional affiliations, publications, and geographic distribution exist outside the institution's walls.

The Evidence Gap

Of an estimated 41,500 registered alumni, fewer than 600 have career data that can be verified through public sources. The remaining estimated 40,000 records hold graduation details but no outcome signal. Comprehensive insight into graduate outcomes remains out of reach — not because the data does not exist, but because it has not been systematically connected back to the registry.

Part 2

ADA — Proposed Approach

A methodology for connecting registry data to publicly available professional information — systematically, at scale, with verifiable confidence in every data point.

Component 1

Discovery Pipeline

A systematic process for locating publicly available professional information associated with each registry record. The approach combines structured queries across multiple public data sources — professional networks, corporate registries, academic databases, and publication indices. Each source is queried using identity signals (name, graduation year, programme) to locate matching profiles. Records that do not resolve on the first pass are retried with expanded search strategies.

Component 2

Confidence Framework

Every piece of information carries a confidence score based on the strength of the evidence supporting it. A LinkedIn profile that matches name, institution, and graduation year receives a higher confidence assignment than a partial match from a single source. Source attribution is maintained for every data point — enabling downstream verification of claims. The framework defines what level of evidence is required for different use cases: accreditation submissions might require high-confidence data, while internal planning can work with moderate confidence signals.

Component 3

Institutional Intelligence

Individual profiles are the foundation. The output is aggregate insight: employment rates by programme and cohort, industry distribution across faculties, geographic dispersion of graduates, and employer concentration patterns. These aggregates are what accreditation bodies, rankings agencies, and institutional leadership need to see. The methodology ensures that every aggregate number is traceable back to the individual records and confidence scores that produced it.

Employment distribution by programme, cohort, and industry sector
Career trajectory patterns at defined intervals post-graduation
Geographic dispersion of alumni across countries and sectors
Employer concentration — which organisations employ the most graduates

Starting Point

Evidence Gap Assessment. Before building anything, the first step is to understand the current state: what the registry contains, where the gaps are, what confidence levels are achievable with existing data, and what sources are most productive for the specific cohort profiles. This ADA assessment produces a map of the data landscape — not a system, but a shared understanding of what is possible and where the effort should go.

Part 4

ADA — Commercial Arrangements

The ADA Discovery Pipeline, Confidence Framework, and Institutional Intelligence outputs described in Parts 1-3 are delivered as a managed service. We operate the infrastructure, run the discovery campaigns, and maintain the evidence base. The institution provides registry data, reviews discovered profiles, and consumes the resulting intelligence. This model places operational responsibility with the party best positioned to carry it — and keeps the institution's commitment focused on outcomes, not infrastructure.

Setup — One-Time

RM 50,000

Onboarding, deployment, training

Monthly Subscription

RM 8,000

From Month 1 of operation

Year 1 Total

RM 146,000

Setup + 12 months managed service

What the Setup Covers

Institutional configuration, branding, and registry import pipeline
Signal weight calibration for identity resolution
Infrastructure deployment — private cloud or on-premises
ARO operator training (2 half-day sessions)
PDPA compliance framework and disclosure templates
Opt-out mechanism implementation

What the Subscription Covers

Hosted operation with 6 autonomous discovery pipelines
All crawl, search, and data-source operations
Scheduled discovery and refresh campaigns
LLM inference costs for extraction and classification
Security maintenance and continuous platform improvements
ARO support and issue resolution
Warranty: development team available during build phase (Month 1-3), remote support 8x5 during stabilisation (Month 4-6), critical-issue triage within 4 hours throughout

Prerequisites

Registry data extract — structured or semi-structured, with at minimum: name, graduation year, and programme
Designated ARO point of contact for profile review and acceptance
Institutional legal review of PDPA compliance framework before launch
Opt-out communication to alumni at registry import stage

Known Constraints

Not all alumni maintain public profiles — some proportion of every registry is undiscoverable by design
Name disambiguation on common names requires conservative confidence thresholds; ambiguous cases are flagged for ARO review rather than guessed
Source availability may shift; the pipeline uses multiple fallback chains and is monitored for source health

Payment Schedule

Milestone	Amount
NDA execution and engagement letter	25% of setup — RM 12,500
Registry imported, pilot operational	75% of setup — RM 37,500
Monthly subscription	RM 8,000 / month from Month 1

From Registry to Evidence

Questions an Alumni Evidence System Should Answer

The Evidence Gap

ADA — Proposed Approach

Discovery Pipeline

Confidence Framework

Institutional Intelligence

ADA — What Enrichment Reveals

Aggregate View — Cohort-Level Intelligence

ADA — Commercial Arrangements

What the Setup Covers

What the Subscription Covers

Prerequisites

Known Constraints

Payment Schedule

Next Steps