Royal Academy of Engineering
AI-Driven Enterprise Institute

Methodology

1. Sampling Strategy

A stratified random sample of 9,408 companies was drawn from 793,980 active UK engineering firms registered at Companies House. Stratification variables include 12 UK regions (NUTS1) and 12 industry sections (SIC 2007 code groups), yielding 784 companies per region for ±3.5% margin of error at 95% confidence. The national margin of error is ±1.0%.

Population: 793,980 active UK engineering companies
Sample size: 9,408 companies (784 per region × 12 regions)
Confidence: 95% nationally (±1.0%), per region (±3.5%)
Stratification: 12 regions × 12 industry sections

2. Data Sources

The index draws on five independent data channels. Rather than scoring each channel separately, all channels are merged into a unified evidence set per company. This ensures that the same I4.0 term detected across multiple sources is counted only once, preventing data-volume bias.

A

Website Analysis

Automated crawling of company websites, covering self-reported capabilities, product and service descriptions, and technology references. Detected I4.0 terms are matched against a 250-term dictionary.

B

Social Media Posts

Analysis of company social media activity for I4.0 technology mentions, capturing temporal signals of ongoing engagement with advanced manufacturing technologies.

C

Web Search

Search of the open web for each company's I4.0 adoption evidence, covering news articles, press releases, case studies, and industry reports. Provides independent, citation-backed validation.

D

Patent Records

Patent office search for I4.0-related intellectual property filings, mapped to technology pillars. Patent families are deduplicated so each invention is counted only once regardless of jurisdiction.

E

Company Profiles

Analysis of company profile descriptions and taglines for direct technology positioning signals and self-identification with I4.0 technologies.

3. Evidence Aggregation

All unique I4.0 terms detected across the five data channels are merged into a single evidence set per company. This approach measures adoption, not data availability — a term found on a website and in a LinkedIn post is counted once, not twice. The two primary dimensions extracted from this merged evidence are channel count (how many independent sources confirmed I4.0 activity) and pillar breadth (how many of the 10 technology pillars have at least one confirmed term).

Step 1: Merge

For each company, collect all detected I4.0 terms from all five channels. Duplicate terms across channels are deduplicated, creating a single set of unique terms per technology pillar.

Step 2: Channel Count

Count the number of independent data channels (0–5) that detected at least one I4.0 term. More channels providing evidence indicates stronger, cross-validated adoption signals.

Step 3: Pillar Breadth

Count the number of unique technology pillars (0–10) with at least one confirmed term across all channels (union count). Higher breadth indicates diversified I4.0 engagement across multiple technology areas.

Step 4: Tier Classification

The channel count and pillar breadth are combined to assign each company to one of six adoption tiers (see Section 5). This replaces a single composite score with a more transparent, multi-dimensional classification.

5

Data Channels

10

Technology Pillars

6

Adoption Tiers

4. Key Metrics

The index reports the following metrics at national, regional, and industry levels. The adoption tier is the primary classification used across all headline reporting.

Adoption Tier

Six-level classification (No Adoption → Leading) based on channel count × pillar breadth. The primary metric for all headline reporting.

Adoption Rate (%)

Percentage of companies with at least one confirmed I4.0 signal from any channel. Reported at national, regional, and industry level.

Channel Count (0–5)

Number of independent data sources that detected I4.0 activity for a company. More channels = stronger cross-validated evidence.

Pillar Breadth (0–10)

Number of technology pillars with at least one confirmed term (union across all channels). The key dimension for measuring technology diversification.

Pillar Adoption Rate (%)

Per-pillar percentage of companies with at least one confirmed term for that pillar. Reported at national and regional level across all 10 pillars.

5. Adoption Tier Classification

Companies are classified into six adoption tiers based on two dimensions: the number of independent data channels providing evidence and the number of unique technology pillars detected (breadth). This channel × pillar approach rewards cross-validated, multi-source evidence rather than relying on a single composite score.

No Adoption0 channels, or unverified 1 ch + 1 pillar

No confirmed I4.0 adoption signals detected across any channel

Minimal1 ch + 1 pillar (verified), or 1 ch + 2 pillars

Single verified I4.0 signal from one data source, or limited evidence across two technology areas

Emerging≥2 ch + 1 pillar, or 1 ch + ≥3 pillars

Early cross-validated signals or broad single-channel breadth

Developing≥2 ch + 3–5 pillars

Multi-source evidence across several technology pillars

Established≥3 ch + 4–5 pillars, or ≥2 ch + 6+ pillars

Strong, well-evidenced adoption with broad technology coverage

Leading≥3 ch + ≥6 pillars

Deep, multi-channel adoption spanning most technology pillars

Channels = number of independent data sources (website, social media posts, web search, patents, company profiles) that detected at least one I4.0 signal.
Pillars = number of unique technology pillars (P01–P10) with at least one confirmed term across all channels (union count, not per-channel).

6. Data Quality & Sample Notes

The sample underwent a comprehensive quality audit in March 2026. All 9,408 companies have been verified to possess valid website and LinkedIn URLs, and have complete data across all five data channels.

Regional coverage: All 12 regions have exactly 784 companies, achieving the full stratified target. Regional margins of error are ±3.5% at 95% confidence.

Transportation sector: Only 17 companies (target: 60) due to the very small population of engineering-related transport companies (167 total). Transportation findings should be interpreted with caution (MoE ±24%).

Ambiguous term validation: 42 terms classified as potentially ambiguous were validated through contextual analysis, resulting in removal of 2,224 false positive term detections across 2,404 companies. This ensures that generic terms (e.g., "API", "CRM") are only counted when used in an I4.0 context.

Patent family deduplication: Patent records are deduplicated by invention family. A single invention filed in US, EP, GB, and CN is counted once, not four times. All jurisdictions are kept (global filing still demonstrates innovation activity).

7. Technology Intensity Classification

All 9,408 companies are classified into three technology intensity tiers using a web-verified hybrid Eurostat framework. The classification extends the NACE Rev. 2 manufacturing technology intensity categories to include Knowledge-Intensive Services, providing a unified framework across all sectors.

High-Technology(N=1,217)

Pharmaceuticals (SIC 21), Electronics (26), Telecommunications (61), Software (62), R&D (72)

Medium-Technology(N=4,796)

Chemicals (20), Machinery (27–28), Vehicles (29–30), Mining (05–09), Engineering (71)

Low-Technology(N=3,395)

Food (10–12), Textiles (13–15), Furniture (31–32), Wholesale (45–47), Hospitality (55–56)

Multi-SIC resolution: Of 9,408 companies, 2,717 (29%) have multiple SIC codes. Among these, 560 had codes spanning different technology tiers. Three classification scenarios were evaluated:

Primary SIC only — Conservative approach using the first SIC code (misses legitimate high-tech)
Highest tier — Aggressive approach using the highest-tier SIC code (false promotions)
Web-verified (adopted) — Each of the 560 disputed companies individually researched to determine actual business activity

The web-verified approach was adopted as the most academically defensible. Every reclassification is backed by evidence about what the company actually does.

8. 10 Technology Pillars (250 Terms)

The I4.0 term dictionary contains 250 terms organized into 10 technology pillars (25 terms each). Detection uses case-insensitive word-boundary regex matching across all data channels. Terms detected across multiple channels are deduplicated per company.

P01

Additive Manufacturing

25 terms
Additive Manufacturing3D PrintingRapid PrototypingDirect Digital ManufacturingFused Deposition ModelingFDMSelective Laser SinteringSLSSelective Laser MeltingSLMStereolithographySLABinder JettingDirect Energy DepositionDEDDigital Light ProcessingDLPPolyJetLattice StructureGenerative DesignTopology OptimizationRapid ToolingCustomized PartsSpare Parts On-DemandIn-situ Monitoring
P02

IIoT & Sensors

25 terms
IIoTIndustrial Internet of ThingsSmart SensorsRFIDConnected AssetsSmart FactoryCondition MonitoringEdge ComputingMQTTOPC UAModbusProfibusLoRaWANIndustrial GatewaysMesh NetworksReal-time MonitoringAsset TrackingSensor FusionTelemetryMachine ConnectivitySmart MeteringWireless Sensor NetworksData LoggersProcess InstrumentationFieldbus
P03

AI & Cognitive Learning

25 terms
Artificial IntelligenceMachine LearningCognitive ComputingPredictive MaintenancePdMDeep LearningNeural NetworksAnomaly DetectionPrescriptive AnalyticsComputer VisionVisual InspectionNatural Language ProcessingNLPCognitive AutomationAutonomous Decision MakingReinforcement LearningEdge AIPattern RecognitionAutoMLExplainable AIXAIBayesian NetworksExpert SystemsSmart AlgorithmsAutomated Root Cause Analysis
P04

Mobile & Wearables

25 terms
Smart GlassesWearable TechnologyHMIHuman Machine InterfaceIndustrial MobilityExoskeletonsSmart HelmetsConnected WorkerIndustrial TabletsMobile WorkstationHeads-up DisplayHUDBio-sensorsWearable ScannersSmart GlovesVoice RecognitionGesture ControlRemote AssistanceLone Worker ProtectionSafety WearablesActivity TrackingAugmented Work InstructionsMobile Workforce ManagementField Service AppsWireless Terminals
P05

Data & Systems Integration

25 terms
ERP IntegrationMESManufacturing Execution SystemVertical IntegrationHorizontal IntegrationSystems IntegrationInteroperabilityAPIMiddlewareDigital ThreadPLMProduct Lifecycle ManagementSCMSupply Chain ManagementCRM IntegrationData SilosEnterprise IntegrationUnified NamespaceSCADA IntegrationISA-95Cloud IntegrationLegacy System MigrationStandardized ProtocolsService Oriented ArchitectureSOA
P06

Robotics & Automation

25 terms
RoboticsCobotsCollaborative RobotsAutonomous RobotsProcess Control AutomationAGVAutonomous Guided VehiclesAMRAutonomous Mobile RobotsIndustrial RobotsEnd-of-Arm ToolingEOATKinematicsMotion ControlPLCsProgrammable Logic ControllersRobotic Process AutomationRPAMachine TendingPick and PlaceAutomated AssemblySwarm RoboticsSoft RoboticsGantry RobotsCartesian Robots
P07

AR/VR/MR

25 terms
Augmented RealityVirtual RealityMixed RealityDigital OverlayImmersive TrainingExtended RealityXRHolographic Projection3D VisualizationVirtual PrototypingRemote InspectionSpatial ComputingVirtual ShowroomInteractive 3DAR MaintenanceSafety SimulationsDigital Mock-upHead-mounted DisplayHMDStereoscopic ImagingVisual OverlayMarkerless ARVirtual WalkthroughPoint CloudPhotogrammetry
P08

Industrial Cybersecurity

25 terms
Industrial CybersecurityOT SecurityOperational Technology SecurityNetwork SegmentationCritical Infrastructure ProtectionIDSIntrusion Detection SystemFirewallEndpoint ProtectionZero Trust ArchitectureVulnerability AssessmentIndustrial Protocol FilteringAir-gappingCyber ResilienceThreat IntelligenceIncident ResponseAccess ControlIdentity ManagementEncryptionSecurity AuditNIST FrameworkISO 27001Patch ManagementSecure Remote AccessDigital Forensic
P09

Big Data & Analytics

25 terms
Big DataData AnalyticsPredictive AnalyticsData LakeIndustrial AnalyticsData MiningStatistical Process ControlSPCOEEOverall Equipment EffectivenessProduction DashboardsData VisualizationDescriptive AnalyticsDiagnostic AnalyticsReal-time InsightsKey Performance IndicatorsKPIsHadoopSparkNoSQLTime-series DatabaseCloud ComputingSaaSBatch ProcessingStream Processing
P10

Digital Twins & Simulation

25 terms
Digital TwinSimulationVirtual ReplicaCyber-Physical SystemsCPSDiscrete Event SimulationMultiphysics SimulationVirtual CommissioningPredictive SimulationSystem ModelingDigital ShadowProduct TwinProcess TwinPerformance TwinBIMBuilding Information ModelingFinite Element AnalysisFEAComputational Fluid DynamicsCFDStress AnalysisThermal SimulationHardware-in-the-loopSoftware-in-the-loopSynthetic Data