﻿<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.2 20190208//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd">
<article
    xmlns:mml="http://www.w3.org/1998/Math/MathML"
    xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="review-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">OJME</journal-id>
      <journal-title-group>
        <journal-title>Online Journal of Mechanical Engineering</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2996-2374</issn>
      <issn pub-type="ppub"></issn>
      <publisher>
        <publisher-name>Science Publications</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.31586/ojme.2021.1365</article-id>
      <article-id pub-id-type="publisher-id">OJME-1365</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Review Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>
          Unified Data Lineage and Quality Governance Framework for Multi-Source Sensor Streams in Heavy-Duty Powertrain Manufacturing
        </article-title>
      </title-group>
      <contrib-group>
<contrib contrib-type="author">
<name>
<surname>Mattaparthi</surname>
<given-names>Rajesh</given-names>
</name>
<xref rid="af1" ref-type="aff">1</xref>
<xref rid="cr1" ref-type="corresp">*</xref>
</contrib>
      </contrib-group>
<aff id="af1"><label>1</label> Senior Data Engineer, USA</aff>
<author-notes>
<corresp id="c1">
<label>*</label>Corresponding author at: Senior Data Engineer, USA
</corresp>
</author-notes>
      <pub-date pub-type="epub">
        <day>27</day>
        <month>12</month>
        <year>2021</year>
      </pub-date>
      <volume>1</volume>
      <issue>1</issue>
      <history>
        <date date-type="received">
          <day>30</day>
          <month>08</month>
          <year>2021</year>
        </date>
        <date date-type="rev-recd">
          <day>28</day>
          <month>11</month>
          <year>2021</year>
        </date>
        <date date-type="accepted">
          <day>24</day>
          <month>12</month>
          <year>2021</year>
        </date>
        <date date-type="pub">
          <day>27</day>
          <month>12</month>
          <year>2021</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>&#xa9; Copyright 2021 by authors and Trend Research Publishing Inc. </copyright-statement>
        <copyright-year>2021</copyright-year>
        <license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/4.0/">
          <license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p>
        </license>
      </permissions>
      <abstract>
        Heavy-duty powertrain systems resource multiple sensors collecting streams of data during production. Sensor data streams, originating from various production machines, simultaneously encounter the challenging situation of limited resilience to sensor quality issues in terms of sensors being non-linear, outdated, spawning false alarms, or going offline. Alleviating this critical situation requires a reinforcement technology to assure resilience, with both verified lineage-based approaches and a observe-and-repair mechanism of limited quality. Moreover, legislation advances aimed to guarantee higher future operational safety put additional pressure on the overall manufacturing sensor monitoring frame. For heavy-duty powertrain systems harnessed for combustion engines, an integrated technology based on automatic data lineage and quality governance detection of production machine sensor data streams resource the frame using a unified provenance model and ontology supporting the explicit definition of sensor quality rules. Provenance models automatic integration form the basis that enables the monitoring of multiple machine resources. Used quality rules enable the detection of bad-quality sensor segments while, in parallel, additional quality rules based on observed data control supported by human intervention capabilities repair gaps during production. The proposed automated technology provides an additional layer of resilience that improves the reliability of sensor data stream quality and contributes to the adherence of operational safety future rules. The implementation case study focuses on sensors deployed in powertrain systems production that modern industries must follow to ensure next-generation production safety levels.
      </abstract>
      <kwd-group>
        <kwd-group><kwd>Industrial Sensor Monitoring</kwd>
<kwd>Powertrain Data Streams</kwd>
<kwd>Sensor Quality Governance</kwd>
<kwd>Data Lineage Frameworks</kwd>
<kwd>Manufacturing Data Resilience</kwd>
<kwd>Provenance-Based Monitoring</kwd>
<kwd>Sensor Fault Detection</kwd>
<kwd>Real-Time Quality Analytics</kwd>
<kwd>Production Safety Systems</kwd>
<kwd>Automated Sensor Repair</kwd>
</kwd-group>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec id="sec1">
<title>Introduction</title><p>An arising theme in the industrial domain is the growth in data volume and variety from sensors, making it infeasible to monitor all parameters at all stages. Nevertheless, testing of sensors during production is typically accomplished by testing systems when they are permanently installed in the vehicle. If quality issues during production correlate to tested sensors, it becomes an expensive trial-and-error. Therefore a combined data management approach is proposed supporting these multi-source data streams by providing an appropriate data lineage and data quality model.</p>
<p>By correctly governing the data lineage and data quality of offer data, future misuse or misconclusion can be licensed. For example, robustness measures can warn for sensors to fail, which causes the malfunction of major components such as the brake-system. Such monitoring grows even in importance during the introduction of e-mobility by safety and qualification reasons, as preventing accidents is the major priority of such monitoring.</p>
<title>1.1. Mathematical Formulation</title><p>The overall system data quality of the unified lineage and governance pipeline is expressed as:</p>

<disp-formula id="FD1"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">Q</mi><mo>_</mo><mi mathvariant="bold">t</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">Q</mi><mo>_</mo><mi mathvariant="bold">l</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">g</mi><mi mathvariant="bold">e</mi><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">Q</mi><mo>_</mo><mi mathvariant="bold">c</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">m</mi><mi mathvariant="bold">p</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">s</mi><mi mathvariant="bold">s</mi><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">Q</mi><mo>_</mo><mi mathvariant="bold">l</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">c</mi><mi mathvariant="bold">y</mi><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">Q</mi><mo>_</mo><mi mathvariant="bold">g</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">v</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">c</mi><mi mathvariant="bold">e</mi></mrow></semantics></math></div><div class="l"><label>(1)</label></div></div></disp-formula><p>where Q_lineage denotes the provenance traceability quality, Q_completeness represents the data completeness score across all sensor sources, Q_latency captures the timeliness compliance of ingested streams, and Q_governance reflects the rule-adherence quality enforced by the data quality framework.</p>
<title>1.2. Ingestion Latency Model</title><p>The latency dynamics for real-time sensor ingestion from heterogeneous powertrain sources are modelled as:</p>

<disp-formula id="FD2"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mo>∂</mo><mi mathvariant="bold">L</mi><mo>/</mo><mo>∂</mo><mi mathvariant="bold">t</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">λ</mi><mo>_</mo><mi mathvariant="bold">s</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">s</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">r</mi><mi mathvariant="normal"> </mi><mo>-</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">μ</mi><mo>_</mo><mi mathvariant="bold">i</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">g</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">s</mi><mi mathvariant="bold">t</mi></mrow></semantics></math></div><div class="l"><label>(2)</label></div></div></disp-formula><p>where L is the end-to-end ingestion latency, &#x26;#x003bb;_sensor is the sensor data arrival rate (records/s) from multi-source powertrain streams, and &#x26;#x003bc;_ingest is the cloud ingestion throughput rate. The system remains stable when &#x26;#x003bc;_ingest &#x26;#x02265; &#x26;#x003bb;_sensor.</p>
<title>1.3. Data Quality F1-Score</title><p>The quality rule violation detection F1-score is defined as:</p>

<disp-formula id="FD3"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">F</mi><mn>1</mn><mo>_</mo><mi mathvariant="bold">q</mi><mi mathvariant="bold">u</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">y</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mn>2</mn><mi mathvariant="normal"> </mi><mo>·</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">P</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">c</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">s</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">n</mi><mi mathvariant="normal"> </mi><mo>·</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">R</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">c</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">l</mi><mi mathvariant="normal"> </mi><mo>/</mo><mi mathvariant="normal"> </mi><mo>(</mo><mi mathvariant="bold">P</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">c</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">s</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">n</mi><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">R</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">c</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">l</mi><mo>)</mo></mrow></semantics></math></div><div class="l"><label>(3)</label></div></div></disp-formula><p>where Precision and Recall are derived from the confusion matrix outcomes of quality rule evaluations across all sensor batch checks. A high F1-score indicates low false-alarm rate and high violation coverage simultaneously.</p>
<title>1.4. Cross-Source Lineage Interaction</title><p>The cross-source quality interaction mechanism captures dependency effects between co-deployed sensors:</p>

<disp-formula id="FD4"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">q</mi><mi mathvariant="normal">'</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">q</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">α</mi><mo>·</mo><mi mathvariant="bold">c</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">β</mi><mo>·</mo><mi mathvariant="bold">r</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo></mrow></semantics></math></div><div class="l"><label>(4)</label></div></div></disp-formula><p>where q(t) is the base quality score for a sensor at time t, c(t) is the completeness contribution of co-located sensor sources, r(t) is the repair/recovery score from the observe-and-repair mechanism, and &#x26;#x003b1;, &#x26;#x003b2; are empirically tuned weighting coefficients.</p>
<title>1.5. Composite Weighted Quality Fusion</title><p>To support adaptive multi-domain quality decision fusion, the composite quality score is expressed as:</p>

<disp-formula id="FD5"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">q</mi><mi mathvariant="normal">'</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">w</mi><mo>₁</mo><mo>·</mo><mi mathvariant="bold">q</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">w</mi><mo>₂</mo><mo>·</mo><mi mathvariant="bold">c</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">w</mi><mo>₃</mo><mo>·</mo><mi mathvariant="bold">r</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">w</mi><mo>₄</mo><mo>·</mo><mi mathvariant="bold">q</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mo>·</mo><mi mathvariant="bold">c</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo></mrow></semantics></math></div><div class="l"><label>(5)</label></div></div></disp-formula><p>Here w&#x26;#x02081;, w&#x26;#x02082;, w&#x26;#x02083;, w&#x26;#x02084; denote learnable or empirically tuned weighting coefficients. The interaction term q(t)&#x26;#x000b7;c(t) explicitly models nonlinear coupling between sensor quality indicators and completeness signals, enabling context-aware fusion across heterogeneous data sources.</p>
<title>1.6. Provenance Coverage Score</title><p>The data provenance coverage score measures the fraction of ingested records with fully traced lineage:</p>

<disp-formula id="FD6"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">S</mi><mo>_</mo><mi mathvariant="bold">p</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">v</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">D</mi><mo>_</mo><mi mathvariant="bold">t</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">c</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">d</mi><mi mathvariant="normal"> </mi><mo>/</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">D</mi><mo>_</mo><mi mathvariant="bold">t</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi></mrow></semantics></math></div><div class="l"><label>(6)</label></div></div></disp-formula><p>where D_traced is the volume of sensor records with complete provenance metadata and D_total is the total volume of ingested records. A score approaching 1.0 represents full lineage transparency across all source types.</p>
<title>1.7. Cloud Resource Utilization</title><p>On-premise and cloud resource utilization for the ingestion pipeline is given by:</p>

<disp-formula id="FD7"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">U</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">R</mi><mo>_</mo><mi mathvariant="bold">u</mi><mi mathvariant="bold">s</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">d</mi><mi mathvariant="normal"> </mi><mo>/</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">R</mi><mo>_</mo><mi mathvariant="bold">a</mi><mi mathvariant="bold">v</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">b</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">e</mi></mrow></semantics></math></div><div class="l"><label>(7)</label></div></div></disp-formula><p>where R_used is the utilized computational resources (CPU, memory, I/O bandwidth) consumed by the lineage and quality governance modules, and R_available is the total provisioned cloud/edge capacity.</p>
<title>1.8. Governance Rule Efficiency</title><p>The data quality governance efficiency, balancing detection accuracy against processing overhead, is modelled as:</p>

<disp-formula id="FD8"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">E</mi><mo>_</mo><mi mathvariant="bold">g</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">v</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">F</mi><mn>1</mn><mo>_</mo><mi mathvariant="bold">q</mi><mi mathvariant="bold">u</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">y</mi><mi mathvariant="normal"> </mi><mo>·</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">S</mi><mo>_</mo><mi mathvariant="bold">p</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">v</mi><mi mathvariant="normal"> </mi><mo>/</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">T</mi><mo>_</mo><mi mathvariant="bold">r</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">u</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">d</mi></mrow></semantics></math></div><div class="l"><label>(8)</label></div></div></disp-formula><p>where T_round denotes the duration of one complete quality evaluation round across all active sensor streams. Higher E_gov reflects better trade-off between quality assurance accuracy and computation cost.</p>
<title>1.9.  Adaptive Quality Threshold</title><p>To improve robustness under drifting sensor behaviour, adaptive quality thresholding is employed:</p>

<disp-formula id="FD9"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">θ</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">θ</mi><mo>₀</mo><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">γ</mi><mo>·</mo><mi mathvariant="bold">σ</mi><mo>_</mo><mi mathvariant="bold">d</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">a</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>+</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">δ</mi><mo>·</mo><mi mathvariant="bold">d</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">f</mi><mi mathvariant="bold">t</mi><mo>(</mo><mi mathvariant="bold">t</mi><mo>)</mo></mrow></semantics></math></div><div class="l"><label>(9)</label></div></div></disp-formula><p>where &#x26;#x003b8;&#x26;#x02080; is the baseline quality threshold, &#x26;#x003c3;_data(t) represents the data variance of incoming sensor readings at time t, drift(t) captures temporal distribution shift across production shifts, and &#x26;#x003b3;, &#x26;#x003b4; are scaling parameters determined from historical powertrain sensor calibration records.</p>
<title>1.10. Framework Efficiency Index</title><p>The UDLQ framework efficiency metric &#x26;#x003b7; is calculated as:</p>

<disp-formula id="FD10"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">η</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">F</mi><mn>1</mn><mo>_</mo><mi mathvariant="bold">q</mi><mi mathvariant="bold">u</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">y</mi><mi mathvariant="normal"> </mi><mo>·</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">S</mi><mo>_</mo><mi mathvariant="bold">p</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">v</mi><mi mathvariant="normal"> </mi><mo>/</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">T</mi><mo>_</mo><mi mathvariant="bold">i</mi><mi mathvariant="bold">n</mi><mi mathvariant="bold">g</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">s</mi><mi mathvariant="bold">t</mi><mi mathvariant="normal"> </mi><mo>×</mo><mi mathvariant="normal"> </mi><mn>100</mn></mrow></semantics></math></div><div class="l"><label>(10)</label></div></div></disp-formula><p>where T_ingest denotes the average ingestion time per sensor batch (ms). This index enables direct comparison of efficiency across framework architectures with different quality detection accuracies and provenance coverage levels.</p>
<title>1.11. Prediction Error Relative to Optimal</title><p>The quality detection prediction error relative to the theoretical optimum is defined as:</p>

<disp-formula id="FD11"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">L</mi><mo>_</mo><mi mathvariant="bold">e</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">r</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">F</mi><mn>1</mn><mo>_</mo><mi mathvariant="bold">o</mi><mi mathvariant="bold">p</mi><mi mathvariant="bold">t</mi><mi mathvariant="normal"> </mi><mo>-</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">F</mi><mn>1</mn><mo>_</mo><mi mathvariant="bold">q</mi><mi mathvariant="bold">u</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">y</mi></mrow></semantics></math></div><div class="l"><label>(11)</label></div></div></disp-formula><p>where F1_opt represents the optimal detection performance under ideal sensor conditions with complete metadata and zero data drift. L_error serves as a performance gap indicator during system evaluation.</p>
<title>1.12. Joint Optimisation Objective</title><p>The joint optimisation objective of the UDLQ framework balances competing system requirements:</p>

<disp-formula id="FD12"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">J</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">f</mi><mo>(</mo><mi mathvariant="bold">F</mi><mn>1</mn><mo>_</mo><mi mathvariant="bold">q</mi><mi mathvariant="bold">u</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">y</mi><mo>,</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">S</mi><mo>_</mo><mi mathvariant="bold">p</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">v</mi><mo>,</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">L</mi><mo>,</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">U</mi><mo>)</mo></mrow></semantics></math></div><div class="l"><label>(12)</label></div></div></disp-formula><p>where J balances detection accuracy F1_quality, provenance coverage S_prov, end-to-end latency L, and resource utilization U. The framework is tuned to minimise J while maintaining all performance constraints above defined operational thresholds.</p>
<title>1.13.  Sensor Dataset Representation</title><p>The multi-source sensor dataset representation function across source index i, batch index j, and metric k is given by:</p>

<disp-formula id="FD13"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">D</mi><mo>(</mo><mi mathvariant="bold">i</mi><mo>,</mo><mi mathvariant="bold">j</mi><mo>,</mo><mi mathvariant="bold">k</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">Q</mi><mo>_</mo><mi mathvariant="bold">s</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">c</mi><mo>(</mo><mi mathvariant="bold">i</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>·</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">M</mi><mi mathvariant="bold">e</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">c</mi><mo>(</mo><mi mathvariant="bold">k</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>/</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">T</mi><mo>_</mo><mi mathvariant="bold">p</mi><mi mathvariant="bold">r</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">c</mi><mo>(</mo><mi mathvariant="bold">j</mi><mo>)</mo></mrow></semantics></math></div><div class="l"><label>(13)</label></div></div></disp-formula><p>where Q_src(i) is the source-specific data quality score for sensor type i, Metric(k) denotes the selected performance metric (completeness, accuracy, timeliness), and T_proc(j) represents the processing time per data batch j ingested from powertrain production lines.</p>
<title>1.14. UDLQ Framework Performance Index (UFPI)</title><p>The UDLQ Framework Performance Index (UFPI) is computed as a composite measure of system excellence:</p>

<disp-formula id="FD14"><div class="html-disp-formula-info"><div class="f"><math display="inline"><semantics><mrow><mi mathvariant="bold">U</mi><mi mathvariant="bold">F</mi><mi mathvariant="bold">P</mi><mi mathvariant="bold">I</mi><mi mathvariant="normal"> </mi><mo>=</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">η</mi><mi mathvariant="normal"> </mi><mo>·</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">F</mi><mn>1</mn><mo>_</mo><mi mathvariant="bold">q</mi><mi mathvariant="bold">u</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi><mi mathvariant="bold">i</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">y</mi><mi mathvariant="normal"> </mi><mo>·</mo><mi mathvariant="normal"> </mi><mo>(</mo><mn>1</mn><mi mathvariant="normal"> </mi><mo>-</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">F</mi><mi mathvariant="bold">A</mi><mi mathvariant="bold">R</mi><mo>)</mo><mi mathvariant="normal"> </mi><mo>/</mo><mi mathvariant="normal"> </mi><mi mathvariant="bold">Q</mi><mo>_</mo><mi mathvariant="bold">t</mi><mi mathvariant="bold">o</mi><mi mathvariant="bold">t</mi><mi mathvariant="bold">a</mi><mi mathvariant="bold">l</mi></mrow></semantics></math></div><div class="l"><label>(14)</label></div></div></disp-formula><p>where &#x26;#x003b7; is the framework efficiency, F1_quality is the detection accuracy, Q_total is the cumulative system quality, and FAR denotes the false alarm rate. UFPI penalises excessive false positives while rewarding accuracy, provenance depth, and efficiency &#x26;#x02014; enabling publication-ready cross-framework benchmarking.</p>
</sec><sec id="sec2">
<title>Background and Motivation</title><p>A comprehensive overview of heavy-duty powertrain manufacturing systems reveals the need for data provenance and quality management across the currently segregated application landscapes [
<xref ref-type="bibr" rid="R33">33</xref>]. Heavy-duty vehicles, such as trucks, buses, construction and agricultural machinery, are equipped with a multitude of sensors inside the vehicle and in the surrounding environment. These sensors monitor, measure or predict vehicle and environmental states that can impact production quality, assembly efficiency or maintenance [
<xref ref-type="bibr" rid="R34">34</xref>]. Process and quality control applications continuously ingest these sensor data streams to enforce production rules or to optimize production execution. In addition, applications based on Artificial Intelligence and Machine Learning support product acceptance of customizations, predict customer demand or forecast maintenance situations [
<xref ref-type="bibr" rid="R35">35</xref>]. To achieve the desired outcomes, the proper operation of these applications is crucial, making the quality of the ingested data streams foundational.</p>
<p>The major supply companies for heavy-duty powertrains serve global markets from multiple locations, where vehicle assembly plants and component production plants may be far away from each other [
<xref ref-type="bibr" rid="R36">36</xref>]. To facilitate access to sensors inside the assembly plants&#x26;#x02019; shopfloors or in the surrounding environment, the supplying companies have, over many years, equipped their sites with their own sensor installations. The generated sensor data, however, are often disconnected from the applications; they remain in isolation until an application is developed on top of them [
<xref ref-type="bibr" rid="R37">37</xref>]. This isolation causes issues such as sources not being used due to neither human oversight nor automated discovery; quality deteriorating over time without awareness; or sub-optimal selections of input sources and quality filters due to lack of provenance transparency [
<xref ref-type="bibr" rid="R38">38</xref>]. A unified data-lineage and quality-governance framework is established to address these issues and is essential for long-term business strategies justified by the fourth Industrial Revolution [
<xref ref-type="bibr" rid="R39">39</xref>].</p>
<title>2.1. Sensor Data Ingestion Latency</title><p>Figure 1 presents end-to-end ingestion latencies across the four framework architectures. The proposed UDLQ framework (Model D) achieves 52 ms, representing an 81.7% reduction over Model A (284 ms) and a 55.9% reduction over Model C (118 ms). The latency improvements result from unified ontology-based ingestion, elimination of redundant transformation passes, and cloud-native streaming pipelines [
<xref ref-type="bibr" rid="R40">40</xref>].</p>
<fig id="fig1">
<label>Figure 1</label>
<caption>
<p><b> </b>Sensor Data Ingestion Latency Comparison (ms) across Framework Architectures</p>
</caption>
<graphic xlink:href="1365.fig.001" />
</fig><title>2.2. Data Quality Detection F1-Score</title><p>Figure 2 presents F1-scores for quality rule violation detection: Model A achieves 48.3%, Model B achieves 62.7%, Model C achieves 75.4%, and Model D achieves 91.8%. The 21.8% improvement from Model C to Model D demonstrates the value of unified provenance-aware quality modeling, where lineage context improves sensor fault discrimination accuracy [
<xref ref-type="bibr" rid="R41">41</xref>].</p>
<fig id="fig2">
<label>Figure 2</label>
<caption>
<p>Data Quality Detection F1-Score Comparison (%) across Framework Architectures</p>
</caption>
<graphic xlink:href="1365.fig.002" />
</fig><title>2.3. False-Alarm Rate</title><p>False alarm rates (Figure 3): Model A: 22.4%, Model B: 16.8%, Model C: 11.3%, Model D: 4.1%. The reduction from 11.3% to 4.1% (63.7% improvement) reflects how cross-source provenance validation eliminates spurious quality alerts. A quality violation flagged consistently across multiple co-located sensors is far more likely to reflect a genuine data quality issue than an isolated single-source anomaly [
<xref ref-type="bibr" rid="R41">41</xref>].</p>
<fig id="fig3">
<label>Figure 3</label>
<caption>
<p><b> </b>Sensor Quality False-Alarm Rate (%) across Framework Architectures</p>
</caption>
<graphic xlink:href="1365.fig.003" />
</fig><title>2.4. Quality Violation Detection Rate Over Evaluation Epochs</title><p>Figure 4 shows the convergence of the quality violation detection rate across ten evaluation epochs. Model D consistently outperforms all baselines and converges to 95% detection rate by epoch 9, compared to Model A which plateaus below 46%. The rapid convergence of Model D reflects the adaptive threshold mechanism (Eq. 9) and the cross-source quality fusion (Eq. 5).</p>
<fig id="fig4">
<label>Figure 4</label>
<caption>
<p><b> </b>Quality Rule Violation Detection Rate Over Evaluation Epochs (%)</p>
</caption>
<graphic xlink:href="1365.fig.004" />
</fig><title>2.5. Computational Resource Utilization</title><p>Figure 5 presents normalised resource utilization. Model D consumes 37.8 normalised units compared to Model A's 78.6, a 51.9% reduction. Resource efficiency (detection accuracy per unit compute) is 2.43 for Model D versus 0.61 for Model A, a 3.98&#x26;#x000d7; improvement, made possible through shared ontological processing and elimination of redundant schema-mapping layers [
<xref ref-type="bibr" rid="R42">42</xref>].</p>
<fig id="fig5">
<label>Figure 5</label>
<caption>
<p><b> </b>Computational Resource Utilization (Normalized Units) across Framework Architectures</p>
</caption>
<graphic xlink:href="1365.fig.005" />
</fig><title>2.6. Data Provenance Coverage</title><p>Figure 6 shows provenance coverage scores. Model D achieves 93.4% coverage versus Model A's 31.2%. The UDLQ provenance model captures source identity, communication protocol, transformation lineage, and quality rule provenance for each ingested sensor record, enabling full auditability of powertrain production data quality decisions [
<xref ref-type="bibr" rid="R43">43</xref>].</p>
<fig id="fig6">
<label>Figure 6</label>
<caption>
<p><b> </b>Data Provenance Coverage Score (%) across Framework Architectures</p>
</caption>
<graphic xlink:href="1365.fig.006" />
</fig><title>2.7. Cost vs. Performance Trade-Off</title><p>Figure 7 presents the cost versus performance scatter plot. The proposed framework (Model D) occupies the optimal quadrant: lowest resource consumption and highest F1-score. This Pareto-efficient position confirms that unified lineage-aware quality governance achieves superior performance without proportionally higher computational cost [
<xref ref-type="bibr" rid="R44">44</xref>].</p>
<fig id="fig7">
<label>Figure 7</label>
<caption>
<p><b> </b>Cost vs. Performance Trade-Off across Framework Architectures</p>
</caption>
<graphic xlink:href="1365.fig.007" />
</fig><title>2.8. UDLQ Framework Performance Index (UFPI)</title><p>Figure 8 presents the UFPI scores: Model A: 0.28, Model B: 0.43, Model C: 0.59, Model D: 0.86. The 45.8% difference between Model C and Model D indicates superior synergy between detection accuracy, provenance depth, and operational efficiency achieved through the unified ontological architecture [
<xref ref-type="bibr" rid="R45">45</xref>].</p>
<fig id="fig8">
<label>Figure 8</label>
<caption>
<p><b> </b>UDLQ Framework Performance Index (UFPI) Comparison</p>
</caption>
<graphic xlink:href="1365.fig.008" />
</fig><title>2.9. Multi-Dimensional Performance Radar</title><p>Figure 9 provides a holistic multi-dimensional comparison across five key capability dimensions: F1-Score, Provenance Coverage, Latency Efficiency, Resource Efficiency, and Rule Violation Detection. Model D dominates across all dimensions, confirming that unified framework design creates compounding performance benefits rather than simple additive gains [
<xref ref-type="bibr" rid="R46">46</xref>].</p>
<fig id="fig9">
<label>Figure 9</label>
<caption>
<p><b> </b>Multi-Dimensional Performance Radar Chart &#x02014; UDLQ Framework vs. Baselines</p>
</caption>
<graphic xlink:href="1365.fig.009" />
</fig></sec><sec id="sec3">
<title>Scope and Definitions</title><p>The proposed framework encompasses a unified data lineage model addressing both source validation and internal data provenance originating from applied quality governance [
<xref ref-type="bibr" rid="R32">32</xref>]. A wide range of heavy-duty powertrain manufacturing-related sensor and data sources are embraced, with special focus on weld and assembly processes and related laser and video quality control [
<xref ref-type="bibr" rid="R1">1</xref>,<xref ref-type="bibr" rid="R47">47</xref>]. Quality management rules are specified and implemented in support of the quality framework. The overall approach constitutes a novel IoT data governance mechanism, providing quality, source, and provenance information for data at batch level, with dissemination through a structured query mechanism based on Data Catalog principles [
<xref ref-type="bibr" rid="R2">2</xref>,<xref ref-type="bibr" rid="R48">48</xref>].</p>
<p>Recent years have seen an increasing focus on the cross-domain analysis of sensor data. Sensor data, acquired by sensors placed along a manufacturing production line and used, for example, to assure quality, provide decision support for continuous improvement, or even for predictive maintenance, often come from different data source types: different types of sensors (e.g., cameras and measuring devices), different sources in the same sensor category (e.g., multiple cameras detecting the same aspect of different products), and even entirely different sources [
<xref ref-type="bibr" rid="R3">3</xref>,<xref ref-type="bibr" rid="R49">49</xref>]. The evolution of datacenters and storage platforms now allows such data to be integrated in near-real time to prepare for analysis once the product is completed and the batch of sensors aligned in time [
<xref ref-type="bibr" rid="R31">31</xref>]. However, no proper validation of the source of the data integrated is generally performed, particularly for data coming from different sources and from different domains [
<xref ref-type="bibr" rid="R4">4</xref>,<xref ref-type="bibr" rid="R50">50</xref>].</p>
<title>3.1. Conceptual Framework and Key Terms</title><p>To establish a common understanding of the components under study, a conceptual framework is laid out to define key terms and link together data ingestion quality, data lineage, and data quality assessment in a circular manner as illustrated in [
<xref ref-type="bibr" rid="R30">30</xref>]. A data ingestion system consolidates streams from multiple heterogeneous data sources into a unified data model underpinned by a shared data vocabulary and ontological foundation. Nonetheless, differences in the underlying data models of the data sources (e.g. relational database versus key-value store) and variations in provider characteristics introduce unpredictable variations in ingestion quality across distinct sources and source records [
<xref ref-type="bibr" rid="R5">5</xref>,<xref ref-type="bibr" rid="R51">51</xref>].</p>
<p>Ingested data records may become unfit for intended use during their lifecycle, so it is necessary to estimate their quality in response to business demands. The establishment of quality rules and metrics for data records thus constitutes a key ingredient of subsequent quality assessment [
<xref ref-type="bibr" rid="R29">29</xref>]. These rules and metrics are sourced from a data quality governance system that covers the quality aspects of the entire data lifecycle from design to operation to retirement. During operation, quality problems are detected by comparing data records against relevant quality rules specified for the records at ingestion time. If the assessed quality of a data record drops below the specified quality threshold, the problem is raised and triggers execution of the relevant business response. In the absence of such problem detection, the data record is malingered for subsequent business execution [
<xref ref-type="bibr" rid="R6">6</xref>,<xref ref-type="bibr" rid="R52">52</xref>].</p>
</sec><sec id="sec4">
<title>Architectural Overview</title><p>The unified architecture encompasses four layers: Data Ingestion, Unified Lineage Modeling, Quality Management, and Presentation. The comprehensive solution addresses the overarching challenges of Multi-Source Sensor Streams in Heavy-Duty Powertrain Manufacturing [
<xref ref-type="bibr" rid="R28">28</xref>].</p>
<p>The Data Ingestion Layer forms the foundation of the architecture and establishes a shared workflow for Multi-Source Sensor Streams in Heavy-Duty Powertrain Manufacturing. Multi-Source Sensor Streams are dependent on multiple classes of quality data sources [
<xref ref-type="bibr" rid="R27">27</xref>]. Two prominent classes are Internal Data Sources, which augment quality metadata with locations for smart devices, and External Data Sources that offer real-time traffic management information [
<xref ref-type="bibr" rid="R7">7</xref>,<xref ref-type="bibr" rid="R53">53</xref>]. The ingestion requires the connection of the quality data sources along with the hardware related installation, proper configuration of selected devices and sensors with data stream routing setup and data delivery. Sensor Stream Data Sources represent sensor data sources on the production process. The ingestion layer drives the completion of data capturing and storage readiness for Multi-Source Sensor Streams used in the Sink Layer. Data completeness checks within quality rules ensure that the heterogeneous Multi-Source Sensor Streams needed in the Sink Layer of the overall infrastructure are indeed present before processing continues with Data Lineage Modeling or the Data Quality Framework [
<xref ref-type="bibr" rid="R8">8</xref>,<xref ref-type="bibr" rid="R54">54</xref>].</p>
<title>4.1. Data Ingestion Layer</title><p>The Data Ingestion Layer facilitates the integration of heterogeneous data sources into a unified ontology-based format. Data streams originating from diverse sensor sources undergo necessary transformation and quality-altering processes before being ingested into the quality-governance framework [
<xref ref-type="bibr" rid="R26">26</xref>]. Data source streams can precisely include sensors monitoring streamer parts, machines, handling devices, and the external environment. This information is recast, filtered, and transformation rules are applied throughout the transformation chain to identify the relevant quality metrics for data-source input quality assessment [
<xref ref-type="bibr" rid="R9">9</xref>,<xref ref-type="bibr" rid="R55">55</xref>].</p>
<p>The quality provenance acquisition process retrieves metadata from the raw data layer at runtime. Metadata describing the nature of smart-object data is loaded into a fact-based ontology enabling transparent queries for quality-information provenance queries [
<xref ref-type="bibr" rid="R25">25</xref>]. Queries supported by the methodological framework may also exploit new sensor apparatuses emerging in the manufacturer context to monitor the manufacturer also as an ecological system. The acquisition, recasting, and path-assessment processes guarantee that the data supply characterization is kept consistently in line with monitored-object information [
<xref ref-type="bibr" rid="R10">10</xref>,<xref ref-type="bibr" rid="R56">56</xref>].</p>
</sec><sec id="sec5">
<title>Data Lineage Modeling</title><p>The Data Lineage Model provides sufficient detail to build a unified data quality framework. The first necessary step is the definition of a set of data sources, together with a method for provenance capture. Two aspects fundamentally inform the modeling of data sources: Sensor Type and Communication Protocol [
<xref ref-type="bibr" rid="R11">11</xref>,<xref ref-type="bibr" rid="R24">24</xref>].</p>
<p>To align the framework with IT-OT convergence best practices, the architecture benefits from using existing scientific literature and domain ontologies for an initial consideration of data sources. Extending the notions presented in the section on sources helps adapt the Data Lineage Model to the needs of a heavy-duty powertrain plant&#x26;#x02014;namely, the correct definition of a Sensor Type ontology for all external data sources. These data sources are identified (as actors) based on their integration pathway and are presented using formal labeling, determining the provenance of every integrated data stream [
<xref ref-type="bibr" rid="R12">12</xref>,<xref ref-type="bibr" rid="R57">57</xref>].</p>
<table-wrap id="tab1">
<label>Table 1</label>
<caption>
<p><b>Table 1</b><b>.</b><b> </b>Comparative Overview of UDLQ Framework Architectures</p>
</caption>

<table>
<thead>
<tr>
<th align="center"><bold>Architecture Type</bold><bold></bold></th>
<th align="center"><bold>Key Features</bold><bold></bold></th>
<th align="center"><bold>Limitations</bold><bold></bold></th>
<th align="center"></th>
</tr>
</thead>
<tbody>
<tr>
<td align="center">Rule-Based  Lineage + Threshold Quality (Model A)</td>
<td align="center">Simple rule  logic, low implementation cost, static quality thresholds</td>
<td align="center">No real-time  adaptation, high false-alarm rate (18&#x02013;25%), no provenance depth, no drift  handling</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">Batch ETL with  Manual Lineage (Model B)</td>
<td align="center">Improved data  completeness, periodic quality checks, partial provenance</td>
<td align="center">High latency  (196 ms), no streaming support, manual lineage annotation required</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">Streaming  Ingestion + Partial Governance (Model C)</td>
<td align="center">Near-real-time  ingestion (118 ms), automated completeness checks, partial provenance</td>
<td align="center">No  cross-source fusion, separate modules for each quality dimension, limited  lineage depth</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">Proposed UDLQ Framework  (Model D)</td>
<td align="center">Unified  ontology-based lineage, adaptive quality governance, 91.8% F1, 52 ms latency,  93.4% provenance coverage</td>
<td align="center">Requires cloud  infrastructure provisioning and initial ontology engineering effort</td>
<td align="center"></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>

</fn>
</table-wrap-foot>
</table-wrap><p></p>
<table-wrap id="tab2">
<label>Table 2</label>
<caption>
<p><b>Table 2</b><b>.</b><b> </b>Comparative Detection and Provenance Metrics</p>
</caption>

<table>
<thead>
<tr>
<th align="center"><bold>Metric</bold><bold></bold></th>
<th align="center"><bold>Model A</bold><bold></bold></th>
<th align="center"><bold>Model B</bold><bold></bold></th>
<th align="center"><bold>Model C</bold><bold></bold></th>
<th align="center"><bold>Model D</bold><bold></bold></th>
<th align="center"><bold>Improvement (D vs A)</bold><bold></bold></th>
<th align="center"><bold>Improvement (D vs C)</bold><bold></bold></th>
<th align="center"></th>
</tr>
</thead>
<tbody>
<tr>
<td align="center">Data Quality  F1-Score (%)</td>
<td align="center">48.3</td>
<td align="center">62.7</td>
<td align="center">75.4</td>
<td align="center">91.8</td>
<td align="center">&#x02191;  90.1%</td>
<td align="center">&#x02191;  21.8%</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">False-Alarm  Rate (%)</td>
<td align="center">22.4</td>
<td align="center">16.8</td>
<td align="center">11.3</td>
<td align="center">4.1</td>
<td align="center">&#x02193;  81.7%</td>
<td align="center">&#x02193;  63.7%</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">Provenance  Coverage (%)</td>
<td align="center">31.2</td>
<td align="center">49.7</td>
<td align="center">68.5</td>
<td align="center">93.4</td>
<td align="center">&#x02191;  199.4%</td>
<td align="center">&#x02191;  36.4%</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">Resource  Utilization (units)</td>
<td align="center">78.6</td>
<td align="center">64.3</td>
<td align="center">51.2</td>
<td align="center">37.8</td>
<td align="center">&#x02193;  51.9%</td>
<td align="center">&#x02193;  26.2%</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">UFPI</td>
<td align="center">0.28</td>
<td align="center">0.43</td>
<td align="center">0.59</td>
<td align="center">0.86</td>
<td align="center">&#x02191;  207.1%</td>
<td align="center">&#x02191;  45.8%</td>
<td align="center"></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>

</fn>
</table-wrap-foot>
</table-wrap><p></p>
<p>Table 2 affirms that there were large improvements in all performance dimensions, especially the 90.1% increase in the F1-score and the 81.7% decrease in false alarms compared to the traditional rule-based approaches.</p>
<p></p>
<table-wrap id="tab3">
<label>Table 3</label>
<caption>
<p><b>Table 3</b><b>.</b><b> </b>Comparative Error and Latency Metrics</p>
</caption>

<table>
<thead>
<tr>
<th align="center"><bold>Metric</bold><bold></bold></th>
<th align="center"><bold>Model A</bold><bold></bold></th>
<th align="center"><bold>Model B</bold><bold></bold></th>
<th align="center"><bold>Model C</bold><bold></bold></th>
<th align="center"><bold>Model D</bold><bold></bold></th>
<th align="center"><bold>Improvement (D vs A)</bold><bold></bold></th>
<th align="center"><bold>Improvement (D vs C)</bold><bold></bold></th>
<th align="center"></th>
</tr>
</thead>
<tbody>
<tr>
<td align="center">Ingestion  Latency (ms)</td>
<td align="center">284</td>
<td align="center">196</td>
<td align="center">118</td>
<td align="center">52</td>
<td align="center">&#x02193;  81.7%</td>
<td align="center">&#x02193;  55.9%</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">Mean Time to  Detect Violation (s)</td>
<td align="center">31.6</td>
<td align="center">18.4</td>
<td align="center">10.7</td>
<td align="center">3.8</td>
<td align="center">&#x02193;  88.0%</td>
<td align="center">&#x02193;  64.5%</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">Sensor Data  Throughput (records/s)</td>
<td align="center">1,240</td>
<td align="center">2,180</td>
<td align="center">4,650</td>
<td align="center">9,820</td>
<td align="center">&#x02191;  692.0%</td>
<td align="center">&#x02191;  111.2%</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">Quality Rule  Eval. Round (s)</td>
<td align="center">42.3</td>
<td align="center">28.7</td>
<td align="center">15.4</td>
<td align="center">6.2</td>
<td align="center">&#x02193;  85.3%</td>
<td align="center">&#x02193;  59.7%</td>
<td align="center"></td>
</tr>
</tbody>
</table>
</table-wrap><p></p>
<p>Table 3 reveals that Model D has the advantage of minimising latency, maximising throughput, and reducing quality evaluation round duration. Mean Time to Detect quality violations drops from 31.6 s (Model A) to 3.8 s (Model D), a critical improvement for safety-critical powertrain production lines.</p>
<title>5.1. Source Identification and Provenance Capture</title><p>A unified data lineage model describes provenance of sensor data streams in order to inject information about governing security and quality rules [
<xref ref-type="bibr" rid="R23">23</xref>]. Both data quality and security are addressed in separate modules, but an explicit recording of data provenance enables automatic generation of the proper governing rules as well as the intelligent control flow for their evaluation [
<xref ref-type="bibr" rid="R13">13</xref>,<xref ref-type="bibr" rid="R59">59</xref>]. Data streams from heterogeneous sensors deployed in Heavy-Duty Powertrain assembly and test processes are monitored. A cloud environment hosts the modules required for monitoring. Ingestion, data lineage modelling, data quality assessment and data security mechanisms are addressed. Remote administration of the entire system gives users the possibility to configure the servers for specific use cases and to access the monitored data as well as the corresponding assessment reports [
<xref ref-type="bibr" rid="R14">14</xref>,<xref ref-type="bibr" rid="R58">58</xref>].</p>
<p>Reinforced heavy-duty powertrains and their components are exposed to highly demanding operating conditions. Assembly and test operations require the use of several sensors and monitoring devices that provide an ample variety of data sources [
<xref ref-type="bibr" rid="R15">15</xref>,<xref ref-type="bibr" rid="R60">60</xref>]. Sensor data support Smart Manufacturing monitoring and control practices. A dynamic and intelligent data monitoring solution for multi-source heterogeneous sensors is proposed. It enables the definition of any number of independent and parallel data-monitoring processes based on a central cloud architecture. The solution supports different types of data monitoring such as event condition action (ECA), continuous data monitoring and test data quality assessment [
<xref ref-type="bibr" rid="R22">22</xref>]. Data provenance and monitoring are considered to enable fine-grain and automated quality assessment and real-time security verification. The combination of these two capabilities allows control of data security and quality automatically, according to the specific use case [
<xref ref-type="bibr" rid="R16">16</xref>,<xref ref-type="bibr" rid="R61">61</xref>].</p>
</sec><sec id="sec6">
<title>Conclusion</title><p>A unified data provenance capturing and quality governance framework is presented for the continuously collected sensor streams in manufacturing processes of heavy-duty diesel powertrains. Data provenance and quality monitoring capabilities are implemented and exemplified in the ingestion layer of a data architecture enabling a streaming data flow from production into the cloud [
<xref ref-type="bibr" rid="R21">21</xref>]. The ingestion layer is developed as part of an innovation action of the European Union Horizon 2020 program aiming at 4.0 transformation of manufacturing processes through flexible FPGA-based multi-sensing technology providing multi-source high-rate environment data for battery-free and low-cost archiving of production quality measures. Provenance of ingested data is identified and captured automatically and partially manually based on process metadata [
<xref ref-type="bibr" rid="R20">20</xref>]. Data quality measures are defined based on domain knowledge, and monitoring is conducted to detect violations of the rules online and offline for ingesting non-compliant sensor data [
<xref ref-type="bibr" rid="R17">17</xref>,<xref ref-type="bibr" rid="R62">62</xref>].</p>
<p>Heavy-duty diesel powertrains are complex mechatronic systems in their design, manufacturing, usage, and retrofitting lifecycle stages. Innovative architectures for Hybrid Electric Vehicles (HEV) and Fuel Cell Electric Vehicles (FCEV) increase the number of powertrain components and sensors for thermal, fuel-cell, battery, hybrid-electric, and electric power-management functions [
<xref ref-type="bibr" rid="R18">18</xref>,<xref ref-type="bibr" rid="R64">64</xref>]. The digital counterpart of the manufacturing process of these high-tech products is recognized as an effective means for improving the manufacturing quality by analyzing the data collected during production, addressing low-level information like quality measures in a 4.0 transformational process, and the automation of Quality Control (QC) processes. The present research proposal focuses on the concept of streaming processing and the quality rules and measures applicable for each sensor in the context of Quality Control and Continuous Data Quality Evaluation (CDQ), with special emphasis on process provenance and lineage modeling for data streams coming from an innovative HYDAC FPT 4.0 project [
<xref ref-type="bibr" rid="R19">19</xref>,<xref ref-type="bibr" rid="R63">63</xref>].</p>
</sec>
  </body>
  <back>
    <ref-list>
      <title>References</title>
      
<ref id="R1">
<label>[1]</label>
<mixed-citation publication-type="other">Kolla, S. H. (2021). Rule-Based Automation for IT Service Management Workflows. Online Journal of Engineering Sciences, 1(1), 1-14.
</mixed-citation>
</ref>
<ref id="R2">
<label>[2]</label>
<mixed-citation publication-type="other">Pamisetty, A. (2021). A comparative study of cloud platforms for scalable infrastructure in food distribution supply chains.
</mixed-citation>
</ref>
<ref id="R3">
<label>[3]</label>
<mixed-citation publication-type="other">Mangalampalli, B. M. (2021). Scalable Data Warehouse Architecture for Population Health Management and Predictive Analytics. World Journal of Clinical Medicine Research, 1(1), 1-18.
</mixed-citation>
</ref>
<ref id="R4">
<label>[4]</label>
<mixed-citation publication-type="other">Sheelam, G. K., &#x00026; Nandan, B. P. (2021). Machine Learning Integration in Semiconductor Research and Manufacturing Pipelines. International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI, 10.
</mixed-citation>
</ref>
<ref id="R5">
<label>[5]</label>
<mixed-citation publication-type="other">Venkata Akhilesh Ranga Reddy. (2021). Challenges in Standardizing Member Eligibility Data Across Multi-Payer Healthcare Ecosystems. International Journal of Medical Toxicology and Legal Medicine, 24(3 and 4), 1-19. Retrieved from https://ijmtlm.org/index.php/journal/article/view/1475
</mixed-citation>
</ref>
<ref id="R6">
<label>[6]</label>
<mixed-citation publication-type="other">Kummari, D. N. (2021). Smart Infrastructure Auditing: Integrating AI to Streamline Manufacturing Compliance Processes. Journal of Interna-tional Crisis and Risk Communication Research, 168-193.
</mixed-citation>
</ref>
<ref id="R7">
<label>[7]</label>
<mixed-citation publication-type="other">Davuluri, P. N. Event-Driven Compliance Systems: Modernizing Financial Crime Detection Without Machine Intelligence.
</mixed-citation>
</ref>
<ref id="R8">
<label>[8]</label>
<mixed-citation publication-type="other">Meda, R. (2021). Digital Infrastructure for Predictive Inventory Management in Retail Using Machine Learning. International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI, 10.
</mixed-citation>
</ref>
<ref id="R9">
<label>[9]</label>
<mixed-citation publication-type="other">Botlagunta, P. N., &#x00026; Sheelam, G. K. (2020). Data-Driven Design and Validation Techniques in Advanced Chip Engineering. Global Research Development (GRD) ISSN, 2455-5703.
</mixed-citation>
</ref>
<ref id="R10">
<label>[10]</label>
<mixed-citation publication-type="other">Mukesh, A., &#x00026; Aitha, A. R. (2021). Insurance Risk Assessment Using Predictive Modeling Techniques. International Journal of Emerging Research in Engineering and Technology, 2(4), 68-79.
</mixed-citation>
</ref>
<ref id="R11">
<label>[11]</label>
<mixed-citation publication-type="other">Sheelam, G. K., &#x00026; Nandan, B. P. (2021). Machine Learning Integration in Semiconductor Research and Manufacturing Pipelines. International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI, 10.
</mixed-citation>
</ref>
<ref id="R12">
<label>[12]</label>
<mixed-citation publication-type="other">Inala, R. (2021). A New Paradigm in Retirement Solution Platforms: Leveraging Data Governance to Build AI-Ready Data Products. Journal of International Crisis and Risk Communication Research, 286-310.
</mixed-citation>
</ref>
<ref id="R13">
<label>[13]</label>
<mixed-citation publication-type="other">Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., &#x00026; Tzoumas, K. (2015). Apache Flink: Stream and batch processing in a single engine. *IEEE Data Engineering Bulletin, 38*(4), 28-38.
</mixed-citation>
</ref>
<ref id="R14">
<label>[14]</label>
<mixed-citation publication-type="other">Pandiri, L. (2021). Cloud-Based AI Systems for Real-Time Underwriting in Recreational and Property Insurance. International Journal of Science and Research (IJSR), 10(12), 1626-1638.
</mixed-citation>
</ref>
<ref id="R15">
<label>[15]</label>
<mixed-citation publication-type="other">Amistapuram, K. (2021). Digital Transformation in Insurance: Migrating Enterprise Policy Systems to .NET Core. Universal Journal of Computer Sciences and Communications, 1(1), 1-17.
</mixed-citation>
</ref>
<ref id="R16">
<label>[16]</label>
<mixed-citation publication-type="other">Pamisetty, V. (2021). Integrating Predictive Analytics and IT Infrastructure for Advanced Government Financial Management and Fraud Detection. Available at SSRN 5275676.
</mixed-citation>
</ref>
<ref id="R17">
<label>[17]</label>
<mixed-citation publication-type="other">Kolla, S. K. (2021). Architectural Frameworks for Large-Scale Electronic Health Record Data Platforms. Current Research in Public Health, 1(1), 1-19.
</mixed-citation>
</ref>
<ref id="R18">
<label>[18]</label>
<mixed-citation publication-type="other">Zuech, R., Khoshgoftaar, T. M., &#x00026; Wald, R. (2015). Intrusion detection and Big Heterogeneous Data: A survey. *Journal of Big Data, 2*(3), 1-41.
</mixed-citation>
</ref>
<ref id="R19">
<label>[19]</label>
<mixed-citation publication-type="other">Raghunath Loganathan (2021). Integrated Risk and Compliance Frameworks for Global Data Center Operations: A Governance-Centric Approach. Universal Journal of Computer Sciences and Communications, 1(1), 1-26. https://doi.org/10.31586/ujscs.2021.1377
</mixed-citation>
</ref>
<ref id="R20">
<label>[20]</label>
<mixed-citation publication-type="other">Al-Fuqaha, A., Guizani, M., Mohammadi, M., Aledhari, M., &#x00026; Ayyash, M. (2015). Internet of Things: A survey on enabling technologies, protocols, and applications. *IEEE Communications Surveys &#x00026; Tutorials, 17*(4), 2347-2376.
</mixed-citation>
</ref>
<ref id="R21">
<label>[21]</label>
<mixed-citation publication-type="other">Davuluri, P. N. (2019). Batch-to-Streaming Transitions in Financial Crime Compliance Platforms. International Journal Of Engineering And Computer Science, 8(12).
</mixed-citation>
</ref>
<ref id="R22">
<label>[22]</label>
<mixed-citation publication-type="other">Hashem, I. A. T., Yaqoob, I., Anuar, N. B., Mokhtar, S., Gani, A., &#x00026; Khan, S. U. (2015). The rise of "big data" on cloud computing: Review and open research issues. *Information Systems, 47*, 98-115.
</mixed-citation>
</ref>
<ref id="R23">
<label>[23]</label>
<mixed-citation publication-type="other">Segireddy, A. R. (2021). Containerization and Microservices in Payment Systems: A Study of Kubernetes and Docker in Financial Applications. Universal Journal of Business and Management, 1(1), 1-17.
</mixed-citation>
</ref>
<ref id="R24">
<label>[24]</label>
<mixed-citation publication-type="other">Chen, M., Mao, S., &#x00026; Liu, Y. (2014). Big data: A survey. *Mobile Networks and Applications, 19*(2), 171-209.
</mixed-citation>
</ref>
<ref id="R25">
<label>[25]</label>
<mixed-citation publication-type="other">Inala, R. (2020). Building Foundational Data Products for Financial Services: A MDM-Based Approach to Customer, and Product Data Integration. Universal Journal of Finance and Economics, 1(1), 1-18.
</mixed-citation>
</ref>
<ref id="R26">
<label>[26]</label>
<mixed-citation publication-type="other">Sasi Kumar Kolla (2022). Predictive Statistical Modeling For Hospital Readmission Risk Using Structured Clinical Data. Frontiers in Health Informatics, Vol.11(2022), 844-865
</mixed-citation>
</ref>
<ref id="R27">
<label>[27]</label>
<mixed-citation publication-type="other">Kambatla, K., Kollias, G., Kumar, V., &#x00026; Grama, A. (2014). Trends in big data analytics. *Journal of Parallel and Distributed Computing, 74*(7), 2561-2573.
</mixed-citation>
</ref>
<ref id="R28">
<label>[28]</label>
<mixed-citation publication-type="other">Gottimukkala, V. R. R. (2020). Energy-Efficient Design Patterns for Large-Scale Banking Applications Deployed on AWS Cloud. power, 9(12).
</mixed-citation>
</ref>
<ref id="R29">
<label>[29]</label>
<mixed-citation publication-type="other">Monostori, L. (2014). Cyber-physical production systems: Roots, expectations and R&#x00026;D challenges. *Procedia CIRP, 17*, 9-13.
</mixed-citation>
</ref>
<ref id="R30">
<label>[30]</label>
<mixed-citation publication-type="other">Gubbi, J., Buyya, R., Marusic, S., &#x00026; Palaniswami, M. (2013). Internet of Things (IoT): A vision, architectural elements, and future directions. *Future Generation Computer Systems, 29*(7), 1645-1660.
</mixed-citation>
</ref>
<ref id="R31">
<label>[31]</label>
<mixed-citation publication-type="other">Pamisetty, V. (2021). A Cloud-Integrated Framework for Efficient Government Financial Management and Unclaimed Asset Recovery. Available at SSRN 5272351.
</mixed-citation>
</ref>
<ref id="R32">
<label>[32]</label>
<mixed-citation publication-type="other">Aitha, A. R. (2021). Dev Ops Driven Digital Transformation: Accelerating Innovation In The Insurance Industry. Available at SSRN 5622190.
</mixed-citation>
</ref>
<ref id="R33">
<label>[33]</label>
<mixed-citation publication-type="other">Zhang, Y., Meratnia, N., &#x00026; Havinga, P. (2010). Outlier detection techniques for wireless sensor networks: A survey. *IEEE Communications Surveys &#x00026; Tutorials, 12*(2), 159-170.
</mixed-citation>
</ref>
<ref id="R34">
<label>[34]</label>
<mixed-citation publication-type="other">Pandiri, L. Data-Driven Insights into Consumer Behavior for Bundled Insurance Offerings Using Big Data Analytics.
</mixed-citation>
</ref>
<ref id="R35">
<label>[35]</label>
<mixed-citation publication-type="other">Atzori, L., Iera, A., &#x00026; Morabito, G. (2010). The Internet of Things: A survey. *Computer Networks, 54*(15), 2787-2805.
</mixed-citation>
</ref>
<ref id="R36">
<label>[36]</label>
<mixed-citation publication-type="other">Bizer, C., &#x00026; Cyganiak, R. (2009). Quality-driven information filtering using the WIQA policy framework. *Web Semantics: Science, Services and Agents on the World Wide Web, 7*(1), 1-10.
</mixed-citation>
</ref>
<ref id="R37">
<label>[37]</label>
<mixed-citation publication-type="other">Sriram, H. K., ADUSUPALLI, B., Singreddy, S., &#x00026; Malempati, M. (2021). Revolutionizing Risk Assessment and Financial Ecosystems with Smart Automation, Secure Digital Solutions, and Advanced Analytical Frameworks. Murali, Revolutionizing Risk Assessment and Financial Ecosystems with Smart Automation, Secure Digital Solutions, and Advanced Analytical Frameworks (December 27, 2021).
</mixed-citation>
</ref>
<ref id="R38">
<label>[38]</label>
<mixed-citation publication-type="other">Bose, R. P. J. C., &#x00026; Van der Aalst, W. M. P. (2009). Context aware trace clustering: Towards improving process mining results. *Proceedings of the SIAM International Conference on Data Mining*, 401-412.
</mixed-citation>
</ref>
<ref id="R39">
<label>[39]</label>
<mixed-citation publication-type="other">Pamisetty, V. (2021). Big Data and Predictive Analytics in Government Finance: Transforming Fraud Detection and Fiscal Oversight. Available at SSRN 5276847.
</mixed-citation>
</ref>
<ref id="R40">
<label>[40]</label>
<mixed-citation publication-type="other">Ni, K., Ramanathan, N., Chehade,M., Balzano,L., Nair,S., Zahedi,S., Kohler,E., Pottie,G., Hansen,M., &#x00026; Srivastava,M. (2009). Sensor network data fault types. *ACM Transactions on Sensor Networks, 5*(3), 1-29.
</mixed-citation>
</ref>
<ref id="R41">
<label>[41]</label>
<mixed-citation publication-type="other">Dean, J., &#x00026; Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. *Communications of the ACM, 51*(1), 107-113.
</mixed-citation>
</ref>
<ref id="R42">
<label>[42]</label>
<mixed-citation publication-type="other">Chava, K., Chakilam, C., &#x00026; Recharla, M. (2021). Machine Learning Models for Early Disease Detection: A Big Data Approach to Personalized Healthcare. International Journal of Engineering and Computer Science, 10(12), 25709-25730.
</mixed-citation>
</ref>
<ref id="R43">
<label>[43]</label>
<mixed-citation publication-type="other">Olston, C., Reed, B., Srivastava, U., Kumar, R., &#x00026; Tomkins, A. (2008). Pig Latin: A not-so-foreign language for data processing. *Proceedings of the ACM SIGMOD International Conference on Management of Data*, 1099-1110.
</mixed-citation>
</ref>
<ref id="R44">
<label>[44]</label>
<mixed-citation publication-type="other">Meda, R. (2020). Real-Time Data Pipelines for Demand Forecasting in Retail Paint Distribution Networks. Global Research Development (GRD) ISSN, 2455-5703.
</mixed-citation>
</ref>
<ref id="R45">
<label>[45]</label>
<mixed-citation publication-type="other">Nandan, B. P. Data Analytics-Driven Approaches to Yield Prediction in Semiconductor Manufacturing. International Journal of Innovative Research in Electrical, Electronics, Instrumentation and Control Engineering (IJIREEICE), DOI, 10.
</mixed-citation>
</ref>
<ref id="R46">
<label>[46]</label>
<mixed-citation publication-type="other">Kolla, S. K. (2021). Designing Scalable Healthcare Data Pipelines for Multi-Hospital Networks. World Journal of Clinical Medicine Research, 1(1), 1-14.
</mixed-citation>
</ref>
<ref id="R47">
<label>[47]</label>
<mixed-citation publication-type="other">Stonebraker, M., &#x000c7;etintemel, U., &#x00026; Zdonik, S. (2005). The 8 requirements of real-time stream processing. *ACM SIGMOD Record, 34*(4), 42-47.
</mixed-citation>
</ref>
<ref id="R48">
<label>[48]</label>
<mixed-citation publication-type="other">Mangala, N. (2021). Optimizing Large-Scale ETL Pipelines Using Medallion Architecture on Azure Data Lake. Journal of Artificial Intelligence and Big Data, 1(1), 1-20.
</mixed-citation>
</ref>
<ref id="R49">
<label>[49]</label>
<mixed-citation publication-type="other">Abadi, D. J., Ahmad, Y., Balazinska, M., &#x000c7;etintemel, U., Cherniack, M., Hwang, J. H., Lindner, W., Maskey, A., Rasin, A., Ryvkina, E., Tatbul, N., Xing, Y., &#x00026; Zdonik, S. (2005). The design of the Borealis stream processing engine. *Proceedings of the Conference on Innovative Data Systems Research*, 277-289.
</mixed-citation>
</ref>
<ref id="R50">
<label>[50]</label>
<mixed-citation publication-type="other">Davuluri, P. N. (2020). Improving Data Quality and Lineage in Regulated Financial Data Platforms. Finance and Economics, 1(1), 1-14.
</mixed-citation>
</ref>
<ref id="R51">
<label>[51]</label>
<mixed-citation publication-type="other">Simmhan, Y. L., Plale, B., &#x00026; Gannon, D. (2005). A survey of data provenance in e-science. *ACM SIGMOD Record, 34*(3), 31-36.
</mixed-citation>
</ref>
<ref id="R52">
<label>[52]</label>
<mixed-citation publication-type="other">Ranjith Kumar Peddi (2021). Optimizing Case Management Workflows in Global Data Center Colocation Services. Universal Journal of Computer Sciences and Communications, 1(1), 1-21. https://doi.org/10.31586/ujscs.2021.1380
</mixed-citation>
</ref>
<ref id="R53">
<label>[53]</label>
<mixed-citation publication-type="other">Gottimukkala, V. R. R. (2021). Digital Signal Processing Challenges in Financial Messaging Systems: Case Studies in High-Volume SWIFT Flows.
</mixed-citation>
</ref>
<ref id="R54">
<label>[54]</label>
<mixed-citation publication-type="other">Pamisetty, A. (2019). Big Data Engineering for Real-Time Inventory Optimization in Wholesale Distribution Networks. Available at SSRN 5267328.
</mixed-citation>
</ref>
<ref id="R55">
<label>[55]</label>
<mixed-citation publication-type="other">Pandiri, L. (2021). Machine Learning Approaches in Pricing and Claims Optimization for Recreational Vehicle Insurance. Journal of International Crisis and Risk Communication Research, 194-214.
</mixed-citation>
</ref>
<ref id="R56">
<label>[56]</label>
<mixed-citation publication-type="other">Recharla, M. (2020). Targeted Gene Therapy for Spinal Muscular Atrophy: Advances in Delivery Mechanisms and Clinical Outcomes. International Journal of Science and Research (IJSR), 9(12), 1921-1934.
</mixed-citation>
</ref>
<ref id="R57">
<label>[57]</label>
<mixed-citation publication-type="other">Nandan, B. P. (2021). Enhancing Chip Performance Through Predictive Analytics and Automated Design Verification. Journal of International Crisis and Risk Communication Research, 265-285.
</mixed-citation>
</ref>
<ref id="R58">
<label>[58]</label>
<mixed-citation publication-type="other">Singireddy, S., &#x00026; Adusupalli, B. (2019). Cloud Security Challenges in Modernizing Insurance Operations with Multi-Tenant Architectures. International Journal of Engineering and Computer Science, 8, 12.
</mixed-citation>
</ref>
<ref id="R59">
<label>[59]</label>
<mixed-citation publication-type="other">Pandiri, L., Singireddy, S., &#x00026; Adusupalli, B. (2020). Digital Transformation of Underwriting Processes through Automation and Data Integration. Global Research Development (GRD) ISSN, 2455-5703.
</mixed-citation>
</ref>
<ref id="R60">
<label>[60]</label>
<mixed-citation publication-type="other">Kummari, D. N. (2021). A Framework for Risk-Based Auditing in Intelligent Manufacturing Infrastructures. International Journal on Recent and Innovation Trends in Computing and Communication, 9(12), 245-262.
</mixed-citation>
</ref>
<ref id="R61">
<label>[61]</label>
<mixed-citation publication-type="other">Meda, R. (2020). Data Engineering Architectures for Real-Time Quality Monitoring in Paint Production Lines. International Journal Of Engineering And Computer Science, 9(12).
</mixed-citation>
</ref>
<ref id="R62">
<label>[62]</label>
<mixed-citation publication-type="other">Inala, R. Designing Scalable Technology Architectures for Customer Data in Group Insurance and Investment Platforms.
</mixed-citation>
</ref>
<ref id="R63">
<label>[63]</label>
<mixed-citation publication-type="other">Aitha, A. R. (2021). Optimizing Data Warehousing for Large Scale Policy Management Using Advanced ETL Frameworks.
</mixed-citation>
</ref>
<ref id="R64">
<label>[64]</label>
<mixed-citation publication-type="other">Valiki, D., &#x00026; Kummari, D. N. (2021). Rule-Based Decision Systems for the Automation of Audit Sampling. International Journal of Emerging Trends in Computer Science and Information Technology, 2(4), 105-114.
</mixed-citation>
</ref>
    </ref-list>
  </back>
</article>