What’s the difference between accuracy and precision?

Accuracy is the degree of closeness to true value. Precision is the degree to which an instrument or process will repeat the same value. In other words, accuracy is the degree of veracity while precision is the degree of reproducibility. What does accuracy mean? If a measurement is accurate, it means that it agrees closely … Read more

Use Case: API vs CSV

When comparing the integration of CSV files and APIs with the DataWarehouse, several factors impact data integrity, including data accuracy, consistency, security, and the ability to handle large datasets. Below are some key comparisons and potential risks associated with each method: 1. Data Accuracy and Consistency CSV Integration: API Integration: 2. Data Security CSV Integration: … Read more

My first Power BI

Create Parameters Create two parameters… Use a Token on myBMT The token allows the Datawarehouse to identify with the source of the report, as PowerBI cannot send environment details. (answers on a postcard please, if you know how) So the token is used to give the report a unique identifier myBMT | Token (bmt-dwh-uks-app-my.azurewebsites.net) Navigate … Read more

QA Requirements

Quality standards that apply to the development. Software Lifecycle used Software Quality tool(s) and methods used. The overarching goal of Data Warehouse Development Quality Plan is proactive problem-solving, where any anomalies or discrepancies are swiftly identified and rectified before they escalate into issues. Through continuous monitoring, analysis, and data observability, the reliability, accuracy, and accessibility … Read more

Introduction

Purpose of Software Quality Plan The purpose of a Data Warehouse Development Quality Plan is to ensure that the development, deployment, and maintenance of the data warehouse, along with its associated components such as third-party data sources, Azure Data Factory (ADF) pipelines, and Data Marts, meet established quality standards. This plan serves as a comprehensive … Read more

Development Programme

Project Plan or other project management document. Data Discovery Stage: Objective: Identify data sources, understand data requirements, and define the scope of the DataMart project. Key Activities: Deliverables: Design Stage: Objective: Design the architecture, schema, and data models for the DataMart. Key Activities: Deliverables: MVP (Minimum Viable Product) Stage: Objective: Develop and deploy a minimum … Read more

Support

Organisation/Team responsible for management of Service Desk requests/incidents. Escalation Routes All reports WILL provide a method of raising a Service Desk request for support.  Once raised the following escalation routes will be followed. Issues Reporting (IT Service Desk) Stage Activity Comments Call Received Greet & Verify IdentityDocument Caller Details   Problem Identification Listen & Clarify … Read more

Control of Data

Identification of controlled data, its classification, and associated processes. Data Identification Inventory of Data Assets Data Cataloguing: Create a comprehensive inventory of all data assets across your organisation, including databases, data warehouses, data marts, and data lakes. Data Source Mapping: Identify and map all data sources, including internal systems and third-party sources. All Data Sources … Read more

Design Methodologies and Environment

Design methodology, development platform, hardware requirements, database. All Software Quality Procedures Continuous Integration and Continuous Deployment (CI/CD) Continuous Integration and Continuous Deployment (CI/CD) ensures that changes and updates to data pipelines are tested, integrated, and deployed to production, facilitating consistent and reliable data processing and delivery. In dynamic data environments where sources, formats, and requirements … Read more

Reporting

Reporting of progress to customer/project manager[SW1] [JK2] . Data Engineering SHALL use a Jira(R) style ticket management tool to plan, track and release tickets and tasks for the purpose demonstrating the entire development lifecycle.  Allowing the wider team to move work forward, stay aligned and communicate in context. Data Engineering has chosen to use an Open Source … Read more