🎓 DSS Exam Study Guide

Complete interactive English version · DSS · BI · Data Science · Analytics · AI · Visualization · Dashboards

Source scope checked

This English interactive guide was rebuilt after re-checking the full DSS exam PDF, not only the previous Romanian HTML. The PDF scope is:

  • Turban, Pollard & Wood, Chapter 6: Business Intelligence, Data Science and Data Analytics, pp. 199-248.
  • Sharda, Delen & Turban, Chapter 1: Overview of BI, Analytics, Data Science, and AI, pp. 2-65.
  • Sharda, Delen & Turban, Chapter 3: Nature of Data, Statistical Modeling, and Visualization, pp. 117-138 and 163-192.
  • Important exam instruction: exclude the statistics part and focus on visualization, dashboards, visualization types, and lab-related aspects.

Correction from the previous version: this version adds the missing parts about business pressures, computerized decision support, DSS framework, AI, analytics ecosystem, data-to-knowledge continuum, data quality, preprocessing details, reporting cycle, visual analytics, dashboard design, and richer quiz/flashcards.

3-hour study plan

45'
Chapter 6: BI, DS, Big Data
35'
Chapter 1: DSS, Analytics, AI
45'
Chapter 3: Data + Visualization
35'
Quiz + error review
20'
Cheat sheet + flashcards
00:00-00:45

Chapter 6

  • Three analytics levels: descriptive, predictive, prescriptive.
  • BI vs Data Science: known unknowns vs unknown unknowns.
  • Big Data: big vs small data and the 4 Vs.
  • Descriptive tools: data mining, visualization, dashboards, mashups.
  • Predictive/prescriptive methods: text mining, spatial mining, regression, optimization/rules, machine learning.
00:45-01:20

Chapter 1

  • Business pressures that create the need for computerized decision support.
  • Simon decision phases and decision modeling.
  • Gorry-Scott-Morton DSS framework.
  • DSS characteristics, subsystems, DSS vs BI.
  • Evolution from DSS/MIS/EIS to BI, analytics, Big Data, AI.
  • AI overview, convergence of analytics + AI + IoT + Big Data, analytics ecosystem.
01:20-02:05

Chapter 3

  • Data-to-knowledge continuum; data types and taxonomy.
  • Analytics-ready data characteristics and preprocessing.
  • Business reporting categories and report quality.
  • Charts/graphs: when to use each one.
  • Information visualization vs visual analytics.
  • Dashboard layers, dashboard design, dashboard best practices.
02:05-02:40

Quiz

  • Answer without notes first.
  • Read the explanation after each question.
  • Redo missed topics using the cheat sheet.
02:40-03:00

Final memorization

  • Review all numbered lists: 3 levels, 4 phases, 4 Vs, 4 DSS subsystems, 7 Data Science lifecycle stages, 10 data-quality items, 14 DSS characteristics.
  • Use flashcards for definitions.
  • Focus on dashboards and visualization because the PDF explicitly emphasizes them.

Chapter 6 overview: BI, Data Science, and Data Analytics

Chapter 6 connects decision making with analytics. It explains how BI helps describe the past and present, how data science extends analytics into prediction and prescription, and how descriptive tools such as dashboards and mashups support managers.

Four phases of decision making

  1. Intelligence: identify the problem or opportunity, collect information, define goals and criteria.
  2. Design: create possible courses of action, build/analyze alternatives, evaluate them against criteria.
  3. Choice: select one alternative.
  4. Review / implementation / monitoring: execute, monitor, control, and return to earlier phases if needed.

Exam trap: some sources call the fourth stage Review, Implementation, Monitor, or Control. The idea is the same: execute and monitor the chosen solution.

Bounded rationality, satisficing, optimizing

Bounded rationalityDecision makers are limited by cognition, time, available data, and the tractability of the problem.
SatisficingChoosing an alternative that is good enough rather than provably best.
OptimizingFinding the best achievable alternative under constraints. Analytics helps decision makers move closer to optimizing.

The 3 levels of data analytics

LevelQuestion answeredTypical toolsValue/complexity
DescriptiveWhat happened? What is happening?Aggregation, reporting, dashboards, data miningLowest complexity; foundation for BI
PredictiveWhat could happen? Why might it happen?Statistical models, forecasting, text mining, machine learningMedium complexity; predicts future events
PrescriptiveWhat should we do for the best outcome?Optimization, simulation, decision modeling, MLHighest value/complexity; recommends action

BI vs Data Science

CriteriaBusiness IntelligenceData Science
Main roleDescribe and monitor current/past business statePredict future behavior and prescribe actions
Data focusMostly static, structured, often from a single sourceHigh-volume, high-speed, multi-structured data from many sources
Question typeKnown unknowns: we know the question but not the valueUnknown unknowns: we may not even know the best question/model yet
Typical outputReports, dashboards, KPIs, drill-down viewsModels, predictions, prescriptions, data products
Business useOperational visibility, tactical decisionsDiscovery, innovation, optimization, competitive advantage
Known knownSomething we know and know that we know.
Known unknownSomething we do not know, but we know we need to find out. Traditional BI is strong here.
Unknown unknownSomething we do not know and do not realize we do not know. Data science is strong here.

Traditional, modern, embedded, and augmented BI

Traditional BIA snapshot of what is happening now and what happened in the past, often via reports, dashboards, mashups, and visualization. Mostly descriptive.
Modern BIFlexible, self-service, interactive, mobile, real-time, and governed. It helps users find, merge, query, share, and analyze data quickly.
Embedded BIBI tools built directly into existing business applications.
Self-service analyticsBusiness users generate queries, reports, and dashboards with minimal IT support.
Augmented analyticsMachine learning and AI automate data preparation, insight generation, and insight explanation.

Seven attributes of modern BI software

  1. Speed: real-time questions and answers even with large/diverse data.
  2. Visualization: self-service analysis, drill-down, shareable dashboards.
  3. Single source of truth: combines data from different sources.
  4. Real-time collaboration: live data can be filtered, sorted, discussed, transformed, and shared.
  5. Comprehensive governance: safe, trusted, accurate, audited dashboards and reports.
  6. Scalability: start small and grow.
  7. Mobility: works on phones and tablets.

Data Science lifecycle: 7 stages

StageMeaning
1. CaptureUnderstand business requirements, KPIs, data acquisition, data entry, extraction.
2. StoreClean, process, secure, warehouse, and architect data.
3. ModelData mining, model creation, model evaluation, compare models against KPIs.
4. AnalyzeExploratory, descriptive, predictive, prescriptive analytics, text mining.
5. CommunicateBI, reporting, visualization, dashboards, stakeholder validation.
6. DeployDeploy model, real-world testing, manage dependencies, gain business buy-in.
7. ReiterateOperate, monitor KPIs, optimize, retrain model when performance degrades.

Data Science team roles

RoleMain responsibility
Data & Analytics ManagerLeads the team, project management, communication, database/analytics overview.
Data ScientistCleans and organizes big data; predictive modeling; storytelling and visualization.
Data AnalystCollects, processes, and statistically analyzes data; spreadsheets, SQL/NoSQL.
Data ArchitectBlueprints data integration, centralization, protection, and maintenance.
Data EngineerDevelops, constructs, tests, and maintains data architectures and APIs.
StatisticianUses statistical theory/methods, data mining, ML, Hadoop/SQL/cloud tools.
Database AdministratorEnsures database availability, performance, backup/recovery, security.
Business AnalystBridge between business and IT; process improvement, BI understanding, visualization.
Citizen Data ScientistUses advanced analytics tools without formal data-scientist training.

Software and platforms

  • Python: object-oriented, extensible, many free data-analysis libraries.
  • R: open-source, statistical and graphical methods, ML, regression, time-series.
  • Apache Hadoop: open-source, HDFS, distributed processing, scalable storage.
  • Apache Spark: uses RDDs, often much faster than Hadoop because of in-memory processing.
  • Descriptive BI leaders mentioned: Microsoft Power BI, Tableau, Qlik, ThoughtSpot.
  • Advanced analytics leaders mentioned: Alteryx, SAS, Azure Databricks, Tibco, Dataiku, MathWorks.
RDDResilient Distributed Dataset: fault-tolerant, immutable, distributed collection of objects processed in parallel.

Big Data: definition, sources, and 4 Vs

Big DataA data set too large or complex to be analyzed with traditional methods.
VMeaning
VolumeMassive size: terabytes, petabytes, exabytes, zettabytes.
VarietyMany formats: structured, semi-structured, unstructured; numbers, text, audio, images, video.
VelocitySpeed of storage, analysis, reporting, and response: batch, near-time, real-time, live stream.
VeracityData quality/trust: authentic, available, trustworthy, cleansed, complete.

Common sources: social media, sensors, IoT/device data, logs, video, images, web transactions, ERP/CRM, financial data.

Big Data vs Small/Traditional Data

FeatureBig DataTraditional/small enterprise data
SourcesSocial media, sensors, logs, devices, images/video, IoTERP, CRM, web transactions, general ledger, operational systems
SizePetabytes/exabytes/zettabytes possibleGigabytes/terabytes more typical
ResponseOften requires immediate or near-real-time responseOften batch or not immediate
StructureUnstructured and multi-structured commonStructured data more common
ProblemHard to store/process/analyze traditionallyFits conventional database/BI workflows better

Big Data goals and challenges

  • Goals: establish data-driven culture, innovate/disrupt with technology, accelerate services, launch products, improve processes.
  • Cultural barriers: data silos, lack of professionals, lack of management support, resistance to change.
  • Technical barriers: managing volume/variety/velocity/veracity and presenting results effectively.
  • Human expertise remains essential: analytics results need judgment, interpretation, and ethical use.

Four descriptive analytics tools

ToolExam definition/use
Data miningSoftware/methods that analyze data from multiple perspectives, categorize data, and discover correlations or patterns. Often the first descriptive step.
Data visualizationRepresenting abstract data as images, graphs, charts, diagrams, maps, or animations to make meaning easier to understand.
Digital dashboardsSingle-screen interfaces that combine multiple visualizations and KPIs for at-a-glance monitoring and interaction.
Data mashupsCombining two or more data sets from internal/external systems without relying on the middle ETL step into a data warehouse or heavy IT support.
Affinity analysisData mining technique that discovers co-occurrence relationships, e.g., products often bought together.
Drill downMoving from summary/general information to more detailed information.

Data visualization and AR

  • Visualization works because humans understand patterns, colors, and spatial relationships quickly.
  • Common business value: identify improvement areas, clarify customer behavior, improve product placement, predict sales by location.
  • Heat maps use color intensity to reveal patterns; more complex heat maps can analyze website clicks and search behavior.
  • Augmented Reality is a high-end visualization form that combines 3D, smart mapping, machine learning, and natural language processing.

Dashboards in Chapter 6

Dashboards combine multiple data feeds and visualizations into a single interactive screen. They are not merely static reports; they connect to accounting, ERP, CRM, web analytics, and other systems to provide current operational visibility.

Dashboard componentMeaning
DesignVisualization method plus captions; often includes infographic-style display.
Performance metricsKPIs and real-time content. Metrics must match dashboard purpose.
APIConnects disparate data sources and feeds.
AccessPreferred via secure web browser, often from mobile devices.
  • Benefits: visibility, continuous improvement, single sign-on/access, budget deviation tracking, accountability.
  • Metrics examples: finance: net profit, cash balance, A/R and A/P; sales: leads, proposals, close rate; web: visitors, keywords, traffic sources; operations: inventory, on-time delivery, product output.

Mashups for actionable dashboards

  • Mashups reduce the time and effort needed to combine data sources.
  • Users can combine fields from different sources and import spreadsheets or competitor data.
  • They enable complex queries through drag-and-drop tools for non-experts.
  • Important distinction: mashups often work behind the scenes; dashboards are visible and interactive. Mashups can feed dashboards.

Five predictive/prescriptive methods

MethodWhat it doesExample value
Text miningExtracts high-quality information from unstructured text: concepts, topics, patterns, sentiment.Analyze customer comments, claims notes, social posts, documents.
Spatial data mining / GISUses geographic/location data, geocoding, maps, and spatial relationships.Choose store locations, target sales visits, map complaints, analyze markets.
Regression modelingFinds relationships between dependent and independent variables; time-series estimates movement over time.Demand forecasting, price/demand analysis, risk modeling.
Decision optimization and rules-based decision makingCalculates variable values that optimize an objective; rules help novices decide like experts.Scheduling, routing, allocation, underwriting, logistics.
Machine learningAlgorithms learn patterns in big data and use them for classification, prediction, pattern discovery, anomaly detection.Personalization, image recognition, failure prediction, fraud/anomaly detection.

Time-series analysis: trend, rate of change, and cycles. ML tasks: categorize, predict, identify patterns, detect unexpected behavior.

Chapter 1 overview: decision support, analytics, data science, AI

Chapter 1 explains why organizations need computerized support for managerial decision making, how DSS evolved into BI/analytics/AI, and how analytics and AI fit in a broader ecosystem.

Changing business environment and need for decision support

  • Organizations face complex, changing, uncertain, and competitive environments.
  • Strategic, tactical, and operational decisions must often be made quickly and sometimes in real time.
  • Decisions require data, information, and knowledge; manual processing is often too slow.
  • Computerized support helps with speed, consistency, communication, collaboration, and complex analysis.
  • Automation is increasingly affecting knowledge work and decision work.

Decision-making process and decision modeling

  1. Intelligence: scan reality and identify the problem/opportunity.
  2. Design: develop and analyze possible solutions/models.
  3. Choice: select the solution/model alternative.
  4. Implementation: put the solution to work.
  5. Monitoring: evaluate results and feed back into earlier phases.

Chapter 1 uses Simon’s model and emphasizes that decision making/modeling is iterative, not a simple straight line.

Why decision making is difficult

  • Data may be unavailable, expensive, imprecise, subjective, or insecure.
  • Important data may be qualitative or “soft.”
  • There may be information overload.
  • Outcomes can appear over a long time, making evaluation hard.
  • Future conditions may differ from historical data.
  • Many decisions involve multiple objectives and constraints.

Managers, decisions, and support needs

Management levelTypical decisionsSupport emphasis
Operational controlDaily, routine, repetitive operationsEfficiency, transaction processing, reports, structured decisions
Managerial controlResource acquisition/use, departmental performanceMonitoring, analysis, semistructured decisions
Strategic planningLong-term goals and policiesExternal data, uncertainty, unstructured decisions, models/scenarios

Gorry-Scott-Morton DSS framework

The framework combines two dimensions: type of control and degree of structuredness, producing a 3×3 matrix.

DimensionCategories
Type of controlOperational control, managerial control, strategic planning
Degree of structurednessStructured, semistructured, unstructured
  • Structured decisions: repetitive, routine, standard procedures, can often be automated.
  • Semistructured decisions: some parts are clear/model-based, others require judgment.
  • Unstructured decisions: complex, fuzzy, novel, require human judgment.

Decision Support System (DSS)

DSSA computer-based information system that supports managers in semistructured or unstructured decision situations; it extends but does not replace the decision maker.
  1. Supports semistructured/unstructured decisions.
  2. Supports managers at all levels.
  3. Supports individuals and groups.
  4. Supports interdependent/sequential decisions.
  5. Supports all phases of decision making.
  6. Supports different decision processes and styles.
  7. Is flexible and adaptable.
  8. Uses friendly interfaces, graphics, and sometimes natural language.
  9. Improves decision effectiveness more than mere efficiency.
  10. Leaves the decision maker in control.
  11. Can be developed partly by end users.
  12. Includes models.
  13. Can access many data sources.
  14. Can be stand-alone, integrated, or web-based.

DSS subsystems

SubsystemPurpose
Data ManagementDatabase and DBMS; internal/external data; may connect to data warehouse.
Model ManagementModel base and MBMS; statistical, financial, optimization, simulation, forecasting models.
User InterfaceDialog/interface through browser, mobile, voice, graphics, interaction.
Knowledge-Based ManagementAdds intelligence/expertise and can support other components.

DSS vs BI

DSSBI
Built to solve or support a specific decision problem.Monitors situations and identifies problems/opportunities.
Model-driven and problem-solving oriented.Report/dashboard/data-driven monitoring is central.
Often supports semistructured/unstructured decisions.Often supports descriptive analysis and performance management.

Evolution of computerized decision support

PeriodKey concepts
1960s-1970sTPS, MIS, routine reports, early DSS.
1980sExecutive Information Systems, Expert Systems, ERP.
1990sData Warehousing, OLAP, dashboards, scorecards, Business Intelligence.
2000sAnalytics, data/text/web mining, BPM, cloud, predictive analytics.
2010sBig Data, Hadoop, social, mobile, AI/ML, analytics/data science.
2020sAutomation, robotics, deep learning, augmented/AI-driven decision support.

BI methodology and architecture

BI typically combines data sources, data warehousing, business analytics, business performance management, and user interfaces such as dashboards.

ComponentMeaning
Data warehouse (DW)Integrated repository that supports reporting and analysis.
Business analytics toolsQuery, reporting, OLAP, mining, visualization, predictive tools.
BPMBusiness Performance Management: strategy, metrics, scorecards, performance monitoring.
User interfaceDashboards, portals, visual reports, scorecards.
Data MartSmaller, subject/department-focused data warehouse subset.
Operational Data Store (ODS)Operational/staging store updated during operations; often feeds DW.
Enterprise Data Warehouse (EDW)Large-scale integrated data warehouse across the enterprise.
OLTPOnline Transaction Processing: immediate operational transactions, e.g., ATM/POS.
OLAPOnline Analytical Processing: ad-hoc analysis, queries, reporting, multidimensional analysis.

Analytics overview from Chapter 1

TypeQuestionEnablersTypical output
DescriptiveWhat happened / what is happening?Reporting, dashboards, scorecards, DW, OLAPWell-defined problems, historical visibility
PredictiveWhat will happen?Data/text/web mining, forecasting, MLAccurate projections, patterns, probabilities
PrescriptiveWhat should we do?Optimization, simulation, decision modeling, expert systemsRecommended actions and best decisions

Prescriptive analytics often builds on descriptive and predictive analytics. It is usually not wise to jump to prescription without understanding and prediction.

Selected analytics domains

  • Sports: player recruitment, pricing, training, strategy, injury prevention, Moneyball-style insights.
  • Healthcare: risk prediction, care management, patient analytics, fraud prevention, resource utilization.
  • Retail: assortment, segmentation, pricing, promotions, location, inventory, market basket analysis, customer behavior.

Artificial Intelligence overview

AITechnologies that perform tasks requiring human-like intelligence: learning, language understanding, reasoning, perception, decision making, and problem solving.
  • AI can learn and improve over time.
  • AI can understand human language, answer questions, recognize patterns, and automate work.
  • Benefits include lower cost, faster work, consistency, productivity, profitability, and competitive advantage.
  • Applications include cybercrime detection, e-commerce decisions, high-frequency trading, agriculture, healthcare, robotics, smart devices, and decision automation.

Types of AI by autonomy

TypeMeaning
Assisted intelligenceAutomates existing tasks and helps humans do current work better.
Augmented intelligenceHuman and machine work together; AI enhances human decisions.
Autonomous intelligenceSystems act independently with minimal human intervention.

Why analytics initiatives can fail

  • Predictive models may have unintended effects.
  • Models must be used ethically, responsibly, and mindfully.
  • Analytics may work well for some applications but not others.
  • Garbage-in, garbage-out: data and assumptions determine quality.
  • Data can be incomplete, obsolete, inaccurate, inconsistent, or low quality.
  • Data from different sources can vary in format and quality.

Convergence of analytics, AI, IoT, and Big Data

  • Big Data empowers AI through cheaper processing, large online data sets, and scalable/deep-learning algorithms.
  • IoT produces sensor/device streams that feed analytics and AI.
  • AI and analytics together can support speech recognition, anomaly detection, underwriting, claims processing, smart buildings, autonomous systems, and connected products.
  • Convergence matters because many modern systems combine data collection, prediction, automation, and action.

Analytics ecosystem

The analytics ecosystem includes technology providers, accelerators/intermediaries, and user organizations.

CategoryExamples/meaning
Outer petals: technology providersData generation infrastructure, data management infrastructure, DW providers, middleware, data service providers, analytics software developers, application developers.
Inner petals: acceleratorsAcademic institutions/certification agencies, regulators/policy makers, analysts/influencers.
CoreAnalytics user organizations: the organizations applying analytics to decisions and performance.

Number to memorize: the ecosystem model has 11 sectors grouped into 3 categories.

Chapter 3 scope: Nature of Data and Preprocessing

The exam instruction excludes the statistical modeling part. This guide focuses on data nature, data taxonomy, preprocessing, business reporting, visualization, visual analytics, and dashboards.

Data-to-knowledge continuum

StageMeaning
DataRaw facts, observations, transactions, measurements, sensor readings.
InformationData processed into organized, meaningful context.
KnowledgeInformation interpreted and connected to experience, patterns, rules, and decisions.

Analytics exists to move from raw data toward useful knowledge and decisions.

Analytics-ready data: 10 characteristics

CharacteristicMeaning
Source reliabilityOriginality and appropriateness of the source; whether the source can be trusted.
Content accuracyData correctly represent the real entity/event and match the intended measurement.
AccessibilityData are readily obtainable by authorized users.
Security & privacyData are protected from unauthorized access and privacy rules are respected.
RichnessAll required elements and enough dimensions/detail are included.
ConsistencyData are collected/combined consistently across sources and time.
Currency / timelinessData are up-to-date enough for the decision.
GranularityData are at the right level of detail; aggregates cannot always be reliably disaggregated.
ValidityData values match expected formats/ranges/definitions.
RelevancyVariables are relevant to the analysis and business question.

Simple taxonomy of data

TypeSubtypesExamples / exam notes
Structured dataCategorical, numericFits rows/columns and predefined fields.
Categorical / discreteNominal, ordinalNominal has no order; ordinal has rank/order.
Numeric / continuousInterval, ratioInterval has no true zero; ratio has true zero and ratios are meaningful.
Unstructured dataText, image, audio, videoNo simple row/column format; often needs text/image/audio processing.
Semi-structured dataXML, JSON, logs, web dataSome tags/structure but not as rigid as relational tables.
1-of-N representationA method for converting categorical variables into binary pseudo-variables.

Art and science of data preprocessing

Preprocessing is essential because analytics projects fail when data problems are ignored. The four main tasks are consolidation, cleaning, transformation, and reduction.

StepPurposeTypical methods/problems
Data consolidationCollect, select, and integrate data from sources.SQL, web services, ETL/ELT, source selection, schema matching.
Data cleaningFix missing values, noise, inconsistencies, duplicates, outliers.Imputation, smoothing, deduplication, validation, anomaly handling.
Data transformationPut data into suitable form for analysis.Normalization, discretization, aggregation, attribute construction, categorical encoding.
Data reductionReduce rows/columns/complexity while preserving meaning.Sampling, dimensionality reduction, variable selection, aggregation, balancing skewed data.
ImputationFilling missing values with mean, median, minimum, maximum, mode, or other estimated values.
Stratified samplingSampling that preserves proportional representation of important subgroups.
Dimensional reductionReducing variables/features, e.g., PCA/ICA or feature selection.

Preprocessing exam traps

  • Aggregated data cannot always be safely disaggregated.
  • Missing values are not always safe to delete; deletion can bias results.
  • The most challenging reduction is often column/variable reduction because it can remove explanatory power.
  • Data cleaning is not just formatting; it includes missing values, noise, duplicates, and validity issues.
  • Lab work often tests whether you can inspect data quality before visualizing it.

Business reporting

Business reportA document containing organized information in narrative, graphical, and/or tabular form, prepared periodically or ad hoc.

Five functions of business reports

  1. Ensure departments are functioning properly.
  2. Provide information.
  3. Present results of analysis.
  4. Persuade others to act.
  5. Create organizational memory.

Three major categories

  1. Metric management reports: SLAs, KPIs, Six Sigma/TQM-style performance tracking.
  2. Dashboard-type reports: multiple performance indicators on one page, often with traffic-light colors.
  3. Balanced Scorecard-type reports: financial, customer, business process, learning and growth perspectives.

Good reporting qualities: clarity, brevity, completeness, correctness.

Data / information visualization

Information visualizationUse of visual representations to help people understand data and information, especially what happened or what is happening.
  • Visualization is central to BI and analytics because it reduces cognitive load and reveals patterns.
  • It is connected to information graphics, scientific visualization, exploratory data analysis, and human-computer interaction.
  • Historical examples include Playfair’s line/bar/time-series charts and Minard’s Napoleon campaign chart, praised for encoding multiple dimensions.

Chart selection: basic charts

ChartBest useCommon trap
Line chartTime series and change over time.Not ideal for many unrelated categories.
Bar chartComparing nominal/categorical data; vertical, horizontal, stacked.Often better than pie when many categories exist.
Pie chartSimple part-to-whole proportions with few categories.Avoid with more than about 4 categories or small differences.
Scatter plotRelationship between two variables; trends, clusters, outliers.Does not prove causation by itself.
Bubble chartScatter plot plus bubble size/color for extra variables.Can become cluttered quickly.

Chart selection: specialized charts

ChartBest use
HistogramFrequency distribution of numeric variables; not the same as a bar chart.
Gantt chartProject schedule, durations, overlaps, resources, milestones.
PERT / network chartLarge project planning; precedence relationships; activity-on-node or activity-on-arrow.
Geographic mapLocation-based data and GIS analysis.
Bullet graphProgress toward a target; compact alternative to gauges/meters/thermometers.
Heat mapTwo dimensions/intensity shown through color; strong/weak intersections.
Highlight tableHeat map plus numbers for exact detail.
Tree mapHierarchical data using nested rectangles.
Gapminder/bubble time chartMultidimensional data over time; e.g., wealth and health of nations.

Abela chart taxonomy: choose by message

What you want to showCommon visual choices
RelationshipScatter plot, bubble chart.
ComparisonBar chart, line chart, column chart.
DistributionHistogram, scatter plot, box plot if statistics included.
CompositionPie chart, stacked bar, stacked area, tree map.

Information visualization vs visual analytics

Information VisualizationVisual Analytics
Answers: What happened? What is happening?Answers: Why is it happening? What will happen next?
Strongly associated with BI and descriptive analytics.Combines visualization with analytical reasoning, predictive analytics, segmentation, correlation, forecasting.
Often focuses on presentation and exploration.Often supports deeper investigation and decision making.

Data storytelling

  1. Think of the analysis as a story with a narrative structure.
  2. Be authentic; the story should flow naturally from the data.
  3. Be visual; think like a film editor and remove distracting material.
  4. Make it easy for both audience and presenter.
  5. Invite and guide discussion.

Storytelling matters because reports and dashboards should lead to understanding and action, not just display data.

Visual analytics platforms: key value

  • Empower a broader audience with data exploration and approachable analytics.
  • Use interactive web interfaces to broaden access.
  • Answer complex questions faster.
  • Improve collaboration and information sharing.
  • Free IT from building every report by giving users controlled self-service access.
  • Allow growth from simple reporting to advanced analytics at the organization’s pace.

Information dashboards

Information dashboardA visual, consolidated, at-a-glance screen that presents critical information, usually KPIs, often allowing drill-down/drill-through.

Eckerson’s 3 information layers

LayerMeaning
MonitoringGraphical, abstracted KPI data for quick awareness.
AnalysisSummarized dimensional data used to identify root causes.
ManagementDetailed operational data used to support action.

Dashboards are used for executives, operations, sales, marketing, finance, HR, logistics, healthcare, education, and more.

Dashboard design: good characteristics

  1. Use visual components such as charts, sparklines, gauges, and stoplights appropriately.
  2. Be transparent and require minimal training.
  3. Combine data from multiple systems.
  4. Allow drill-down and drill-through.
  5. Provide dynamic views with timely refresh.
  6. Use minimal custom code where possible.

Dashboard best practices

  1. Benchmark KPIs against industry or organizational standards.
  2. Wrap metrics with contextual metadata.
  3. Validate design with usability specialists or representative users.
  4. Prioritize alerts and exceptions.
  5. Enrich the dashboard with business-user comments where useful.
  6. Present information in three levels: dashboard, static report, self-service cube/detail.
  7. Pick the right visual construct for the message.
  8. Use guided analytics to move average users toward expert use.

Typical comparisons in BI dashboards

  • Current values vs past values.
  • Actual values vs forecasted values.
  • Actual values vs target values.
  • Values vs benchmarks/averages.
  • Multiple instances of the same measure.
  • One measure vs another, e.g., revenue vs cost.

Lab checklist: before building a dashboard

  1. Define the business question and audience.
  2. Identify KPIs and the decision/action each KPI supports.
  3. Check data source reliability, accuracy, timeliness, granularity, and relevance.
  4. Clean and transform data before visualization.
  5. Choose charts by purpose: relationship, comparison, distribution, composition.
  6. Avoid clutter: one screen should support quick understanding.
  7. Use filters/drill-downs when users need exploration.
  8. Test with users and revise based on confusion or wrong interpretation.

Flashcards

Click a card to flip it. Use search to filter terms.

Quiz

Answer each question and read the explanation. You can reset or reveal all answers.

Score

Must-memorize numbered lists

NumberConcept
3Analytics levels: descriptive, predictive, prescriptive.
3Dashboard layers: monitoring, analysis, management.
3Report categories: metric management, dashboard-type, balanced scorecard.
3AI autonomy types: assisted, augmented, autonomous.
3Analytics ecosystem groups: outer petals, inner petals, core.
4Decision phases: intelligence, design, choice, implementation/review/monitoring.
4Big Data Vs: volume, variety, velocity, veracity.
4Descriptive tools: data mining, visualization, dashboards, mashups.
4DSS subsystems: data, model, user interface, knowledge-based.
4Dashboard components: design, performance metrics, API, access.
4BI architecture components: DW, business analytics, BPM, UI.
4Preprocessing steps: consolidation, cleaning, transformation, reduction.
4ML tasks: categorize, predict, identify patterns, detect unexpected behavior.
4Good report qualities: clarity, brevity, completeness, correctness.
4Abela categories: relationship, comparison, distribution, composition.
5Predictive/prescriptive methods: text mining, spatial mining, regression, optimization/rules, ML.
5Business report functions: functioning departments, information, analysis results, persuasion, organizational memory.
6Good dashboard design characteristics.
7Data Science lifecycle stages.
7Modern BI attributes.
8Dashboard best practices.
9Gorry-Scott-Morton cells: 3 controls × 3 structuredness levels.
10Analytics-ready data characteristics.
11Analytics ecosystem sectors.
14DSS characteristics.

High-probability exam traps

  • BI is mainly descriptive and handles known unknowns; Data Science predicts/prescribes and handles unknown unknowns.
  • Predictive says what is likely to happen; prescriptive recommends what to do.
  • DSS supports decision makers; it does not replace them.
  • DSS is usually for specific problem support; BI monitors and identifies problems/opportunities.
  • OLTP is transaction processing; OLAP is analytical processing.
  • A histogram is about frequencies/distributions, not categorical comparison like a bar chart.
  • Pie charts are only good for simple proportions with few categories.
  • Heat map uses color intensity; highlight table adds numbers.
  • Mashups are often behind the scenes; dashboards are visible.
  • Data quality and preprocessing come before visualization.
  • Dashboard design must match KPIs and decisions, not just look attractive.
  • The PDF says to exclude statistics and emphasize visualization and dashboards.