CHAPTER # 4

DATA RESOURCE MANAGEMENT DATA RESOURCE MANAGEMENT

• • • • • • •

Traditional File Processing Database Management Approach Logical Data Elements Database Structures Database Development Types of Databases Data Warehouses And Data Mining

Traditional File Processing: •

Systems in which data are organized, stored, and processed in independent files of data records.



In the traditional file processing approach each business application was designed to use one or more specialized data files containing only specific types of data records.



Problems:

o o o o

Data Redundancy: Independent data files included a lot of duplicated data. Lack of Data Integration: specific request of data from various files cannot be fulfilled. Data dependence (files, storage devices, software) Lack of Data Integrity or Standardization. In file processing systems, it was easy for data elements such as stock numbers and customer addresses to be defined differently by different end users and applications.

Database Management Approach: •

To solve the problems encountered with the file processing approach, the database management approach was conceived as the foundation of modern methods for managing organizational data.



The database management approach consolidates data records, formerly held in separate files, into databases that can be accessed by many different application programs.



In addition, a database management system (DBMS) serves as a software interface between users and databases, which helps users easily access the data in a database.



Thus, database management involves the use of database management software to control how databases are created, interrogated, and maintained to provide information that end users need.

Page # 1

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4





DATA RESOURCE MANAGEMENT

Examples of popular mainframe and server versions of DBMS software are IBM’s DB2Universal Database, Oracle 10g by Oracle Corp., and MySQL, a popular open-source DBMS.

The three main functions of DBMS software are to create, maintain and use the database of an organization.

Page # 2

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

DATABASE INTERROGATION: End users use a DBMS query feature or report generator • Response is video display or printed report • No programming is required Query language • Immediate response to ad hoc data requests Report generator • Quickly specify a format for information you want to present as a report • SQL Queries • Structured, international standard query language found in many DBMS packages • Query form is SELECT…FROM…WHERE…

Logical Data Elements: Character • A single alphabetic, numeric, or other symbol Field or data item • Represents an attribute (characteristic or quality)of some entity (object, person, place, event) • Examples: salary, job title Record • Grouping of all the fields used to describe the attributes of an entity • Example: payroll record with name, SSN, pay rate File or table Page # 3

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4 •

DATA RESOURCE MANAGEMENT

A group of related records

Database • An shared collection of logically related data elements • Concept of Primary key and Foreign Keys

Page # 4

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

DATABASE STRUCTURES: 1. 2. 3. 4. 5.

Hierarchical Structure Network Structure Relational Structure Multi-dimensional Structure Object-oriented Structure

1. Hierarchical Structure: • • •

Early DBMS structure Records arranged in tree-like structure All of the relationships among records are one-to-many because each data element is related to only one element above it.

2. Network Structure: • •

Page # 5

Used in some mainframe DBMS packages Many-to-many relationships

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

3. Relational Structure: • • • • •

Most widely used structure Data elements are stored in tables Row represents a record; column is a field Can relate data in one file with data in another, if both files share a common data element Relational Operations: o Select  Create a subset of records that meet a stated criterion  Example: employees earning more than $30,000 o Join  Combine two or more tables temporarily  Looks like one big table o Project  Create a subset of columns in a table5

4. Multi-dimensional Structure: • • • •

Page # 6

Variation of relational model Uses multidimensional structures to organize data Data elements are viewed as being in cubes Popular for analytical databases that support Online Analytical Processing (OLAP)

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

5. Object Oriented Structure: • The object-oriented model is considered one of the key technologies of a new generation of multimedia Web-based applications. •

An object consists of data values describing the attributes of an entity, plus the operations that can be performed upon the data.



Encapsulation o Combine data and operations



Inheritance o New objects can be created by replicating some or all of the characteristics of parent objects

• •

Used in object-oriented database management systems (OODBMS) Supports complex data types more efficiently than relational databases

DATABASE DEVELOPMENT: Database Administrator (DBA) • In charge of enterprise database development • Improves the integrity and security of organizational databases • Uses Data Definition Language (DDL) to develop and specify data contents, relationships, and structure • Stores these specifications in a data dictionary or a metadata repository. Page # 7

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

A data dictionary • Contains data about data (metadata) • Relies on specialized software component to manage a database of data definitions It contains information on… • The names and descriptions of all types of data records and their interrelationships • Requirements for end users’ access and use of application programs • Database maintenance • Security

Database Development:

Page # 8

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

Data Planning Process: Database development is a top-down process • Develop an enterprise model that defines the basic business process of the enterprise •

Define the information needs of end users in a business process



Identify the key data elements that are needed to perform specific business activities (entity relationship diagrams)

Database Design Process: Data relationships are represented in a data model that supports a business process • This model is the schema or subschema on which to base… o The physical design of the database o The development of application programs to support business processes Logical Design o Schema -overall logical view of relationships o Subschema -logical view for specific end users o Data models for DBMS Physical Design o How data are to be physically stored and accessed on storage devices

Page # 9

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

Data Resource Management: Data resource management is a managerial activity • Uses data management, data warehousing, and other IS technologies • Manages data resources to meet the information needs of business stakeholders

TYPES OF DATABASES: 1. 2. 3. 4.

Operational Databases Distributed Databases External Databases Hypermedia Databases

Page # 10

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

1. OPERATIONAL DATABASES: Stores detailed data needed to support business processes and operations • •

Also called subject area databases(SADB), transaction databases, and production databases Database examples: customer, human resource, inventory etc.

2. DISTRIBUTED DATABASES: Distributed databases are copies or parts of databases stored on servers at multiple locations • improves database performance at worksites Advantages • Protection of valuable data • Data can be distributed into smaller databases • Each location has control of its local data • All locations can access any data, any where Disadvantages • Maintaining data accuracy Replication • Look at each distributed database and find changes • Apply changes to each distributed database • Very complex

Page # 11

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT

Duplication • One database is master • Duplicate the master after hours, in all locations • Easier to accomplish 3. EXTERNEL DATABASES: Databases available for a fee from commercial online services, or free from the Web • Examples: hypermedia databases, statistical databases, bibliographic and full text databases • Search engines like Google or Yahoo are external databases 4. HYPERMEDIA DATABASES: A hypermedia database contains • Hyperlinked pages of multimedia • Interrelated hypermedia page elements, rather than interrelated data records

DATA WAREHOUSES: Stores static data that has been extracted from other databases in an organization • •

Central source of data that has been cleaned, transformed, and cataloged Data is used for data mining, analytical processing, analysis, research, decision support

Data warehouses may be divided into data marts • Subsets of data that focus on specific aspects of a company (department or business process)

Page # 12

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

Page # 13

DATA RESOURCE MANAGEMENT

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

DATA RESOURCE MANAGEMENT



Figure 5.17 illustrates the components of a complete data warehouse system.



Data from various operational and external databases are captured, cleaned, and transformed into data that can be better used for analysis.



This acquisition process might include activities like consolidating data from several sources, filtering out unwanted data, correcting incorrect data, converting data to new data elements, or aggregating data into new data subsets.



These data are then stored in the enterprise data warehouse, from which they can be moved into data marts or to an analytical data store that holds data in a more useful form for certain types of analysis.



Metadata (data that define the data in the data warehouse) are stored in a metadata repository and cataloged by a metadata directory.



Finally, a variety of analytical software tools can be provided to query, report, mine, and analyze the data for delivery via Internet and intranet Web systems to business end users.



Unlike a typical database in which changes can occur constantly, data in a data warehouse are static. which means that once the data are gathered up, formatted for storage, and stored in the data warehouse, they will never change.

DATA MINING: •

Data mining is a major use of data warehouse databases and the static data they contain



In data mining, the data in a data warehouse are analyzed to reveal hidden patterns and trends in historical business activity.



This analysis can be used to help managers make decisions about strategic changes in business operations to gain competitive advantages in the marketplace.



See Figure 5.19. Data mining can discover new correlations, patterns, and trends in vast amounts of business data (frequently several terabytes of data) stored in data warehouses



Data mining software uses advanced pattern recognition algorithms, as well as a variety of mathematical and statistical techniques, to sift through mountains of data to extract previously unknown strategic business information.

Page # 14

Compiled By : Muhammad Shabbir Hussain

CHAPTER # 4

Page # 15

DATA RESOURCE MANAGEMENT

Compiled By : Muhammad Shabbir Hussain

Lecture # 4 Data Resource Management.pdf

Connect more apps... Try one of the apps below to open or edit this item. Lecture # 4 Data Resource Management.pdf. Lecture # 4 Data Resource Management.

723KB Sizes 2 Downloads 231 Views

Recommend Documents

Lectures / Lecture 4
Mar 1, 2010 - Exam 1 is next week during normal lecture hours. You'll find resources to help you prepare for the exam, which will be comprehensive, on the.

Lectures / Lecture 4
Mar 1, 2010 - course website. After lecture today, there will also be a review section. • Assignments are graded on a /–, /, /+ basis whereas exams are graded.

Lecture: 4
Page 1 ... WAP to print ASCII value of a given digit or alphabet or special character. WAP to input two ... WAP to create a Guessing game using three player.

Lecture 4 Big Data Storage and Processing
A collection of large and complex data sets which are difficult to process using common database management tools or traditional data processing applications. Big data is not just about size. ▫ Finds insights from complex, noisy, heterogeneous, str

Lecture 4 of 4.pdf
Page 2 of 13. Lecture 4. • REFERENCING EXTERNAL FILES. • ODS. • LAG & RETAIN. • ARRAYS. • SAS GRAPH. •MACROS. •STATA. Page 2 of 13 ...

lecture 4: linear algebra - GitHub
Inverse and determinant. • AX=I and solve with LU (use inv in linalg). • det A=L00. L11. L22 … (note that Uii. =1) times number of row permutations. • Better to compute ln detA=lnL00. +lnL11. +…

Lecture 2 of 4.pdf
Page 1 of 40. Data Processing with PC-SAS. PubH 6325. J. Michael Oakes, PhD. Associate Professor. Division of Epidemiology. University of Minnesota.

Lecture 4: Bayes' Law
If a rat dies in the first experiment, it diminishes the probability he survives ... The red oval represent the event A that the sum of the two dice is 7 or 8. It contains ...

Lecture 3 of 4.pdf
Page 1 of 34. Data Processing with PC-SAS. PubH 6325. J. Michael Oakes, PhD. Associate Professor. Division of Epidemiology. University of Minnesota.

EE 396: Lecture 4
where Id : U→U is the identity map, and so if u1,u2 ∈ U, then .... Recall from signal processing, that if we have a linear system that is also shift-invariant (some-.

data mining lecture notes pdf
Download. Connect more apps... Try one of the apps below to open or edit this item. data mining lecture notes pdf. data mining lecture notes pdf. Open. Extract.

data mining lecture notes pdf
Loading… Page 1. Whoops! There was a problem loading more pages. data mining lecture notes pdf. data mining lecture notes pdf. Open. Extract. Open with.

Lecture-6 Data presentation systems.pdf
Whoops! There was a problem loading more pages. Lecture-6 Data presentation systems.pdf. Lecture-6 Data presentation systems.pdf. Open. Extract. Open with.

Lecture 4-Principles of Environmental Control and Micro-climate .pdf ...
Lecture 4-Principles of Environmental Control and Micro-climate .pdf. Lecture 4-Principles of Environmental Control and Micro-climate .pdf. Open. Extract.

SE-Lecture Notes-Unit-4.pdf
never be required to type operating system commands from within application software). Design for direct interaction with objects that appear on the screen.

SE-Lecture Notes-Unit-4.pdf
Retrying... SE-Lecture Notes-Unit-4.pdf. SE-Lecture Notes-Unit-4.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying SE-Lecture Notes-Unit-4.pdf.

data resource management pdf
data resource management pdf. data resource management pdf. Open. Extract. Open with. Sign In. Main menu. Displaying data resource management pdf.

Lecture 7
Nov 22, 2016 - Faculty of Computer and Information Sciences. Ain Shams University ... A into two subsequences A0 and A1 such that all the elements in A0 are ... In this example, once the list has been partitioned around the pivot, each sublist .....

lecture 3: more statistics and intro to data modeling - GitHub
have more parameters than needed by the data: posteriors can be ... Modern statistical methods (Bayesian or not) .... Bayesian data analysis, Gelman et al.

Shmueli -Data Mining In Excel- Lecture Notes and Cases.pdf ...
Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Shmueli -Data Mining In Excel- Lecture Notes and Cases.pdf.

Lecture 1 Economic Data and Econometrics
September 15, 2016 ... particular, the GDP growth rates are now computed using the ... Taiwan's Annual Unemployment Rates: 1978–2015. 0. 1. 2. 3. 4. 5 ...... Saving. Lowest 20% 0.155 0.095 0.076 -0.001 rate. Highest 20% 0.378 0.359 0.275.

LECTURE - CHECKLIST
Consider hardware available for visual aids - Computer/ Laptop, LCD ... Decide timing- 65 minutes for lecture and 10 minutes for questions and answers.