System Management by Exception: One Example of BIRT Data Cubes Usage for Performance Data Analysis

Thursday, September 22, 2011

One Example of BIRT Data Cubes Usage for Performance Data Analysis

I have got the comment on my previous post “BIRT based Control Chart“ with questions about how actually in BIRT the data are prepared for Control Charting. Addressing this request I’d like to share how I use BIRT Cube to populate data to CrossTab object which was used then for building a control chart.

As I have already explained in my CMG paper (see IT-Control Chart), the data that describes the IT-Control Chart (or MASF control chart) has actually 3 dimensions (actually, it has 2 time dimensions and one measurement - metric as seen in the picture at the left). And the control chart is a just a projection to the 2D cut with actual (current or last) data overlaying. So, naturally, the OLAP Cubes data model (Data Cubes) is suitable for grouping and summarizing time stamped data to a crosstable for further analysis including building a control chart. In the past SEDS implementations I did not use Cubes approach and had to transform time stamped data for control charting using basic SAS steps and procs. Now I found that Data Cubes usage is somewhat simpler and in some cases does not require a programming at all if the modern BI tools (such as BIRT) are used.

Below are the some screenshots with comments that illustrates the process of building the IT-Control Chart by using BIRT Cube.

Data source (Input data) is a table with date/hour stamped single metric with at least 4 months history (in this case it is the CPU utilization of some Unix box). That could be in any database format; in this particular example it is the following CSV file:

The result (in the form of BIRT report designer preview) is on the following picture:

(Where UCL – Upper Control Limit; LCL is not included for simplicity)

Before building the Cube the three following data sets were built using BIRT “Data Explorer”:

(1) The Reference set or base-line (just “Data Set” on the picture) is based on the input raw data with some filtering and computed columns (weekday and weekhour) and
(2) the Actual data set which is the same but having the different filter: (raw[“date”} Greater “2011-04-02”)

(3) To combine both data sets for comparing base-line vs. actual, the “Data Set1” is built as a “Joint Data Set” by the following BIRT Query builder:

Then the Data Cube was built in the BIRT Data Cube Builder with the structure shown on the following screen:

Note only one dimension is used here – weekhour as that is needed for Cross table report bellow.

The next step is building report starting with Cross Table (which is picked as an object from BIRT Report designer “Pallete”):

The picture above shows also what fields are chosen from Cube to Cross table.

The final step is dropping “Chart” object from “Palette” and adding UCL calculation using Expression Builder for additional Value (Y) Series:

To see the result one needs just to run the report or to use a "preview' tab on the report designer window:

FINAL COMMENTS

- The BIRT report package can be exported and submitted for running under any portals (e.g. IBM TCR).

- Additional Cube dimensions makes sense to specify and use, such as server name or/and metric name.

- The report can be designed in BIRT with some parameters. For example, good idea is to use a server name as the report parameter.

- To follow the “SEDS” idea and to have the reporting process based on exceptions, the preliminary exception detection step is needed and can be done again within a BIRT report using the SQL script similar with published in one of the previous post:

Igor Trubin

He started in 1979 as IBM/370 system engineer. In 1986 he got his PhD. in Robotics at St. Petersburg Technical University (Russia) and then worked as a professor teaching CAD/CAM, Robotics for 12 years. He published 30+ papers and made several presentations for conferences related to the Robotics and Artificial Intelligent fields. In 1999 he moved to the US, worked at Capital One bank as a Capacity Planner. His first CMG.org paper was written and presented in 2001. The next one, "Exception Detection System Based on MASF Technique," won a Best Paper award at CMG'02 and was presented at UKCMG'03 in Oxford, England. He made other tech. presentations at IBM z/Series Expo, SPEC.org, Southern and Central Europe CMG and ran several workshops covering his original method of Anomaly and Change Point Detection (Perfomalist.com). Author of “Performance Anomaly Detection” class (at CMG.com). Worked 2 years as the Capacity team lead for IBM, worked for SunTrust Bank for 3 years and then at IBM for 3 years as Sr. IT Architect. Now he works for Capital One bank as IT Manager at the Cloud Engineering and since 2015 he is a member of CMG.org Board of Directors. Runs UT channel iTrubin

48 comments:

Igor TrubinSeptember 22, 2011
I would like to repeat the same exercise (Cube usage for Control charting) against the same data but stored in some MySQL table. Plus in opposed to non-programming approach I am interested in developing some SQL script to do the same data transformation and then to chart using BIRT or R.

Lastly my plan is to do it all using R meaning to develop some R based open source type of application (SEDS-lite) to
- Connect to database (MySQL as the test example)
- Filter out exceptions
- For each exception to transform data for Control charting
- Build control charts
- Put the list of exceptions and control charts on a web report.

Any help, comments or contribution offering are very welcome.
ReplyDelete
Replies
RaGeJanuary 13, 2012
Hi Igor, I've been trying to replicate this and am having some trouble. Do you actually compute the Standard Deviation while constructing the Data Cube? or are you using just the mean to calculate the UCL?
ReplyDelete
Replies
Tim BrowningJanuary 20, 2012
Greetings Noble Igor - I notice in your SQL code you are comparing last 7 days to the last 180 days (where the larger is your basis for a reference set). I think you should not include the last 7 days in the 180 day set because: (1) there could be outliers in recent data that will affect means and std, and (2) by including it you are comparing it to itself as a subset of the larger data (like autocorrelation).
ReplyDelete
Replies
Tim BrowningJanuary 20, 2012
Change baseline select to something like this:

WHERE DATE > (CURRENT DATE - 7 DAYS) - 180 DAYS

If you have identified and kept outliers in a separate table then:

WHERE DATE > (CURRENT DATE - 7 DAYS)-180 DAYS AND
DATE NOT IN
(Select DATE from OUTLIER_TABLE
WHERE DATE > CURRENT DATE - 180 DAYS)
ReplyDelete
Replies
Shoreline AnalyticsFebruary 19, 2020
This is really nice post, I found and love this content. I will prefer this, thanks for sharing. Data Cleaning Service.
ReplyDelete
Replies
hijaz shaikhJune 15, 2020
I wanted to thank you for this excellent read!! I definitely loved every little bit of it. I have you bookmarked your site to check out the new stuff you post. Lebensmittel
ReplyDelete
Replies
AlexJuly 08, 2020
Hello I am so delighted I located your blog, I really located you by mistake, while I was watching on google for something else, Anyways I am here now and could just like to say thank for a tremendous post and a all round entertaining website. Please do keep up the great work. survey data entry
ReplyDelete
Replies
MichaelseoJuly 14, 2020
Your writing is fine and gives food for thought. I hope that I’ll have more time to read your articles . Regards. I wish you that you frequently publish new texts and invite you to greet me data entry bookkeeping
ReplyDelete
Replies
shahzadkhatriJuly 17, 2020
This comment has been removed by a blog administrator.
ReplyDelete
Replies
m.aliJuly 24, 2020
This comment has been removed by a blog administrator.
ReplyDelete
Replies
awaisJuly 25, 2020
Nice post! This is a very nice blog that I will definitively come back to more times this year! Thanks for informative post. data entry bookkeeping
ReplyDelete
Replies
sameerAugust 05, 2020
One in the types is information systems that generally programmed on a mainframe, minicomputer, microcomputer or personal computer. Benefits of Data Entry Outsourcing - Outsourcing give benefits you financially and also strategically. bookkeeping data entry
ReplyDelete
Replies
sameerAugust 05, 2020
One in the types is information systems that generally programmed on a mainframe, minicomputer, microcomputer or personal computer. Benefits of Data Entry Outsourcing - Outsourcing give benefits you financially and also strategically. bookkeeping data entry
ReplyDelete
Replies
D. James AndersonAugust 14, 2020
So actually, when you join and pay the enrollment charge for these Data Entry Projects, you will get the preparation materials that will show you how to type more 'data entry advertisements' and persuade others to do something very similar.database data entry services
ReplyDelete
Replies
LewisAugust 28, 2020
Despite technological improvements, business organizations still rely on various data entry systems and data entry vendors accuracy is vital in such cases. Bad data may come from various sources and it is important to sort your data before any entry operations.
ReplyDelete
Replies
Elon MuskSeptember 06, 2020
Wow, What a Excellent post. I really found this to much informatics. It is what i was searching for.I would like to suggest you that please keep sharing such type of info.Thanks here
ReplyDelete
Replies
shane leeSeptember 07, 2020
Many industries use data science in order to automate different tasks. Businesses use historical data for training their machines to do repetitive tasks. And this is what simplifies arduous jobs done by humans a few years back. data science course in hyderabad
ReplyDelete
Replies
Mubeen Ali ServicesSeptember 10, 2020
A debt of gratitude is in order for sharing the information, keep doing awesome... I truly delighted in investigating your site. great asset... receipt data entry
ReplyDelete
Replies
ADMINSeptember 10, 2020
Positive site, where did u come up with the information on this posting?I have read a few of the articles on your website now, and I really like your style. Thanks a million and please keep up the effective work. data entry ecommerce
ReplyDelete
Replies
Willeum BruchSeptember 17, 2020
This substance is composed exceptionally well. Your utilization of arranging while mentioning your focuses makes your objective facts clear and straightforward. Much obliged to you. images data entry
ReplyDelete
Replies
zohaib khatriOctober 08, 2020
it's really cool blog. Linking is very useful thing.you have really helped crisis management training
ReplyDelete
Replies
Top SEOOctober 14, 2020
Thanks for a very interesting blog. What else may I get that kind of info written in such a perfect approach? I’ve a undertaking that I am simply now operating on, and I have been at the look out for such info. Best screw driver kit online
ReplyDelete
Replies
seostar2October 20, 2020
Invitation Management: Comprehensive, easy to use integrated invitation management tool, Internet of things tech events
ReplyDelete
Replies
Harry jackOctober 28, 2020
We exceptionally demoralize any programming for client ventures as it nullifies the point of an out-of-the case arrangement. We encourage clients to move toward any programming increments with alert. besimple.com/
ReplyDelete
Replies
AshokNovember 04, 2020
Excellent Blog! I would like to thank for the efforts you have made in writing this post. I am hoping the same best work from you in the future as well. I wanted to thank you for this websites! Thanks for sharing. Great websites!

data science course in India
ReplyDelete
Replies
AlexfraNovember 15, 2020
You have outdone yourself this time. It is probably the best, most short step by step guide that I have ever seen. Legacy data archiving
ReplyDelete
Replies
AshokNovember 16, 2020
Wow! Such an amazing and helpful post this is. I really really love it. It's so good and so awesome. I am just amazed. I hope that you continue to do your work like this in the future also.
Artificial Intelligence Course
ReplyDelete
Replies
logan pualMarch 15, 2021
A Non-Disclosure Agreement (NDA) will be marked or Corporate and Business Clients to guarantee privacy of information from accommodation of media to information conveyance.data-recovery-tips.co.uk
ReplyDelete
Replies
sameer aliMay 02, 2021
If you have just a few people to verify, the online option is the better choice. If you need to verify a large quantity, you'll need to create a file and mail it in.visit website
ReplyDelete
Replies
styenMay 03, 2021
E-learning permits offering types of assistance remotely. E-banking permits customers to utilize bank's services at whatever point they need without visiting bank's office. IT company Hamilton
ReplyDelete
Replies
naginamirliquatMay 19, 2021
I just want to let you know that I just check out your site and I find it very interesting and informative.. The Best Remote Team Management Tool
ReplyDelete
Replies
jessicasteveFebruary 03, 2022
You bear through a awesome vacancy. I sanity definitely quarry it moreover personally suggest to my buddys. I am self-possessed they determination be benefited from this scene. spss data analysis help
ReplyDelete
Replies
Ali janMarch 03, 2022
that critical time, the software sends to one of your marketing executive an alert for such a prospect who, from there on, can take predefined steps to ensure the prospect turns into a conversion. www.updigital.ca
ReplyDelete
Replies
ahmetDecember 16, 2022
instagram takipçi satın al
casino siteleri
MV30Q
ReplyDelete
Replies
emreJuly 10, 2023
çekmeköy
kepez
manavgat
milas
balıkesir
SS1
ReplyDelete
Replies
ırmakJuly 28, 2023
bayrampaşa
güngören
hakkari
izmit
kumluca
8QVYA
ReplyDelete
Replies
MücevherJuly 30, 2023
yurtdışı kargo
resimli magnet
instagram takipçi satın al
yurtdışı kargo
sms onay
dijital kartvizit
dijital kartvizit
https://nobetci-eczane.org/
VHO
ReplyDelete
Replies
DigitsDiscoverer101October 27, 2023
https://istanbulolala.biz/
5DJF
ReplyDelete
Replies
SpaceKorsanı95October 31, 2023
muş evden eve nakliyat
çanakkale evden eve nakliyat
uşak evden eve nakliyat
ardahan evden eve nakliyat
eskişehir evden eve nakliyat
JHPY
ReplyDelete
Replies
StardustSorcerer9November 01, 2023
muş evden eve nakliyat
çanakkale evden eve nakliyat
uşak evden eve nakliyat
ardahan evden eve nakliyat
eskişehir evden eve nakliyat
WX8M
ReplyDelete
Replies
MysticSorcerer9November 02, 2023
urfa evden eve nakliyat
malatya evden eve nakliyat
burdur evden eve nakliyat
kırıkkale evden eve nakliyat
kars evden eve nakliyat
GOWHV6
ReplyDelete
Replies
51598Stanley86AF5November 13, 2023
ADD67
Muğla Lojistik
Denizli Parça Eşya Taşıma
Vindax Güvenilir mi
Tekirdağ Boya Ustası
Hakkari Evden Eve Nakliyat
Silivri Fayans Ustası
Antalya Rent A Car
Kırıkkale Şehirler Arası Nakliyat
İzmir Evden Eve Nakliyat
ReplyDelete
Replies
EF8B1RichardB7C9ANovember 14, 2023
FB4CD
Gümüşhane Lojistik
Rize Lojistik
Bursa Şehir İçi Nakliyat
Zonguldak Evden Eve Nakliyat
Sivas Parça Eşya Taşıma
Aksaray Lojistik
Afyon Evden Eve Nakliyat
Siirt Parça Eşya Taşıma
Muş Şehir İçi Nakliyat
ReplyDelete
Replies
7B65FKenyaC692BDecember 15, 2023
42A20
resimli magnet
binance referans kodu
binance referans kodu
binance referans kodu
resimli magnet
binance referans kodu
referans kimliği nedir
referans kimliği nedir
resimli magnet
ReplyDelete
Replies
TakipciFebruary 18, 2024
77145
mercatox
mexc
kraken
paribu
kripto para telegram
kucoin
canlı sohbet uygulamaları
binance
telegram kripto grupları
ReplyDelete
Replies
____TakipciFebruary 23, 2024
BB3D3
btcturk
mobil 4g proxy
mercatox
canlı sohbet siteleri
filtre kağıdı
papaya
bitexen
huobi
referans kimliği
ReplyDelete
Replies

Add comment

Popular Post

_

Thursday, September 22, 2011

One Example of BIRT Data Cubes Usage for Performance Data Analysis

48 comments: