• No results found

Investigation of Pathway Analysis Tools for mapping omics data to pathways

N/A
N/A
Protected

Academic year: 2021

Share "Investigation of Pathway Analysis Tools for mapping omics data to pathways"

Copied!
53
0
0

Loading.... (view fulltext now)

Full text

(1)

Degree: Bachelor of Computer Science 180hp Supervisor(s): Céline Fernandez,

Major: Information Systems Annabella Loconsole

Tech n ology a n d s ociet y Com p u t er S cien ce

In vest iga t ion of Pa t h wa y An a lysis

Tools for m a ppin g om ics da t a t o

pa t h wa ys

-Focu s on l i p i d om i cs a n d gen om i cs d a t a

Un der sök n in g a v a n a lysver k t yg för a t t k a r t lä gga om ik da t a

t ill r ela t ion svä ga r

– F ok u s p å d a t a a v t y p en l i p i d om i k och g en om i k

(2)

A n ed u cation isn 't h ow m u ch you h ave com m itted to m em ory,

or even h ow m u ch you k n ow . It's bein g able to d ifferen tiate

betw een w h at you k n ow an d w h at you d on 't.

/An a t ole F r a n ce

Ac k n o w le d g e m e n t s

I wou ld lik e sa y t h a n k you t o ever yon e wh o h elped m e wit h m y t h esis. To m y su per visor s I t h a n k you for you r pa t ien ce, gu ida n ce a n d a ll t h e good feedba cks.

(3)

Abst r a ct

Th is t h esis exa m in es P ATs fr om a m u lt idisciplin a r y view. Th er e a r e a lot of P AT's exist in g t oda y a n a lyzin g specific t ype of om ics da t a , t h er efor e we in vest iga t e t h em a n d wh a t t h ey ca n do. By defin in g som e specific r equ ir em en t s su ch a s h ow m a n y om ics da t a t ypes it ca n h a n dle, t h e a ccu r a cy of t h e P AT ca n be obt a in ed t o get t h e m ost su it a ble P AT wh en it com es t o m a ppin g om ics da t a t o pa t h wa ys . Resu lt s sh ow t h a t n o P ATs fou n d t oda y fu lfills t h e specific set of r equ ir em en t s or t h e m a in goa l t h ou gh soft wa r e t est in g. Th e In gen u it y P AT is t h e closest t o fu lfill t h e r equ ir em en t s . Requ est ed by t h e en d u ser , t wo P ATs a r e t est ed in com bin a t ion t o see if t h ese ca n fu lfill t h e r equ ir em en t s of t h e en d u ser . U n ipr ot ba t ch con ver t er wa s t est ed wit h F E vE R a n d r esu lt s did n ot t u r n ou t su ccessfu lly sin ce t h e com bin a t ion of t h e t wo P ATs is n o bet t er t h a n t h e In gen u it y P AT. F ocu s t h en t u r n ed t o a n a lt er n a t ive com bin a t ion , a h om epa ge ca lled N CBI t h a t h a ve sea r ch en gin es con n ect ed t o sever a l fr ee P ATs a va ila ble t h u s fu lfillin g t h e r equ ir em en t s. Th r ou gh t h e sea r ch en gin e “om ics” da t a ca n be com bin ed a n d m or e t h a n on e in pu t ca n be t a k en a t a t im e. Sin ce t ech n ology is r a pidly m ovin g for wa r d , t h e n eed for n ew t ools for da t a in t er pr et a t ion a lso gr ows. It m ea n s t h a t in a n ea r fu t u r e we m a y be a ble t o fin d a P AT t h a t fu lfills t h e r equ ir em en t s of t h e en d u ser s.

Ke y w o rd s : Bioch em ist r y, Ca r diova scu la r disea se, Da t a ba se, Gen om ics, Lipids,

Lipidom ics, Met a bolom ics, P AT, Tech n ology

Sa m m a n fa t t n in g

Det t a exa m en sa r bet e gr a n sk a r a n a lysver kt yg u r et t t vä r vet en sk a pligt per spek t iv. Det fin n s en h el del olik a a n a lysver kt yg ida g som a n a lyser a r specifik a t yper a v om ik da t a och dä r för u n der sök er vi h u r m å n ga det fin n s sa m t va d de k a n gör a . Gen om a t t defin ier a et t a n t a l specifik a k r a v så som h u r m å n ga t yper a v om ik da t a den k a n h a n t er a , n oggr a n n h et a v ver k t yget s a n a lys så k a n m a n se vilk a som ä r m est lä m pliga a n a lysver kt ygen n ä r det gä ller k a r t lä ggn in g a v om ik da t a . Resu lt a t en visa r a t t det ida g in t e fin n s a n a lysver k t yg som u ppfyller de specifik t a n givn a k r a ven eller h u vu dsyft et gen om t est n in g a v pr ogr a m va r a n . In gen u it y a n a lysver kt yget ä r det n ä r m a st e vi k a n k om m a för de k r a v som vi sök er . P å begä r a n a v slu t a n vä n da r en t est a des t vå a n a lysver kt yg för a t t se om en k om bin a t ion a v dessa k a n u ppfylla slu t a n vä n da r en s k r a v. An a lysver k t yget U n ipr ot ba t ch con ver t er t est a s m ed F E vE R m en r esu lt a t ä r in t e fr a m gå n gsr ikt , då k om bin a t ion en a v dessa ver kt yg in t e ä r bä t t r e ä n In gen u it y a n a lysver kt yget . F ok u s vä n ds m ot en a lt er n a t iv k om bin a t ion som ä r en h em sida och h et er N CBI. H em sida n h a r en sök m ot or k oppla d t ill fler a olik a a n a lysver kt yg som ä r gr a t is a t t a n vä n da . Gen om sök m ot or n k a n ”om ik ” da t a k om bin er a s och m er ä n et t in m a t a t vä r de k a n h a n t er a s i t a get . E ft er som t ek n ik en sn a bbt gå r fr a m å t in n ebä r det dä r em ot a t t n ya a n a lysver k t yg beh övs för da t a h a n t er in g och in om en sn a r fr a m t id så h a r vi k a n sk e et t a n a lysver kt yg som u ppfyller k r a ven a v slu t a n vä n da r n a .

N y c k e lo rd : Biok em i, Ka r diova sk u lä r sju k dom , Da t a ba s, Gen om ik, L ipider , Lipidom ik,

(4)

Co n t e n t s

Abst r a ct ... 3

Sa m m a n fa t t n in g ... 3

1. In t r odu ct ion ... 5

1.1. P u r pose ... 5

1.2. P r oblem defin it ion s a n d Aim s ... 6

1.3. P r oblem discu ssion ... 6

1.4. Rela t ed wor k wit h P AT ... 8

2. Met h ods ... 8

2.1. Model in u se ... 8

2.1.1. R equ irem en t collection , d ocu m en tation an d valid ation ... 8

2.1.2. R equ irem en t processin g an d test case creation ... 9

2.1.3. Objective ... 12

2.1.4. U n d erlyin g objectives ... 12

2.2. Alt er n a t ive r esea r ch m et h ods ... 13

3. Biom edica l backgr ou nd ... 13

3.1. Gen et ics ... 13

3.1.1. Gen e ... 14

3.1.2. S N P ... 15

3.2. Bioch em ist r y of Lipids ... 16

3.2.1. L ipid d efin ition ... 17

3.2.2. Classes of L ipid s ... 17

3.2.3. E n zym es in volved in th e syn th esis of lipid s ... 18

3.2.4. L ipoprotein s ... 21

3.3. Gen om ics ... 21

3.4. Met a bolom ics ... 22

3.5. Lipidom ics ... 22

3.6. Ca r diova scu la r disea ses ... 22

4. Com pu t er Scien ce ba ckgr ou n d ... 23

4.1. Da t a ba ses, Da t a m in in g a n d Kn owledge discover y ... 23

4.2. P AT ... 23

5. Requ ir em ent s a n d Test elicit at ion ... 24

5.1. Requ ir em en t s ... 24 5.2. Test in g ... 25 5.3. Test ca ses ... 26 6. Resu lt ... 26 6.1. F in din g t h e P ATs ... 26 6.2. Sor t in g t h e P ATs ... 27 6.3. Test in g t h e P ATs ... 27 6.4. E va lu a t in g t h e P ATs ... 28

6.5. F in a l eva lu a t ion of t h e P ATs ... 28

6.6. Th e best P AT fr om t h e r a n k ed list ... 29

6.7. Com bin in g P ATs ... 31

6.8. F u n ct ion a lit ies ... 34

6.9. Qu a lit y ... 35

7. Discu ssion ... 36

7.1. Is it possible t o fin d a P AT t h a t pr ocesses m et a bolom ics a n d lipidom ics r a w da t a a s in pu t a n d com bin e t h em wit h gen et ic in for m a t ion ? ... 36

7.2. Wh a t a r e t h e fu n ct ion a lit ies offer ed by t h e a va ila ble a n a lysis t ools? ... 36

(5)

7.4. Wh y n ot In gen u it y a n d wh y U n ipr ot wit h F E vE R? ... 38

8. F u t u r e Va lu e ... 38

9. Refer en ces ... 38

Appen dix 1 – Test Ca ses ... 42

Appen dix 2 – Lipid, MI SN P a n d Met a bo SN P da t a sh eet ... 46

Appen dix 3 – Requ ir em en t s Ma t r ixes ... 50

Appen dix 4 – Respon s Tim es ... 53

1. In t r odu ct ion

Va st a m ou n t s of r esea r ch is don e in lipidom ics a n d gen om ics, m a kin g com pu t er s, In t er n et a n d va r iou s a n a lysis t ool's ver y com m on t oda y bot h in sim ple a n d a dva n ced for m s. As a n exa m ple a sim ple ca lcu la t ion ca n be per for m ed on on e com pu t er a n d t r a n sfer r ed or copied t o a n ot h er if n eeded. Mor e a dva n ced per for m a n ces som et im es r equ ir e a soft wa r e t ool t h a t ca n per for m a cer t a in t a sk on a given set of da t a in or der t o give a cer t a in r esu lt . Th e r esu lt is in t u r n u su a lly n ot logica lly or der ed a n d a visu a l pr esen t a t ion is n eeded. Th is is wh er e a pa t h wa y a n a lysis t ool (P AT) is n eeded. A pa t h wa y a n a lysis t ool (P AT) is a n a dva n ced t ool t h a t pr ocess es given da t a , com pa r es t h e given da t a wit h st or ed da t a in a da t a ba se a n d pr esen t s t h e r esu lt s obt a in ed visu a lly. A com pa n y t en ds t o h ir e a pr ogr a m m er t o develop a pa t h wa y a n a lysis t ool (P AT) in or der t o in t egr a t e it wit h in t h e or ga n iza t ion [34]. On e of t h e m a in gr ou p s of scien t ific u ser s is t h e gr ou p of r esea r ch er s in fields of bioin for m a t ics, gen et ics, gen om ics a n d m et a b olom ics. Resea r ch er s a r e depen den t of t h ese pa t h wa y a n a lysis t ools in t h eir scien t ific wor k. In som e scien t ific fields su ch a s gen om ics a n d m et a bolom ics, t h er e a r e t oo m a n y a n a lysis t ools (P AT), doin g a ll kin ds of differ en t t a sks. Too m a n y pa t h wa y a n a lysis t ools in a specific field ca n con fu se r esea r ch er s wh o do n ot h a ve en ou gh kn owledge in t ech n ology [5]. Th is m a kes it difficu lt t o decide wh a t pa t h wa y a n a lysis t ools a r e su it ed for cer t a in da t a a n d wit h in wh a t scien t ific field. Sin ce t ech n ology is a lso m ovin g for wa r d ext r em ely fa st , people wit h m u lt idisciplin a r y kn owledge a r e n eeded m or e a n d m or e [20]. F or r esea r ch er s wh o wor k wit h in t h e biom edica l field of m et a bolom ics a n d gen om ics t h er e a r e specific a n a lysis t ools. Th e pu r poses of t h ese pa t h wa y a n a lysis t ools (P AT) a r e t o h elp t h e u ser s in t h eir wor k, wh er e t h ey ca n visu a lize da t a t h a t m a y lea d t o n ew scien t ific discover y. Tech n ology a n d in for m a t ion sh a r in g h a s t a ken a big st ep for wa r d a n d h a s h elped su bst a n t ia lly in differ en t a r ea s a r ou n d t h e wor ld su ch a s in h ea lt h ca r e a n d m edicin e.

1.1.P u rp o s e

F in din g r elia ble pa t h wa y a n a lysis t ools (will be r efer r ed t o a s P AT fr om n ow on ) t h a t ca n do a ll t h e n ecessa r y da t a com pu t a t ion s a n d ca n visu a lly pr esen t t h e r esu lt s is r equ est ed by Célin e F er n a n dez fr om Clin ica l Resea r ch Cen t er (CRC) in Ma lm ö (will be r efer r ed t o a s t h e en d u ser ). CRC wor k s in discover in g n ew m edicin e, dia gn ost ic t ools a n d im pr oved t r ea t m en t s in or der t o im pr ove h ea lt h wor ldwide.

(6)

1.2.P ro b le m d e fi n i t i o n s a n d Ai m s

Sin ce t h er e a r e m a n y P AT a va ila ble wit h lot s of in for m a t ion , t h e followin g r esea r ch qu est ion s a r e defin ed in t h is t h esis:

 Is it possible t o fin d a P AT t h a t pr ocesses m et a bolom ics a n d lipidom ics r a w da t a a s in pu t a n d com bin es t h em wit h gen et ic in for m a t ion ?

 Wh a t a r e t h e fu n ct ion a lit ies offer ed by t h e a va ila ble a n a lysis t ools?

 Wh a t a r e t h e qu a lit ies of t h ese t ool's a n d h ow t o eva lu a t e t h em ? Th e object ives a r e defin ed in or der t o h elp a n swer t h e t h r ee r esea r ch qu est ion s. Th e m a in a im of t h is t h esis is t h e followin g:

To fin d a P AT t h a t ca n pr ocess a com bin a t ion of da t a in pu t s wit h t h e t ype of “om ics” da t a , i.e. lipidom ics/m et a bolom ics, genom ics da t a .

In or der t o r ea ch t h e m a in pu r pose, sever a l u n der lyin g object ives a r e n eeded. Th ese a r e t h e followin g:

1) F in d P ATs t h a t a r e a ble t o m a p pa t h wa ys of t h e followin g t ype of da t a : a ) Over a ll m et a bolom ics da t a

b) Lipidom ics da t a c) Gen om ics da t a

2) E va lu a t e t h e select ed P AT a n d t h eir fu n ct ion s. Test t h e cu r r en t a ccu r a cy of t h e exist in g P AT in or der t o a n swer if t h e ou t pu t fr om t h ese t ools sh ows t h e “cor r ect ” r esu lt s.

3) E va lu a t e t h e select ed P AT a ccor din g t o specific r equ ir em en t s given by t h e en d u ser ; see sect ion 1.3 for t h e specific r equ ir em en t s.

Aft er t h e eva lu a t ion of t h e P AT a ccor din g t o r equ ir em en t s, t wo opt ion s a r e possible:

Opt ion 1: On e or m or e P AT pa sses st eps 2 a n d 3 a n d is deliver ed t o t h e en d u ser .

Opt ion 2: If n o P AT fu lfillin g t h e r equ ir em en t s is fou n d. Alt er n a t ive solu t ion s will be t o see if it is possible t o a da pt a n y of t h e eva lu a t ed a n a lysis t ools, com bin e m or e t h a n on e or m a ke a n in h ou se developm en t of a P AT m eet in g t h e r equ ir em en t s of t h e en d u ser .

1.3.P ro b le m d i s c u s s i o n

In or der t o solve t h e pr oblem we m u st con sider wh a t P ATs a r e, h ow com plex t h ey a r e a n d wh a t t h ey ca n do. T h e fu n ct ion a lit ies of t h e P AT n eed t o be t est ed [28] t o see if t h ey fu lfill t h e specific r equ ir em en t s (S ee T able 1).

(7)

Ta b le 1. 8 specific r equ ir em en t s list ed t h a t n eeds t o be fu lfilled by a P AT. R e q u i re m e n t

ID

R e q u i re m e n t d e s c ri p t i o n

1 U ser is a ble t o see a n d select on t h e P AT wh a t t ype of da t a it m u st pr ocess (if t h e in pu t field is for m et a bolom ic, lipidom ic or gen om ic)

2 U ser m u st be a ble t o con t r ol if obt a in ed r esu lt is va lid fr om t h e P AT a ccor din g t o lit er a t u r e, In t er n et or la bor a t or y r esu lt s

3 Th e u ser m u st r eceive r esu lt s by t h e P AT wit h in a cer t a in t im e

4 Th e u ser ca n n a viga t e bet ween st a r t of sea r ch (in pu t da t a ) t o t h e en d of sea r ch (r esu lt s obt a in ed).

5 Th e u ser ca n get a visu a l pr esen t a t ion of m et a bolom ics, lipidom ics a n d gen om ics da t a fr om t h e P AT

6 Th e u ser ca n zoom in a n d ou t expa n din g t h e view t o n eigh bor in g possible r esu lt s t o see con n ect ed pa t h wa ys on t h e r eceived r esu lt s fr om t h e P AT.

7 Th e u ser ca n in pu t a specific t ype of da t a in t o t h e P AT (m et a bolom ic, lipidom ic or gen om ic)

8 Th e u ser ca n in pu t com bin ed om ics da t a a n d t h en m a p t h em t o pa t h wa ys

Acqu ir in g kn owledge fr om lit er a t u r e gives u s in for m a t ion a bou t t h e com plexit y of a P AT [27]. Th e fu n ct ion a lit ies fr om a P AT ca n be obt a in ed wit h h elp of soft wa r e t est in g of da t a in pu t s [9] a n d t h is wa y we ca n ch eck if t h e P AT sa t isfy t h e r equ ir em en t s of t h e pot en t ia l u ser s. Th e defin it ion of qu a lit y is of a bigger sca le a n d h a r der t o defin e sin ce qu a lit y h a s differ en t m ea n in gs t o differ en t people [36]. Th e qu a lit ies of t h e P AT a r e a ccept a ble if t h ey a r e fu lfillin g a ll t h e r equ ir em en t s [36] a ccor din g t o a set of r equ ir em en t specifica t ion s. We will be u sin g t h e r equ ir em en t specifica t ion s a ccor din g t o t a ble 1. H om epa ges a ssocia t ed wit h P AT a lso n eed t o be qu a lit y ch ecked a n d five select ed a spect s a r e u sed: A ccu racy an d Correctn ess (h ow t r u st wor t h y is t h e in for m a t ion pr ovided on t h e h om epa ges), Com pleten ess (a r e t h e h om epa ges com plet e or u n der con st r u ct ion ), R elevan ce (h ow r eleva n t is con t en t or in for m a t ion on a h om epa ge t o t h e P AT), T im e an d Pu n ctu ality (h ow fa st ca n a h om epa ge be fou n d wh en sea r ch in g), T raceability (is t h e in for m a t ion pr ovided on t h e h om epa ges t r a cea ble t o t h eir or igin a l sou r ce).

(8)

1.4.R e la t e d w o rk w it h P AT

Most P AT t oda y is m a de specifica lly wit h focu s on m et a bolom ics a n d gen om ics. Th is is du e t o t h e r esea r ch wor k in m et a bolic en gin eer in g, cellu la r m et a bolism a n d in t oxic gen om ics [16, 25]. Com pa n ies spen d va st a m ou n t s of m on ey developin g a P AT wh ile t r yin g t o com pet e wit h ea ch ot h er [8, 15]. Th e com pet it ion for t h e com pa n ies in volves bu ildin g, a da pt in g a n d eva lu a t in g ea ch ot h er 's P AT, t ellin g wh y t h eir P AT is bet t er t h a n t h e ot h er [8, 15, 33]. Sin ce t h e P AT is specifica lly developed for a biom edica l field [41], t h er e exist s n o fu ll-sca le a n a lysis on t h e en t ir e P AT yet . Ou r st u dy is a fir st a t t em pt a t su ch a n a n a lysis of a com plet e set of a ll P AT.

2. Met h ods

Th is sect ion descr ibes t h e scien t ific m et h ods u sed t o eva lu a t e t h e differ en t P AT. St a r t in g wit h t h e select ed m et h od in u se, h ow t h e in for m a t ion is ga t h er ed a n d det a ils on t h e object ives a n d u n der lyin g object ives.

2.1.Mo d e l i n u s e

Th e m a in pu r pose (t o fin d a P AT t h a t ca n pr ocess a com bin a t ion of da t a input s wit h t he t ype of “om ics” da t a , i.e. lipidom ics/m et abolomics, genom ics da t a ) of t h e pr oject wa s divided in t o fou r u n der lyin g object ive, ea ch wit h it s specific object ive. Met h ods t h a t will be per for m ed a r e ba sed on a n em pir ica l m odel wit h a st u dy on P AT in or der t o t est a n d a n a lyze ea ch of t h e P AT a n d t h eir h om epa ges. Test ca ses a r e design ed ba sed on t h e r equ ir em en t s fr om t h e en d u ser a t t h e fir st in t er view. Th e r equ ir em en t s a r e r ech ecked a few weeks la t er wit h t h e en d u ser in a secon d in t er view. On ce a ckn owledged, t h e soft wa r e t est in g begin s wit h r equ ir em en t s a n d t est ca ses, in or der t o see if in goin g da t a m a t ch es t h e ou t com in g da t a of t h e P AT. Da t a is ba sed on a gen e n a m e (e.g. N P P A), r efer en ce SN P a ccession ID (r s n u m ber su ch a s r s5068) or a lipid cla ss n a m e (su ch a s lipopr ot ein s). Ver ifica t ion (fr om t h e P AT) of t h e ou t com in g da t a t o see if it s r eleva n t is per for m ed by com pa r in g t h e r eceived r esu lt s wit h in for m a t ion fou n d in lit er a t u r e. A r a n ked list is m a de r a n kin g t h e best P AT fir st , ba sed on h ow m a n y r equ ir em en t s a r e m et . If n o P AT m eet s a ll t h e r equ ir em en t s, t h e en d u ser h a ve a r equ est t o a da pt or com bin e 2 specifica lly select ed t ools, wh ich en d u ser is a lr ea dy fa m ilia r ized wit h , wh ile t h e r a n ked list get s disca r ded .

2.1.1. R equ irem en t collection , d ocu m en tation an d valid ation

F ive m eet in gs a r e booked a t t h e Clin ica l Resea r ch Cen t er (CRC) in or der t o m a ke in t er views. All pa r t icipa n t s (r esea r ch er s in clu din g t h e en d u ser ) a r e goin g t o discu ss a bou t t h e pr oblem t h a t n eeds t o be solved. Discu ssion will focu s on P AT in gen er a l a n d specific fu n ct ion s a r e goin g t o be desir ed by t h e r esea r ch er s t h a t h a ve t o be on a P AT. Requ ir em en t s a r e m a de con n ect ed t o t h ese fu n ct ion s on a P AT a n d a n ew m eet in g is booked . Du r in g ea ch m eet in g ever yt h in g is wr it t en down a n d docu m en t ed. Aft er ea ch m eet in g,

(9)

r equ ir em en t s a r e collect ed t o be sor t ed a n d pr ocessed in or der t o m a ke t est ca ses. La t er a ch ecku p t a kes p la ce a t sa m e pla ce, t o see if ever yt h in g is on t h e r igh t t r a ck .

2.1.2. R equ irem en t processin g an d test case creation

Th e r equ ir em en t s a r e pr ocessed a n d for m u la t ed. Th ey a r e a lso sh or t en ed down fr om 15 t o eigh t r equ ir em en t s wit h t h e m ost im por t a n t t h in gs t h a t a P AT m u st do. E a ch of t h e r equ ir em en t s is given a n iden t ifica t ion n u m ber . Test ca se t em pla t es a r e sou gh t a n d on e t em pla t e is select ed, down loa ded a n d t h en cu st om ized (Fig 1). Specific t est ca ses a r e design ed t o su it t h e r equ ir em en t s a n d lin kin g t h em t o t h eir r espect ive r equ ir em en t (S ee T able

2). Th e design s of t h e specific t est ca ses a r e m a de by a ddin g t h e goa l of t h e

t est a lon g wit h t h e even t s t o a ch ieve t h e goa l. La st ly t h e expect ed r espon se is wr it t en , descr ibin g wh a t r esu lt s we sh ou ld expect by followin g t h e even t s. Th e wh ole pr ocess st a r t s by ca r efu lly ch eckin g a r equ ir em en t fr om t h e list a n d t r yin g t o see if t h ey ca n be m a de a s a sin gle t est ca se in on e go. If t h a t is n ot possible sever a l t est ca ses a r e n eed ed. If we look a t fir st r equ ir em en t in t a ble 1 a bove, we see t h a t 3 differ en t da t a t ypes n eed t o be t est ed. So we h a ve t o split t h e r equ ir em en t in t o m or e t h a n 1 t est ca se sin ce a ll P AT m a y n ot be a ble t o pr ocess a ll 3 da t a t ypes . We decide t o t a ke t h e fir st da t a t ype wh ich is for m et a bolom ic in pu t da t a . We a lso select a da t a in pu t t h a t we kn ow sh ou ld give a r espon se a n d pr esen t som e r esu lt s. F r om t h is we ca n wr it e down ou r even t s in t h e t est ca se by h a vin g a n in pu t a n d t h en get t in g a r espon se. So we ca n t h en a lso st a t e t h e expect ed r espon se. In ou r ca se it is t h a t t h e m et a bolom ic da t a t ype gives da t a in for m a t ion r ela t ed t o ou r da t a in pu t t h a t we m a de. N ext 2 t est ca ses will be sim ila r wit h t h e sm a ll differ en ce of h a vin g a differ en t in pu t da t a t ype. Sa m e a ppr oa ch m et h od is a pplied t o t h e r est of t h e t est ca ses. Requ ir em en t s a r e goin g t o be ch ecked, eva lu a t ed if it ca n be m a de a s on e t est or split t in g t h em in t o m or e t est ca ses for sa m e r equ ir em en t , wr it in g t h e even t s a n d t h e expect ed r espon se.

(10)
(11)

Ta b le 2. A t a ble sh owin g r equ ir em en t ID wit h descr ipt ion lin ked t o specific Test Ca se ID ID R e q u i re m e n t d e s c ri p t i o n Ty p e Li n k e d w i t h Te s t Ca s e ID

1 U ser is a ble t o see a n d select on t h e P AT wh a t t ype of da t a it m u st pr ocess (if t h e in pu t field is for m et a bolom ic, lipidom ic or gen om ic)

F u n c t i o n a l 1, 2 a n d 3 2 U ser m u st be a ble t o ch eck if t h e r esu lt s obt a in ed is va lid fr om t h e P AT a ccor din g t o lit er a t u r e or la bor a t or y r esu lt s N o n fu n c t i o n a l 4 3 Th e u ser m u st r eceive r esu lt s by t h e P AT wit h in a cer t a in t im e N o n fu n c t i o n a l 5 4 Th e u ser sh ou ld n a viga t e bet ween st a r t of sea r ch (in pu t da t a ) t o t h e en d of sea r ch (r esu lt s obt a in ed).

N o n

fu n c t i o n a l

6

5 Th e u ser sh ou ld get a visu a l pr esen t a t ion of m et a bolom ics, lipidom ics a n d gen om ics da t a fr om t h e P AT N o n fu n c t i o n a l 7 6 Th e u ser m u st be a ble t o zoom in a n d ou t expa n din g t h e view t o n eigh bor in g possible r esu lt s t o see con n ect ed pa t h wa ys on t h e r eceived r esu lt s fr om t h e P AT. F u n c t i o n a l 8 7 Th e u ser m u st in pu t a specific t ype of da t a in t o t h e P AT (m et a bolom ic, lipidom ic or gen om ic)

F u n c t i o n a l 9

8 Th e u ser m u st be a ble t o in pu t com bin ed om ics da t a a n d t h en m a p t h em t o pa t h wa ys

(12)

2.1.3. Objective

Th e m a in pu r pose is a ch ieved by a cqu ir in g kn owledge fr om lit er a t u r e su ch a s books a n d a r t icles a n d by doin g soft wa r e t est in g. Th e r esu lt s obt a in ed fr om t h e t est s a r e t h a n com pa r ed wit h r equ ir em en t s m a de by t h e pot en t ia l u ser s of t h e P AT.

2.1.4. U n d erlyin g objectives

Object ive 1:

Ga t h er in g of in for m a t ion by sea r ch in g books a n d a r t icles , fin din g lot s of P AT a n d obt a in wh a t da t a it ca n pr ocess . Down loa d P AT if possible t o a n a lyze t h em .

Object ive 2:

E va lu a t e t h e select ed P AT wit h t h eir fu n ct ion s a n d m et h ods by goin g t h r ou gh ea ch t ool, clickin g a r ou n d a n d in pu t t in g da t a . Test ca ses a r e design ed fr om t h e given r equ ir em en t s. Test s on t h e P AT a r e ba sed u pon :

a ) F r om t h e lit er a t u r e kn own m et a bolom ics, lipidom ics, a n d gen et ic pa t h wa ys a n d cor r ela t ion s

b) Com pa r ison bet ween r esu lt s obt a in ed fr om t h e lit er a t u r e a n d fr om t h e P AT

c) Com pa r ison bet ween exist in g la bor a t or y r esu lt s a n d t h e P AT d) H ow lon g it t a kes t o pr ocess da t a by t h e a n a lysis t ool

Cor r ect r esu lt s a r e con sider ed t o be t h ose t h a t com e fr om scien t ific a r t icles, books or la bor a t or y r esu lt s ver ified by scien t ist s. P a t h wa ys a n d cor r ela t ion s wit h m et a bolom ics, lipidom ics, a n d gen et ics a r e t est ed a ga in st lit er a t u r e kn own r esu lt s. Com pa r ison bet ween r esu lt s obt a in ed fr om P AT a ga in st a r t icle a n d book r esu lt s a r e goin g t o be don e fir st , a ft er wa r ds t h e exist in g la bor a t or y r esu lt s. Accu r a cy of t h e P AT a r e a cqu ir ed by t h e ou t pu t da t a a n d r esu lt s will eit h er a ccu r a t ely m a t ch a ll da t a or n ot . A sim ple t im er is u sed t o r ecor d t h e pr ocessin g t im e of a P AT. F in a lly a list of P AT will sh ow wh ich P AT pa ssed, fa iled a n d wh y t h ey fa iled ou r exa m in a t ion .

Object ive 3:

In or der t o h a ve a sa t isfied en d u ser , specific set of r equ ir em en t s a r e n eeded t h a t m u st be fu lfilled wit h a fin a l eva lu a t ion . Requ ir em en t s a r e collect ed a t a n ea r ly st a ge wit h in t er views fr om r esea r ch er s a n d t h e en d u ser wh o a lso r epr esen t ot h er pot en t ia l u ser s. Th e m ost desir ed a n d im por t a n t r equ ir em en t s wer e discu ssed a n d iden t ified t o be t h e followin g: Select ed a n a lysis t ool m u st be a ble t o:

a ) N a viga t e bet ween da t a a n d r esu lt s

b) Ma ke visu a l pr esen t a t ion of obt a in ed m et a bolom ics, lipidom ics or gen om ics da t a

c) H a ve zoom in a n d zoom ou t fu n ct ion s expa n din g t h e view t o n eigh bor in g possible r esu lt s con n ect ed t o pa t h wa ys on t h e r esu lt s obt a in ed

(13)

d) Th e P AT sh ou ld be a ble t o pr ocess m or e t h a n on e t ype of da t a (m et a bolom ic, lipidom ic or gen om ic)

e) Be a ble t o com bin e om ics da t a a n d t h en m a p t h eir pa t h wa ys N a viga t ion will be t est ed by lookin g a t t h e ou t pu t da t a (r esu lt s obt a in ed) t o t h e in goin g da t a (t h e begin n in g of wh er e da t a is in ser t ed). In ser t ion s of da t a a r e m a de in t h e r equ ir ed fields wh ile t r a cea bilit y or clicka ble t r a ckin g views a r e sou gh t wh en obt a in in g r esu lt s. An y visu a l pr esen t a t ion s on obt a in ed r esu lt s a r e a ccept ed bu t det a iled view of pa t h wa y com bin a t ion s a n d cor r ela t ion s a r e pr efer r ed. On ou t pu t da t a zoom fu n ct ion s a r e sou gh t t h a t is a sm a ll m a gn ifyin g gla ss wit h a plu s or m in u s sign in t h e P AT. To t est h ow m a n y t ype of da t a (m et a bolom ic, lipidom ic or gen om ic) t h e P AT ca n pr ocess, on e of ea ch da t a t ype will be select ed. Th r ee da t a t ypes t oget h er (m et a bolom ic, lipidom ic a n d gen om ic t oget h er ) a r e goin g t o be t est ed fir st , t wo da t a t ypes (m et a bolom ic wit h lipidom ic or gen om ic, lipidom ic wit h gen om ic or m et a bolom ic) a r e t est ed secon dly a n d la st ly on e by on e in pu t s of ea ch (m et a bolom ic, lipidom ic, gen om ic). If a P AT pa sses a ll a im s a ft er eva lu a t ion , a ll r esu lt s a n d t est m a t er ia l a r e in t en ded t o be t u r n ed over t o t h e en d u ser . F u r t h er su ppor t will be pr ovided in for m of a n swer in g qu est ion s on specific P AT. Test s on t h e P AT, U n ipr ot a n d F E vE R a r e goin g t o be don e if n o P AT will be fou n d t h a t fu lfill t h e r equ ir em en t s.

2.2.Alt e rn a t i v e re s e a rc h m e t h o d s

Th er e a r e a lt er n a t ive m et h ods t o con du ct t h is st u dy bu t it wou ld in volve wor k in g in a bioch em ist r y la bor a t or y t o obser ve, in t er view a n d obt a in r esu lt s fr om exper im en t s a n d a ft er wa r ds design in g wh ile a lso bu ildin g a com plet e P AT. An ot h er m et h od is t o m a ke a h om epa ge con n ect in g it t owa r ds a P AT t h a t is bein g u sed in t h e la bor a t or y. Met h od select ed in sect ion 3.1 a n d descr ibed m or e in sect ion 4 is bein g don e by r ea son s of get t in g good qu a lit y r esu lt s, t im e sa vin g a n d efficien cy.

3. Biom edica l ba ck gr ou n d

Th is sect ion con t a in s ba ckgr ou n d in for m a t ion n eeded in or der t o u n der st a n d t h e biom edica l pa r t . Ga t h er ed in for m a t ion is a bou t gen et ics, lipids a n d t h eir bioch em ist r y, m et a bolom ics, gen om ics a n d ca r d iova scu la r disea se.

3.1.Ge n e t i c s

Gen et ics is t h e st u dy of gen es wit h t h eir st r u ct u r es, sequ en ces a n d t h eir r ole in h er edit y. It is a wa y t o t r y a n d expla in h ow t h ey wor k, wh a t t h ey a r e a n d wh a t t h ey ca n do [32]. Gen et ics in volve scien t ific st u dies of gen es a n d t h eir effect s lea din g t o va r ia t ion in livin g or ga n ism s [32]. Mea n in g h ow cer t a in t r a it is or con dit ion s a r e bein g pa ssed down fr om on e gen er a t ion t o t h e n ext . Also h ow gen es a r e u n it is of h er edit y t h a t ca r r y in st r u ct ion s for m a kin g pr ot ein s

(14)

t h a t dir ect a ct ivit ies in cells a n d fu n ct ion s of ou r bodies. An exa m ple of fu n ct ion is in h er it ed disor der s lea din g t o disea ses [32]. Disor der s h a ve been det ect ed du e t o t h e la r ge a m ou n t of la bor a t or y exper im en t s a n d t ech n ology a dva n cem en t s, da t a st or in g pr ovide u se of P ATs, t h u s givin g fu n ct ion s t o sea r ch a n d m a t ch gen es wit h ea ch ot h er .

3.1.1. Gen e

Gen es a r e sm a ll m olecu la r u n it is t h a t ca r r y t h e h er edit y of livin g or ga n ism s. Th e gen e h olds t h e in for m a t ion t o bu ild a n d m a in t a in a n or ga n ism . E u ka r yot ic cells h a ve a n u cleu s, wh ich con t a in s t igh t ly pa cked DN A a n d a r e well pr ot ect ed [5]. Th e m a in bu ildin g blocks of a gen e con sist of cova len t ly lin ked n it r ogen ba ses A, T, C a n d G. Th e st r u ct u r es a r e t h en st r en gt h en ed by ca r bon a n d h ydr ogen bon ds. Th is m a kes a sequ en ce t h a t in t h e en d for m s a lon g dou ble h elix DN A ch a in . Th e DN A ch a in is t igh t ly pa cked t oget h er wit h h ist on es, wh ich a r e pr ot ein s, t o for m a n or ga n ized st r u ct u r e. Th e or ga n ized st r u ct u r e is ca lled ch r om osom es [11]. All t h e ch r om osom es a r e well pr ot ect ed wit h in t h e n u cleu s (Fig 2). Th e DN A ch a in in t u r n codes for m a n y fu n ct ion s of livin g or ga n ism s [5]. Gen et ic in for m a t ion a n d t r a it is a lso get s pa ssed on t o t h e offspr in g wh en m a t in g. In ou r gen om e t h er e a r e som e st r u ct u r a l gen es wh ich u pon r ea din g, t ell u s wh a t m a t er ia ls a r e n eeded in or der t o bu ild u p a cell or a n or ga n ism . Th is is ou r gen ot ype. Th e st r u ct u r a l gen es we a r e goin g t o u se a r e det er m in ed in com bin a t ion wit h t h e en vir on m en t a n d t h is is ca lled ou r ph en ot ype. Th e ph en ot ype is a lso a ffect ed by t h e en vir on m en t of ea r lier gen er a t ion s a n d t h is is ca lled epigen et ic [5]. Th ose ph en ot ypes a r e e.g. eye color a n d blood t ype. Th e gen ot ypes a r e iden t ica l in a ll h u m a n in dividu a ls u p t o a bou t 99 per cen t . Rem a in in g 1 per cen t va r y fr om per son t o per son cr ea t in g t h e fea t u r es t h a t m a kes u s a ll u n iqu e. Tin y differ en ces in t h e gen om e sequ en ces dist in gu ish a n in dividu a l fr om a n ot h er [5]. Th e t in y differ en ce on t h e ch a n ges of sin gle ba ses in volves r epr odu ct ion fr om t wo in dividu a ls cr ea t in g a n offspr in g a n d ch a n ges by Sin gle N u cleot ide P olym or ph ism (SN P ) a s m en t ion ed m or e in t ext below. Keepin g t r a ck of t in y differ en ces is h a r d a n d som e of t h ese t in y gen et ic va r ia t ion s a r e im por t a n t du e t o su scept ibilit y t o cer t a in disea ses (like a st h m a , dia bet es, scler osis a n d ca n cer ), u n less you h a ve a n a n a lysis t ool a t you r disposa l [5].

(15)

F i g u re 2. A sch em a t ic pr esen t a t ion of h u m a n DN A a ssem bled in t o a

ch r om osom e.

3.1.2. S N P

SN P is sh or t for Sin gle N u cleot ide P olym or ph ism a n d it is a sequ en ce va r ia t ion in DN A. Th is m ea n s t h a t a n it r ogen ba se is differ en t in a gen e sequ en ce for on e in dividu a l wh ile t h e r est of t h e gen e sequ en ce is st ill sim ila r t o a n ot h er in dividu a l [5]. F or a n exa m ple t h e gen e sequ en ce ATAGGC is a lm ost t h e sa m e a s t h e gen e sequ en ce ATCGGC, h owever , we h a ve a ch a n ge on t h e secon d A t o h a vin g a C in st ea d. Ch a n ges of on e n u cleot ide in t h e sequ en ce of ou r gen es a r e n a m ed Sin gle N u cleot ide P olym or ph ism (SN P ) a n d occu r t h r ou gh ou t t h e wh ole gen om e [3]. Sin gle N u cleot ide P olym or ph ism (SN P ) va r ia t ion s occu r in a ll species, lea din g t o gen et ic va r ia t ion s a n d m a y r esu lt in differ en t ph en ot ype of t h e or ga n ism . In [4] r esea r ch r esu lt s sh ow h ow differ en t ia t ion h a s occu r r ed. Th e gen et ic ch a n ges a r e ba sed on n a t u r a l select ion t o su it t h e m ost fa vor a ble a da pt ion of t h e gen es [3]. Som e of t h ese Sin gle N u cleot ide P olym or ph ism (SN P ) sequ en ces a r e even specific t o a n et h n ic gr ou p wh ile it m a y be m issin g in a n ot h er gr ou p. Accor din g t o [32] bot h t h e codin g a n d t h e n on codin g r egion s of t h e DN A ca n be a ffect ed. Sin gle N u cleot ide P olym or ph ism (SN P ) sequ en ces in volve su scept ibilit y t o disea ses a s m en t ion ed in t h e en d of sect ion 2.1.1. A scen a r io given will descr ibe wh y Sin gle N u cleot ide P olym or ph ism s (SN P : s) a r e im por t a n t [32]. Cou ples r egist er s for a h ea lt h ch eck a n d gives blood t o be a n a lyzed in or der t o det ect h ow h ea lt h y t h ey a r e. Th e blood goes t h r ou gh t r ea t m en t s so on ly sm a ll sequ en ces of n u cleot ides a r e left . Th e Sin gle N u cleot ide P olym or ph ism (SNP ) sequ en ce

of on e in dividu a l is t h e followin g:

(16)

ACGG”. Ot her person ha s t he following of

“GCCAGTATTGTCGATTTCACAAGTGCGTTTCTGTCGGGATGTCACACA

ACGG”. The sequences from bot h individual’s a re codes for a prot ein, coding t h e u pt a ke of fa t a n d su ga r in t h e h u m a n body. Th e sm a ll va r ia t ion s bet ween t h ese t wo in dividu a ls a r e m a r ked wit h a color . On e of t h em h a s h igh r isk of get t in g dia bet es. Wit h t h e h elp of t oda y’s t ech n ology, SNP a n a lyses a r e u sed t o det er m in a t e disea se su scept ibilit y [32]. An a lysis r evea ls t er r ible n ews for t h e cou ple, wer e t h e in dividu a l wit h t h e sin gle ba se ch a n ged t o G h a s t o st a r t u sin g in su lin wit h a syr in ge, u n less food h a bit ch a n ge wit h in a yea r or t wo. Th e scen a r io descr ibed a bove a r e ver y com m on in h ea lt h ca r e t oda y a n d a lso n ot t h e on ly wor k a r ea exploit in g gen et ic va r ia t ion s. In for en sic scien ce t h e gen et ic va r ia t ion s a r e exploit ed du r in g DN A fin ger pr in t in g [32].

3.2.B i o c h e m i s t ry o f Lip i d s

Bioch em ist r y is a lso ca lled biologica l ch em ist r y wh ich is t h e st u dy of ch em ica l pr ocesses in livin g or ga n ism s. Bioch em ist r y r egu la t es a n d gover n s over a ll livin g pr ocesses wit h in a ll livin g or ga n ism s [5]. Th is occu r s by bioch em ica l sign a lin g. Th e sign a lin g is sor t of a n in for m a t ion flow a s in sen din g a m essa ge fr om on e pla ce t o a n ot h er . Sign a ls flow t h r ou gh ever y pa r t in a n or ga n ism r egu la t in g t h e m et a bolism . Met a bolism st a n ds for t h e m ea n in g of livin g or ga n ism s t o su st a in life a n d r epr odu ce t h em s elf. On e im por t a n t pa r t in bioch em ist r y is t h e lipids. Lipids a r e im por t a n t com pon en t s in a cell a n d for m cell m em br a n e, vit a l t issu es a n d ser ve a s a n en er gy sou r ce for t h e or ga n ism [1]. Lipids a r e st or ed a s en er gy r eser ves wit h in t h e or ga n ism a n d u sed wh e n n eeded. Lipids h elp keepin g t h e elect r och em ica l ba la n ce of a cell, cell sign a lin g a n d t r a ffickin g r ega r din g wh a t is goin g in or ou t t o t h e cell [1 1]. Th e lipids u su a lly con sist of a pola r h ea d a n d a h ydr oph obic t a il. Th e lipids bin d t o ea ch ot h er du e t o t h e h ydr oph obic pa r t wa n t s t o st a y in con t a ct wit h ot h er h ydr oph obic m olecu les [3]. Th e dist r ibu t ion bet ween t h e h ydr oph obic a n d pola r pa r t s of t h e lipids dir ect s t h e 3 -dim en sion a l st r u ct u r e of t h e m olecu les [7] a n d wit h a r ela t ively la r ge pola r pa r t , t h e lipids for m m icelles wh ile m or e equ a l dist r ibu t ion , lea ds t o t h e for m a t ion of dou ble la yer s kn own a s m em br a n es (Fig 3).

(17)

F i g u re 3. P ict u r e of lipids wit h h ydr oph obic t a ils bou n d t oget h er a n d wit h

ot h er com pon en t s for m in g t h e m em br a n e. (Modified pict u r e t a ken fr om H u m a n Cell Biology r ef. [43]).

3.2.1. L ipid d efin ition

Ch em ist s, bioch em ist s a n d ot h er a n a lyst s t h a t wor k wit h lipids h a ve a gr ea t a n d fir m u n der st a n din g of t h e t er m ca lled lipid a ccor din g t o [19]. Bu t t h er e is n o widely a ccept ed defin it ion t oda y a n d t h ey a r e sa id t o be a gr ou p of n a t u r a lly occu r r in g com pou n ds. In a n or ga n ism , [44] a n d [53] st a t e t h a t t h ou sa n ds of va r iou s for m s of lipid m olecu les ca n be fou n d a n d lipids ca n be ca t egor ized in t o six m a in ca t egor ies (Fig 4). Th ey a ll h a ve a low solu bilit y in wa t er a n d h igh solu bilit y in or ga n ic solven t s.

3.2.2. Classes of L ipid s

Recen t ly a n ew n om en cla t u r e syst em wa s pr oposed by [26] du e t o t h e diver sit y of lipids in h u m a n pla sm a , sepa r a t in g lipids in t o eigh t cla sses or ca t egor y wh er e six of t h em a r e con sider e d m a in cla sses. E a ch cla ss ca n be fu r t h er divided in t o su b cla sses a n d in dividu a l m olecu la r species (Fig 3).

 Th e fir st ca t egor y is t h e fa t t y a cyls a n d is a lso ca lled fa t t y a cids. Th e fa t t y a cids ca n h a ve t h r ee for m s su ch a s fa t t y a cids, oct a deca n oids a n d eicosa n oids. Th ey a r e t h e m ost com m on bu ildin g block for m or e st r u ct u r a l com plex lipids a n d ca n be sa t u r a t ed or u n sa t u r a t ed. Cells u se t h ese lipids t o for m t h e va r iou s m em br a n es fou n d in a cell, t o st or e en er gy a n d t o a dju st t h e m em br a n e flu idit y in m a n y ce lls. [43, 53]

 Secon d ca t egor y is t h e Glycer olipids a n d h a s t h r ee for m s a s m on o-, di- a n d t r ia cylglycer olipids. Th eir fu n ct ion s a r e m a in ly a s en er gy st or a ge a n d a r e bu lked u p in t h e t issu e a s fa t in a n im a ls. [43, 53]

(18)

 Th ir d ca t egor y is ca lled Glycer oph ospolipids bu t t h ey a r e u su a lly ca lled ph osph olipids. Th e m a in for m s a r e P h osph a t idylch olin e (P C), P h osph a t idylch et h a n ola m in e (P E ) a n d P h osph a t idic a cid (P A). Th e glycer oph ospolipid cla sses a r e t h e on ly on es t h a t h a ve a ph osph or bin din g a n d t h ey a r e t h e key com p on en t in or der t o for m bila yer s. [43, 53]

 Th e fou r t h ca t egor y con sist s of Sph in golipids. Th e m a in for m s a r e Sph in gom yelin a n d Cer a m ides. Th e Sph in golipids h a ve a pola r h ea d a n d t wo n on pola r t a ils. Sph in gom yelin a ct a s a pr ot ect ion for m in g a m yelin sh ea t h t o pr ot ect n er ves. [43, 53]

 Th e fift h ca t egor y is t h e St er ol lipids a n d t h ey a r e of va r iou s a lcoh ol for m s. St er ol lipids a r e a n im por t a n t com pon en t for biologica l r oles. St er ols a ct a s r egu la t in g h or m on es a n d a s sign a lin g m olecu les. [43, 53]

 Th e la st ca t egor y is t h e P r en ols t h a t for m t er pen es a n d a ct a s a pr e -cu r sor m ole-cu les of vit a m in s a s vit a m in A, E a n d K. [43, 53]

3.2.3. E n zym es in volved in th e syn th esis of lipid s

A deeper in sigh t is pr esen t ed in t h is sect ion wit h focu s on lipids a n d it is syn t h esis, for a m or e u n der st a n din g on t h e a m ou n t of in for m a t ion a P AT m u st be a ble t o pr ocess. St a r t in g fr om t h e st a r t of da t a in pu t s (a lipid n a m e con n ect ed t o glycer olipids) t o r esu lt s obt a in ed.

Som e lipid ch a in s a r e ver y lon g or com plex wh ile ot h er s a r e sh or t . It wou ld t a ke a lon g t im e t o ch em ica lly syn t h esize t h e lipids, h owever , wit h t h e h elp of en zym es it is m u ch fa st er a s [37] pr esen t s. N u m er ou s for m s of lipids occu r a n d sever a l en zym es a r e n eeded. In [46] a syst em biology view pr esen t s n eeded en zym es by u se of a P AT. E .g. t h e syn t h esis of fa t t y a cids occu r s in t h e cyt opla sm a n d key en zym es in volved a r e t h e a cet yl -CoA ca r boxyla se (ACC) a n d m a lon yl-CoA ca r boxyla se (MCC) st a t ed in [51]. Wh ile a n ot h er gr ou p of coen zym e ca lled Acyl-CoA, ch or est er ol a cylt r a n sfer a se (ACAT) wor ks on ch olest er ol [51]. Th is is st r en gt h en ed in [45] sh owin g a clea r view by pict u r es. Th e fa t t y a cids a r e so m a n y a n d ca n be sa t u r a t ed or u n sa t u r a t ed a n d for t h is pu r pose design a t ed sym bols a r e given [31] in or der t o keep t r a ck of t h e ca r bon a t om s a n d t h eir bin din gs. Th e sym bols con sist of t wo n u m ber s bet ween a colon (:) [31]. Th e fir st n u m ber t ells u s t h e ca r bon len gt h of t h e fa t t y a cid a n d t h e secon d n u m ber t h e st a t e of sa t u r a t ion . A fa t t y a cid wit h sever a l u n sa t u r a t ed bou n ds sh ows a h igh er n u m ber a t it is secon d va lu e (S ee T able 3). Syn t h esis of fa t t y a cids beyon d 16 ca r bon s len gt h goes t h r ou gh a t wo-ca r bon elon ga t ion pr ocess, a ccor din g t o [31] by en zym es in t h e en dopla sm ic r et icu lu m (E R). N ot on ly elon ga t ion occu r s bu t a lso desa t u r a t ion by en zym es in t h e en dopla sm ic r et icu lu m (E R) u sin g fou r en zym es n a m ed desa t u r a se delt a fou r , delt a five, delt a six a n d delt a n in e. Th e design a t ed delt a n a m es wit h a n u m ber a r e given a ccor din g t o wh ich posit ion in t h e fa t t y a cid ca r bon ch a in t h e desa t u r a t ion occu r s [31]. Th e m a in den a t u r a se is delt a n in e a n d is ca lled St ea r oyl-CoA desa t u r a se-1. Th e desa t u r a t ion r equ ir es oxygen (O2), a coen zym e ca lled N icot in a m ide a den in e din u cleot ide h ydr ogen (N ADH ) a n d

(19)

a n elect r on t r a n spor t in g h em opr ot ein ca lled Cyt och r om e b5 [47]. In fa t t y a cid desa t u r a t ion t wo h ydr ogen a t om s a r e r em oved fr om t h e fa t t y a cid m a kin g a n oxida t ion on bot h t h e fa t t y a cid a n d N ADH . Th is cr ea t es a dou ble bon d bet ween ca r bon s in t h e fa t t y a cid ch a in .

Ta b le 3. Th e m a in fa t t y a cids in or ga n ism s (Modified t a ble t a ken fr om Cyber lipid

cen t er r ef. [31] a n d Vir gin ia web edu ca t ion r ef. [10])

Main fatty acids

Number of

carbons

Name Systematic name Symbol Structure

Saturated fatty acids

12 Lauric acid Dodecanoid acid 12:0 CH3( CH2)10COOH

14 Myristic acid Tetradecanoic acid 14:0 CH3( CH2)12COOH

16 Palmitic acid Hexadecanoic acid 16:0 CH3( CH2)14COOH

18 Stearic acid Octadecanoic acid 18:0 CH3( CH2)16COOH

20 Archidic acid Eicosanoic acid 20:0 CH3( CH2)18COOH

22 Behenic acid Docosanoic acid 22:0 CH3( CH2)20COOH

24 Lignoceric acid Tetracosanoic acid 24:0 CH3( CH2)22COOH

Unsaturated fatty acids

16 Palmitoleic acid 9-Hexadecanoic acid 16:1 CH3( CH2)5CH=CH(CH2)7COOH

18 Oleic acid 9-Octadecanoic acid 18:1 CH3( CH2)7CH=CH(CH2)7COOH

18 Linoleic acid 9,12-Octadecanoic acid

18:2 CH3(CH2)4(CH=CHCH2)2(CH2)6COOH

18 a-Linolenic acid 9,12,15-Octadecanoic acid

18:3 CH3CH2(CH=CHCH2)3(CH2)6COOH

18 g-Linolenic acid 6,9,12-Octadecanoic acid

18:3 CH3(CH2)4(CH=CHCH2)3(CH2)3COOH

20 Arachidonic acid

5,8,11,14-Eicosatetraenoic acid

20:4 CH3(CH2)4(CH=CHCH2)4(CH2)2COOH

24 Nervonic acid 15-Tetracosanoic acid 24:1 CH3(CH2)7CH=CH(CH2)13COOH

Com plex lipids h a ve a lon ger biosyn t h et ic pa t h wa y a n d t wo m a in pa t h wa ys a r e kn own a ccor din g t o [47], t h e sn -glycer ol-3-ph osph a t e pa t h wa y (a lso kn own a s t h e Ken n edy pa t h wa y) a n d t h e m on oa cylglycer ol pa t h wa y (Fig. 5

an d 6). Syn t h esis by t h e Ken n edy pa t h wa y occu r s in t h e liver a n d a dipose

t issu es wh ile t h e m on oa cylglycer ol pa t h wa y t a kes pla ce in in t est in e con fir m ed in [42]. Bot h st a r t s by ca t a bolism of glu cose (glycolysis) r esu lt in g in t h e bio-syn t h esis of glycer ol, h owever , n ew eviden ce in [47] in dica t es som e glycer ol is syn t h esized a n ew (de n ovo) fr om sin gle m olecu les by a pr ocess ca lled glycer on eogen esis. Th e followin g r ea ct ion s occu r in t h e

(20)

en dopla sm ic r et icu lu m (E R) of m a m m a lia n or ga n ism s [47]; sn -glycer ol-3-ph osol-3-ph a t eis est er ified by a fa t t y a cid coen zym e in a ca t a lyt ic r ea ct ion by t h e en zym e glycer ol-3-ph osph a t e a cylt r a n sfer a se (GP AT) a t t h e sn - posit ion in or der t o for m lysoph os ph a t idic a cid. Lysoph osph a t idic a cid t h en becom es a cyla t ed for m in g ph osph a t idic a cid, a n in t er m edia t e pr odu ct in t h e syn t h esis of a ll glycer olipids [47]. Du r in g syn t h esis of t r ia cyl-sn -glycer ol t h e ph osph a t e gr ou p is r em oved by a fa m ily of en zym es ca lled l ipid ph osph a t e ph osph a t a se (P AP ), st a t ed by [47], for m in g 1,2-dia cyl-sn -glycer ols a n d fu r t h er a cyla t ed by dia cylglycer ol a cylt r a n sfer a se (DGAT) in t o t r ia cyl sn -glycer ol (Fig 5). Du r in g syn t h esis of glycer oph ospolipids, ph osph a t idylch olin e (P C), ph osph a t idylch et h a n ola m in e (P E ) a n d ph osph a t idylser in , t h e ph osph a t e gr ou p is n ot r em oved st a t ed by [6] fr om ph osph a t ic a cid. In st ea d ph osph a t ic a cid a r e u sed a s pr e -cu r sor m olecu les in t h e syn t h esis of glycer olipids (Fig 6). Th e syn t h esis by t h e m on oa cylglycer ol pa t h wa y is less com plex a n d in volves on ly a few en zym es belon gin g t o a n a cylglycer ol a cylt r a n sfer a se fa m ily t o for m t h e t r ia cylglycer ols in t h e in t est in e [47].

F i g u re 4. Seven lipid cla sses a n d h ow t h ey in t er a ct bio-syn t h et ica lly (Modified

pict u r e t a ken fr om Molecu la r bioch em ist r y r ef. [45]).

F i g u re 5. Th e Ken n edy pa t h wa y syn t h esis in m a m m a ls (Modified pict u r e t a ken

(21)

F i g u re 6. Syn t h esis of a cylglyer ides a n d glycer oph osph olipids sh owin g a lin k

bet ween t h e t wo pa t h wa ys (Modified pict u r e t a ken fr om Lipid Libr a r y r ef. [35]).

3.2.4. L ipoprotein s

Lipids a r e a lm ost in solu ble, h owever , t h er e a r e wa ys for t h em t o be t r a n spor t ed or pa ss t h r ou gh t h e blood cir cu la t ion [5]. Lipopr ot ein s a llow t h e lipids t o be t r a n spor t ed t h r ou gh t h e blood cir cu la t ion in or der t o r ea ch differ en t t issu es [5]. Th e lipopr ot ein s a r e a ssem bled in a wa y t h a t it con t a in s bot h pr ot ein s a n d lipids. Th e pr ot ein pa r t ser ves a s a n em u lsifica t ion for t h e lipids [11] a n d t h er e a r e five m a jor cla sses, t wo bein g ver y im por t a n t cla sses of t h e lipopr ot ein s [1], h igh den sit y lipopr ot ein s (H DL) a n d low den sit y lipopr ot ein s (LDL). Rem a in in g lipopr ot ein s a r e In t er m edia t e den sit y lipopr ot ein (IDL), ver y low den sit y lipopr ot ein s (VLDL) a n d ch ylom icr on s [11]. Bot h H DL a n d LDL ca r r y lipids a s ch olest er ol a n d LDL is som et im es r efer r ed t o a s t h e ba d ch olest er ol wh ile H DL is t h e good ch olest er ol. P r oblem s ca n occu r du r in g t h e oxida t ion of t h e LDL a ccor din g t o [11], lea din g t o a lm ost u n st oppa ble ch a in r ea ct ion s. Ch a in r ea ct ion effect r esu lt s in a t h er oscler osis m a n y yea r s la t er [1 1].

3.3.Ge n o m i c s

Gen om ics is a disciplin e wit h in gen et ics t h a t focu s on t h e st u dy of t h e gen om e of a ll or ga n ism s [32]. Wit h in t h is field of r esea r ch t h e pu r pose is t o det er m in e t h e en t ir e DN A sequ en ce of a ll or ga n ism s a n d m a kin g a sca led m a ppin g of a ll t h e gen es [32]. Th is in clu des a lso m a ppin g of wh a t a gen e does a n d t h e a ssocia t ion it h a s t o pr ocesses wit h in a n or ga n ism e.g. m et a bolom ics or lipidom ics [7]. Du r in g t h e pr ocess of m a ppin g a n d a ssocia t ion ea ch gen e get s a design a t ed n a m e a n d n u m ber (a s a n ID t a g) wit h a ll t h e n ecessa r y in for m a t ion pr ovided a bou t t h a t specific gen e in a da t a ba se [32]. In for m a t ion ca n be r et r ieved wit h a P AT fr om t h ese da t a ba ses wh en n eeded.

(22)

3.4.Me t a b o lo m i c s

Met a bolom ics is t h e st u dy of ch em ica l pr ocesses in volvin g m et a bolit es a n d soph ist ica t ed a n a lyt ica l t ech n ologies a r e u sed t o m a ke syst em a t ic st u dies [8, 18]. A syst em a t ic st u dy con sist s of t a r get a n a lysis, m et a bolit e pr ofilin g a n d m et a bolic fin ger pr in t in g. Th e m et a bolit es a r e fou n d in a ll biologica l cells a n d a ll h a ve u n iqu e ch em ica l sign a t u r es, like a fin ger pr in t [46]. Th e u n iqu e fin ger pr in t s a r e a n en d pr odu ct a ft er a cellu la r pr ocess a n d ca n be u sed t o see h ow specific ch em ica l pr ocesses h a ve occu r r ed [8, 46]. Th e ch em ica l pr ocesses t h a t a r e exa m in ed ca n be fr om a livin g or ga n ism , cells, t issu es a n d even fr om a n or ga n . Resea r ch field of m et a bolom ics con sist s of m a n y su b pa r t s a n d t h e pa r t s we a r e focu sin g on a r e t h e st u dy of lipidom ics (lipids/fa t t y a cids) [40].

3.5.Li p i d o m i c s

Lipidom ics is u sed for descr ibin g t h e com plet e pr ofile of lipids in cells, t issu es or or ga n ism s [47]. Lipidom ics a r e on e su bpa r t of m et a bolom ics a n d a n ewly em er ged r esea r ch field t h a t h a s been dr iven fa st for wa r d by r a pid a dva n ces in t ech n ology [53]. Su ch t ech n ologies a r e e.g. m a ss spect r om et r y (MS), flu or escen ce spect r oscopy (F S) [24], a n d Nu clea r Ma gn et ic Reson a n ce (N MR) [39]. Th ese t ech n ologies sa ve la r ge a m ou n t s of da t a in da t a ba ses, givin g P AT's possibilit y of a n ew m et h od for da t a a n a lysis [18].

3.6.Ca rd i o v a s c u la r d i s e a s e s

Th er e a r e m a n y disea ses a r ou n d t h e wor ld. On e of t h em is a cla ss of disea ses t h a t in volve h ea r t or vessels t h a t t r a n spor t s blood (a r t er ies a n d vein s) a n d a r e ca lled ca r diova scu la r disea ses [46]. Ca r diova scu la r disea ses in clu de t h e followin g: An eu r ysm (Abn or m a l bu lge in a n a r t er y), An gin a (Ch est pa in du e t o la ck of blood t o t h e h ea r t m u scle), At h er oscler osis (pla qu e bu ilds u p in side t h e a r t er ies), Cer ebr ova scu la r Acciden t (St r oke), Con gest ive H ea r t F a ilu r e, Cor on a r y Ar t er y Disea se a n d Myoca r dia l In fa r ct ion (H ea r t At t a ck). Sever a l r esea r ch er s in [12] cla im som e kn own fa ct or s a s lipid or fa t con t en t ca n a ffect t h e ca r diova scu la r syst em , poin t in g t h a t h igh den sit y lipopr ot ein s (H DL) a n d low den sit y lipopr ot ein s (LDL) a r e r ega r ded t o be a fa ct or beh in d t h e ca r diova scu la r disea ses. On ce t h e disea se is det ect ed it h a s u su a lly pr ogr essed for yea r s a n d lea ds t o t h e n ecessit y of oper a t ion or even dea t h . An ot h er gr ou p of r esea r ch er s in [29] h a ve sh own t h a t t h er e a r e sever a l SNP :s wh ich is a ssocia t ed wit h pla sm a level of h igh den sit y lipopr ot ein s (H DL) a n d low den sit y lipopr ot ein s (LDL) wh ich a r e a ssocia t ed wit h m yoca r dia l in fa r ct ion . Aft er t est in g in dividu a ls in [2], fr om five differ en t et h n ic gr ou ps wit h r espect of eigh t SN P fr om t h r ee gen es a ssocia t ed wit h ch olest er ol a n d lipopr ot ein syn t h esis, a clea r cor r ela t ion wit h m yoca r dia l in fa r ct ion s wa s obt a in ed. A st u dy on r isks for m yoca r dia l in fa r ct ion in [48] st r en gt h en s t h is t h eor y.

(23)

4. Com pu t er Scien ce ba ck gr ou n d

Th is sect ion con t a in s ba ckgr ou n d in for m a t ion n eeded in or der t o u n der st a n d t h e IT pa r t a n d h ow a P AT wor ks.

4.1.D a t a b a s e s , D a t a m i n i n g a n d Kn o w le d g e d i s c o v e ry

A da t a ba se con sist s of t a bles wit h m a n y colu m n s a n d r ows wit h a collect ion of va st a m ou n t of da t a in for m of in for m a t ion . Th e in for m a t ion is st or ed a t a specific pla ce a n d is oft en bein g well or ga n ized. Wit h in t h e da t a ba se depen din g on wh a t is in ser t ed in t o it , t h e in for m a t ion ca n be of gen et ics, lipidom ics or a bou t som et h in g else en t ir ely. Da t a ba ses oft en r equ ir e som e for m of t ool in or der t o r ea d a n d r et r ieve specific in for m a t ion fa st a m on g t h e va st a m ou n t of da t a . Th is is wh er e da t a m in in g com es in , wh en specific in for m a t ion is sou gh t a n d ga t h er ed by a t ool, t h en pr esen t ed a s r esu lt s. F r om t h e pr esen t ed r esu lt s, kn owledge ca n be ga in ed . Th e ga in ed kn owledge h a ve m a n y for m s bu t a few of t h ese ca n per h a ps be t o m a ke im pr ovem en t in exper im en t s, con fir m in g exper im en t r esu lt s or per h a ps ch a n ge a ppr oa ch m et h ods t o solve a scien t ific pr oblem [13].

4.2.P AT

As m en t ion ed ea r lier a ll kin ds of IT-ba sed t ools h a ve been developed in va r iou s biom edica l fields [21]. Th is h a s been don e in or der t o keep t r a ck of a ll t h e n ecessa r y scien t ific in for m a t ion obt a in ed du r in g t h e pa st yea r s [20]. P AT t h a t we a r e wor kin g wit h pr ocess in for m a t ion a bou t gen es, SNP :s a n d lipid m et a bolism . Th er efor e t h e P AT pla y a n im por t a n t r ole in lipidom ics a n d gen om ics r esea r ch . Th e t ools ca n be eit h er web ba sed soft wa r e or down loa da ble pr ogr a m s t h a t t a ke da t a in for m s of a gen e n a m e, SNP a ccession ID (r s n u m ber ) or a cla ss n a m e of t h e lipid. Th e P AT t h en pr ocess t h e da t a given , m a kin g sea r ch es in loca l or r em ot e da t a ba ses. Th e da t a ba ses con sist s of m a n y t a bles wit h differ en t in for m a t ion r ela t ed t o gen es a n d lipids. Wh a t t h e P AT do is m a kin g m a n y join fu n ct ion s bet ween t h e t a bles a n d pu t t h ese in for m a t ion t oget h er . Wh en a sea r ch occu r s by da t a m in in g, t h e in for m a t ion fr om t h ese t a bles a r e ga t h er ed a n d pr esen t ed a s r esu lt s ba sed on t h e in pu t pa r a m et er s [9]. Th e in pu t pa r a m et er s a r e t h e in ser t ed t ext s in t h e sea r ch field. In som e ca ses t h e t ools fa il t o pr ovide r esu lt s on cer t a in in pu t . F in a lly t h e a n a lysis t ool sh ows t h e r esu lt s t ellin g if a n y r esu lt s wer e fou n d, wh er e t h e lipids or gen es a r e u sed in t h e m et a bolism a n d h ow t h ey a r e con n ect ed t o ot h er lipids or gen es in t h e m et a bolism . Th e r esea r ch er s wor kin g wit h t h e P AT ca n fu r t h er a n a lyze t h e r esu lt s in or der t o ga in n ew kn owledge. Wit h n ew kn owledge, n ew discover ies ca n be m a de in e.g. lipidom ics or gen om ics a n d t h is lea ds t o t h e dem a n d for u pda t es t o t h e da t a ba ses a n d t h e P ATs (Fig 8). Wit h t h e h elp of P AT, n ew d iscover ies of disea ses like a t h er oscler osis ca n be m a de [24] or n ew pa t h wa y lin k s in t h e m et a bolic syst em r espon ses ca n be fou n d [52].

(24)

F i g u re 8. A view on h ow ever yt h in g is con n ect ed (bot h vir t u a lly a n d

ph ysica lly) t o t h e P AT.

5. Requ ir em en t s a n d Test elicit a t ion

Th e followin g sect ion con t a in s in for m a t ion a bou t r equ ir em en t specifica t ion , t est pla n , t est ca se, a n d soft wa r e t est in g.

5.1.R e q u i re m e n t s

Ma n y soft wa r e pr ogr a m s or t ools r equ ir e m on t h s of t est in g t o see if t h ey fu n ct ion cor r ect ly a n d n eed ver y t h or ou gh set s of specific r equ ir em en t s in or der t o be con sider ed a s good fu n ct ion a l t ools, a s st a t ed in [39]. Specific set s of r equ ir em en t s ca n be obt a in ed by in t er views wit h t h e people wh o a r e of im por t a n ce su ch a s st a keh older s or in dividu a ls wit h a key r ole. We u se a n em pir ica l m et h od on soft wa r e t est in g by ga t h er in g r equ ir em en t s pa r t ia lly ba sed on a r equ ir em en t s ph a se fr om on e of t h e wa t er fa ll m odels u sed by Ia n Som m er ville [50]. In t h e r equ ir em en t s ph a se, som et im es a lso ca lled r equ ir em en t s en gin eer in g ph a se, br a in st or m in g, r esea r ch a n d a n a lysis is bein g con du ct ed on t h e soft wa r e t h a t will eit h er be developed or t est ed. We in t en d t o do t h e t est in g pa r t wit h ou t developin g a n y n ew P AT a n d t h er efor e on ly t h e r equ ir em en t s m et h odology is a da pt ed a n d a pplied in ou r st u dy. Th e br a in st or m in g, r esea r ch a n d a n a lysis is t h e m ost im por t a n t pa r t of r equ ir em en t s ga t h er in g. Ba sic r equ ir em en t s a r e defin ed a n d set for u ses t h a t t h e soft wa r e m u st su ppor t [49]. Du r in g t h is ph a se, in -dept h st u dies of cu r r en t wor kin g pr ocesses a r e don e a n d h ow t h e pr oblem s ca n be a ddr essed or solved. Th u s, wit h ou t u n der st a n din g t h e r equ ir em en t s given , h opes of deliver in g a

(25)

su ccessfu l syst em or soft wa r e is u n likely. Requ ir em en t s elicit a t ion s a r e don e in volvin g m a n y in t er views wit h key in dividu a ls. Requ ir em en t s a r e a n a lyzed a n d eva lu a t ed a n d ca n be of t h r ee kin ds. Th ose wit h h igh est pr ior it y a r e t h e

m u st h ave, secon d pr ior it y a r e sh ou ld h ave a n d t h e lowest pr ior it y a r e t h e n ice to h ave. Specifica t ion s a r e t h en m a de t o t h e r equ ir em en t s a n d va lida t ion s a r e

don e wh er e t h e in dividu a ls wit h a key r ole or st a keh older s a ccept t h e r equ ir em en t s. Requ ir em en t s a r e t h er efor e n eeded in or der t o defin e wh a t a soft wa r e pr ogr a m m u st or sh ou ld do, fea t u r es it h a s a n d a cer t a in qu a lit y t h a t it m u st fu lfill [39]. Requ ir em en t s a r e divided in t o fu n ct ion a l a n d n on fu n ct ion a l r equ ir em en t s. F u n ct ion a l r equ ir em en t s a r e a lwa ys defin ed wit h sh a ll or m u st . N on fu n ct ion a l r equ ir em en t s a r e pr oper t ies or cer t a in qu a lit ies a pr odu ct m u st h a ve a n d a r e u sed t o descr ibe soft wa r e’s u sa bilit y, r elia bilit y a n d per for m a n ce [17]. Qu a lit y is n ot som et h in g t h a t ca n be m ea su r ed ea sily a s m en t ion ed in sect ion 1.3, bu t on e wa y t o m ea su r e qu a lit y a ccor din g t o [23] is by u sin g m et r ics or st a t ist ics on pr oper t ies t h a t ca n be m ea su r ed a n d t h a t a r e a ssocia t ed wit h qu a lit y.

5.2.Te s t i n g

In or der t o do good t est in g, t est pla n s a r e developed in t h e for m of a docu m en t so t h a t syst em a t ic a ppr oa ch es ca n be u sed on soft wa r e t est in g. Th e syst em a t ic a ppr oa ch es a r e fa st a n d t im e sa vin g [30]. A t est pla n descr ibes t h e t est in g ph a ses (e.g. u n it t est in g, in t egr a t ion t est in g, a ccept a n ce t est in g), t h e n eeded r equ ir em en t s, a ll a ct ivit ies, t h e n eeded r esou r ces a n d t h e docu m en t a t ion n ot es. In t h is wor k, we u se a ccept a n ce t est in g t o det er m in e if a set of r equ ir em en t s will be m et . Th e a ccept a n ce t est in g is r u n n in g wit h specific da t a . On ce t h e r esu lt s a r e obt a in ed, t h ey a r e com pa r ed wit h kn own expect ed r esu lt s. U pon cor r ect m a t ch of t h e r esu lt , a pa ss or n o pa ss is given . Th er e a r e m a n y wa ys t o m a ke t est s a n d m ost of t h e t im e t h ey differ bet ween com pa n ies [30]. All t est pla n s a r e cr ea t ed pr ior t o a n y t est in g. Th e con t en t of a t est pla n u su a lly con sist s of a su it a ble n a m e r ela t ed t o t h e t est s per for m ed. On ce a t est pla n is select ed, it is per for m ed a ccor din g t o a t em pla t e [30] wh er e t h e t est pla n docu m en t is divided in t o t wo pa r t s. Th e fir st pa r t con sist s of gen er a l in for m a t ion a bou t t h e t est s t h a t will be per for m ed . Th e gen er a l in for m a t ion con t a in s pa r t icipa n t s, t est st r a t egies, specifica t ion s a n d fu n ct ion s or fea t u r es t h a t will be u sed. Th e secon d pa r t con t a in s t h e pr ocedu r e on scen a r ios a lso ca lled ca ses on h ow t est s will be don e. Th e cr ea t ion of t h e t est pla n , r equ ir em en t s a n d t est ca ses will lea d t o t h e a ct u a l soft wa r e t est in g. Th e pu r pose of t h e soft wa r e t est in g is t o in vest iga t e a n d ga in in for m a t ion on h ow t h e select ed soft wa r e pr ogr a m wor k s [39]. Soft wa r e pr oper t ies a r e eva lu a t ed du r in g a select ed soft wa r e t est in g. Du r in g t h e soft wa r e t est in g, b u gs a n d er r or s t h a t m igh t occu r a r e a lso elim in a t ed [23]. On ce t h e soft wa r e pr ogr a m or a pplica t ion m eet s t h e r equ ir em en t s, it is con sider ed t o be wor k in g well a n d a lso sa t isfies t h e n eeds of t h ose wh o r equ est ed it .

(26)

5.3.Te s t c a s e s

A set of t est con dit ion s h a ve t o be wr it t en fr om fu n ct ion a l r equ ir em en t s [17] if soft wa r e is goin g t o be t est ed. Th e t est con dit ion s a r e r efer r ed t o a s t est ca ses a n d a r e per for m ed in cer t a in sequ en ces wh ich it m u st follow du r in g t est in g. F ir st t h e goa l of t h e t est is descr ibed wit h t h e even t or execu t ion st eps, followed by t h e expect ed r es pon se a n d t h e a ct u a l r espon se fr om t h e t est . Ma n y t im es t est ca ses a r e cr ea t ed, wit h a set of con dit ion s fr om given r equ ir em en t s a s m en t ion ed a bove, in or der t o elim in a t e t h e a m bigu it y t o t h e m in im u m in soft wa r e [21]. Th e a m ou n t of t est ca ses depen ds on t h e a m ou n t of r equ ir em en t s given . To per for m t h e t est in g ph a se, t est er s a r e select ed t o exa m in e, discover a n d det er m in e if t h e soft wa r e is wor kin g cor r ect ly or n ot du r in g t est in g. To keep t r a ck of a t est er 's wor k t r a cea bilit y m a t r ices a r e u sed, lin kin g r equ ir em en t s t o specific t est ca ses (Fig 8). A t est ca se con sist s of m a n y st eps st a r t in g wit h a n in pu t ba sed on a r equ ir em en t t h a t will be t est ed a n d en din g on ce a ll st eps h a ve been com plet ed. Th e a ct ion or execu t ion even t s a r e t h en goin g t o be m a de on h ow t o do t h e t est wit h descr ipt ion s on expect ed r espon se or ou t com e. Th e a ct u a l r esu lt s obt a in ed will be wr it t en down on ce t h e t est ca se is com plet e.

F i g u re 1. A t est ca se t em pla t e u sed in t h is st u dy.

6. Resu lt

6.1.F i n d i n g t h e P ATs

Wit h u se of In t er n et , sea r ch in g for P ATs, m a n y wer e fou n d a n d m ost of t h em h a d t h eir own h om epa ges. Sea r ch es wit h t h e Google sea r ch en gin e wer e m a de a s “P AT”. 46 hom epages were found, descr ibing a nalysis t ools tha t ha d t o be eva lu a t ed. Th e h om epa ges st a t ed wh a t t ype of da t a t h e P ATs cou ld pr ocess, wh ich wa s con fir m ed by down loa din g a n d t est in g t h e P ATs. Th e down loa ded P ATs wer e eva lu a t ed wit h t est ca ses 1 t o 3 t h a t r equ ir e t h e t ool t o be a ble t o

References

Related documents

experience of health care; having knowledge about health (e.g. health professional training or self-acquired expertise); having knowledge of the health care system in the immediate

It is shown that this method can be used to integrate data in functional genomics experiments by separating the systematic variation that is common to all data sets considered

67 Paper B Towards big data analysis of deviation and error reports in product development projects Paper C Modeling industrial engineering change processes using the

Noticing the the large sample size requirement and stringent significance threshold of GWAS, the objective of this study is to develop more powerful meth- ods based on

The result relies on an explicit decomposition of the reflection distance function into irreducible G(r, 1, n)-characters and on the eigenvalues of certain adjacency matrices.. In

3.5.4 MCFS on Gene Expression On the 2,002 genes that were selected from the most important methylation sites and SNPs, another MCFS-based feature selection was performed.. This

Om man ser generellt för alla Facebookinlägg så finns en uppkommen möjlighet för Ekeroth att ta till orda genom att tala till folket för att upprätthålla hans persona och

4) olika former av kroppsligt lärande. Pedagogernas personliga syn på utomhuspedagogik innebar alltså att den gav en mångfald av lärandearenor. De menar att utomhusmiljön i sig