• No results found

Towards new computational tools for predicting toxicity

N/A
N/A
Protected

Academic year: 2022

Share "Towards new computational tools for predicting toxicity"

Copied!
68
0
0

Loading.... (view fulltext now)

Full text

(1)Towards new computational tools for predicting toxicity.

(2)

(3) Linnaeus University Dissertations No 243/2016. T OWARDS NEW COMPUTATIONAL TOOLS FOR PREDICTING TOXICITY. S WAPNIL C HAVAN. LINNAEUS UNIVERSITY PRESS.

(4) Towards new computational tools for predicting toxicity Doctoral dissertation, Department of Chemistry and Biomedical Sciences, Linnaeus University, Kalmar, Sweden, 2016 ISBN: 978-91-88357-04-5 Published by: Linnaeus University Press, 351 95 Växjö, Sweden Printed by: Elanders Sverige AB, 2016.

(5) Abstract Chavan, Swapnil (2016). Towards new computational tools for predicting toxicity, Linnaeus University Dissertation No 243/2016, ISBN: 978-91-88357-04-5. Written in English. The toxicological screening of the numerous chemicals that we are exposed to requires significant cost and the use of animals. Accordingly, more efficient methods for the evaluation of toxicity are required to reduce cost and the number of animals used. Computational strategies have the potential to reduce both the cost and the use of animal testing in toxicity screening. The ultimate goal of this thesis is to develop computational models for the prediction of toxicological endpoints that can serve as an alternative to animal testing. In Paper I, an attempt was made to construct a global quantitative structure-activity relationship (QSAR) model for the acute toxicity endpoint (LD50 values) using the Munro database that represents a broad chemical landscape. Such a model could be used for acute toxicity screening of chemicals of diverse structures. Paper II focuses on the use of acute toxicity data to support the prediction of chronic toxicity. The results of this study suggest that for related chemicals having acute toxicities within a similar range, their lowest observed effect levels (LOELs) can be used in read-across strategies to fill gaps in chronic toxicity data. In Paper III a k-nearest neighbor (kNN) classification model was developed to predict human ether-a-go-go related gene (hERG)-derived toxicity. The results suggest that the model has potential for use in identifying compounds with hERG-liabilities, e.g. in drug development. Keywords Acute toxicity, chronic toxicity, hERG, k-NN, LD50, LOEL, Munro database, QSAR, read-across, toxicity.

(6)

(7) /9:D=G>GFL=FLK 'AKLG>HM:DA;9LAGFK.  <<ALAGF9DHM:DAK@=<OGJCGMLKA<=L@=K;GH=G>L@AKL@=KAK.  +GHMD9JK;A=F;=KMEE9JQ.  ::J=NA9LAGFK.   @9HL=J $FLJG<M;LAGF.  . AGDG?A;9D9;LANALQL=KLAF?.  . +@9JE9;=MLA;9D<JM?L=KLAF?.  .  FNAJGFE=FL9DLGPA;9FLL=KLAF?.  . !GG<K9>=LQL=KLAF?.  . GK= J=KHGFK=J=D9LAGFK@AHK.  .  #GO<JM?K9;L.  .  JM?L9J?=LK.  .  JM? J=;=HLGJAFL=J9;LAGF.  .  GK= J=KHGFK=;MJN=.  . .LJM;LMJ= 9;LANALQJ=D9LAGFK@AHK.  .  $FLJG<M;LAGFG>:AG AKGKL=J=K.  .  (G<A>A;9LAGFG>KAR=9F<K@9H=.  .  $FLJG<M;LAGFG>F=OKM:KLALM=FLK.  . ,M9FLAL9LAN=KLJM;LMJ= 9;LANALQJ=D9LAGFK@AHK.  .  JA=>@AKLGJQ.  . MJJ=FLKL9L=G>,.-9F<J=D9L=<L=;@FAIM=K. 

(8) .   NGDMLAGFG>EGD=;MD9J<=K;JAHLGJK. 

(9) .  /QH=KG>,.-K.  .  '9L=KLL=;@FAIM=K>GJ<=N=DGHAF?,.-.  .  '9L=KL9HHDA;9LAGFKG>,.-K.  .  @=EA;9D;9L=?GJQ9HHJG9;@.  .  9L9?9H>ADDAF?9F<-=9< 9;JGKK. . .L9LAKLA;9DE=L@G<K9F<LGGDK. .  (MDLAHD=DAF=9JJ=?J=KKAGF('-. .  +JAF;AH9D;GEHGF=FL9F9DQKAK+. .  C F=9J=KLF=A?@:GJC )). . *:B=;LAN=K.  @9HL=J ;ML=LGPA;ALQHJ=<A;LAGFMKAF?,.-L=;@FAIM=.   -=K=9J;@IM=KLAGFK9F<G:B=;LAN=K.   .MEE9JQG>OGJC.    (MFJG<9L9:9K=.    (=L@G<GDG?Q. 

(10)    -=KMDLK9F<<AK;MKKAGF.   GF;DMKAGF.  @9HL=J ;ML=LGPA;ALQ KMHHGJL=<;@JGFA;LGPA;ALQHJ=<A;LAGF.   -=K=9J;@IM=KLAGFK9F<G:B=;LAN=K. 

(11) .

(12)    .MEE9JQG>OGJC.     -/) *<9L9:9K=.     (=L@G<GDG?Q.     -=KMDLK9F<<AK;MKKAGF.   GF;DMKAGF.  @9HL=J @ -"LGPA;ALQHJ=<A;LAGFMKAF?C ));D9KKA>A;9LAGFE=L@G<.   -=K=9J;@IM=KLAGFK9F<G:B=;LAN=K.   .MEE9JQG>OGJC.    .=LLAF?MHL@=<9L9:9K=.    (=L@G<GDG?Q.    -=KMDLK9F<<AK;MKKAGF.   GF;DMKAGF.  @9HL=J GF;DM<AF?J=E9JCK9F<>MLMJ=GMLDGGC.  ;CFGOD=<?=E=FLK.  -=>=J=F;=K.  .  .

(13)  . %&!"'&! % &12;<1.;2;2;+*;.-87<1./8558?2709*9.:;?12,1*:.:./.::.-<827<1.<.@< +A<1.2:$86*77=6.:*5; 55 <1. 9=+52;1.- 9*9.:; *:. :.9:8-=,.- ?2<1 <1. 9.:62;;287 /:86 <1. :.;9.,<2>.9=+52;1.:;   %?*97251*>*7*7 2,1855;Björn C. G. Karlsson , Annika $8;.70:.7*>2-.*55*+28(2>2*7*87;8772*7-$8+.:<8 &8-.;,1272&8?*:-;058+*5#%$68-.5+=25-270/8:*,=<. <8@2,2<A=7:8-*<*+*;.,*;.;<=-A7<.:7*<287*58=:7*58/ 85.,=5*:%,2.7,.;         !. 

(14)

(15)  236; .   .  . . %?*9725 1*>*7 $*7 :2.-6*7 *7- *7  2,1855; ,=<. <8@2,2<A;=998:<.- ,1:872, <8@2,2<A 9:.-2,<287  47.*:.;< 7.201+8: ,8=95.- :.*-*,:8;; ;<:*<.0A 7<.:7*<287*5 8=:7*5 8/ 85.,=5*: %,2.7,.; . . .  ! 

(16)

(17)  236;  . . . %?*97251*>*716.-+-.5*B2B.;9.:)245*7-.:*7-*7  2,1855;  47.*:.;< 7.201+8: ,5*;;2/2,*<287 8/ 1$  ,1*77.5 +58,4.:; 8=:7*5 8/ 869=<.:2-.- 85.,=5*: .;207 .9=+ .+:=*:A !  ;   B  .  .

(18) <<ALAGF9DHM:DAK@=<OGJCGMLKA<=L@=K;GH=G>L@AK L@=KAK  $9F )A;@GDDK, Swapnil Chavan, Kerstin Golker, Björn C. G. &9JDKKGF"MKL9> *DKKGFFFAC9( -GK=F?J=F.M:J9E9FA9F .MJAQ9F9J9Q9F9F%=KH=J" 2ACD9F<=J /@=GJ=LA;9D9F< ;GEHML9LAGF9DKLJ9L=?A=K>GJL@=KLM<QG>L@=EGD=;MD9JAEHJAFLAF? HJG;=KK9F<HGDQE=JH=J>GJE9F;= <N9F;=KAFAG;@=EA;9D F?AF==JAF?AGL=;@FGDG?Q

(19)  

(20)  

(21) *$

(22)

(23)

(24) 

(25) 8

(26) 8   .O9HFAD@9N9F.@AJAK@+9O9J-9B=K@.AF?@9F<(  DAR9:=L@ .G:@A9 AF<AF?KAL=;@9J9;L=JAR9LAGFG>"HJGL=AF ;GMHD=< J=;=HLGJ:Q9D9FAF= K;9FFAF?EML9?=F=KAKMKAF?EGD=;MD9J <QF9EA;K9F<:AF<AF?>J===F=J?Q9HHJG9;@9HHDA;9LAGFLG  ;@=EGCAF=J=;=HLGJ - (GD=;MD9JAN=JKALQ

(27)  

(28)  *$

(29)

(30)

(31)  K

(32) 

(33)

(34)   R  FMH.@9@-9FBAL.@AF<=+9N9F&9J=1#QE9N9L@A.O9HFAD @9N9F(  DAR9:=L@.G:@A9 $F<M;=<>AL:AF<AF?G>9D<GK= J=<M;L9K=AF@A:ALGJKLG&- 

(35) (=<A;AF9D@=EAKLJQ-=K=9J;@ 

(36)    *$

(37)

(38)

(39)  K

(40)

(41)

(42) 

(43)  R  .O9HFAD@9N9F.9?9J@9QQ=(  DAR9:=L@.G:@A9 (GD=;MD9J <QF9EA;K<AJ=;L=<G(!KLM<A=KGF;9J:G;QDA;)=MJ9EAFA<9K= AF@A:ALGJK (GD=;MD9JAN=JKALQ

(44)   *$

(45)

(46)

(47)  K

(48) 

(49)

(50)    .O9HFAD@9N9F+9N9F&9J=FMH.@9@1#QE9N9L@A-9B=K@ .AF?@(  DAR9:=L@.G:@A9 (KLM<A=KGFF=MJ9EAFA<9K=>GJ HJG:AF?:AF<AF?HGK=G>ALKAF@A:ALGJK (=<A;AF9D@=EAKLJQ -=K=9J;@

(51)  

(52) 

(53) *$

(54)

(55)

(56)  K

(57)

(58)

(59) 

(60)

(61)  P  (  .G:@A9-9B=K@.AF?@+9N9F&9J=.O9HFAD@9N9F  -9LAGF9D<=KA?FG>-9FL9?GFAKLK9KMJN=QG>;GEHML9LAGF9D KLM<A=K  PH=JL*HAFAGFGFJM?AK;GN=JQ

(62)

(63)   *$

(64)   

(65)  

(66)

(67)   . . .

(68)  .               /GEQ>9EADQ . .  .

(69) . .

(70)  . +*+0'-.$ ) .0((-4 2= 9J= =PHGK=< LG E9FQ ;@=EA;9D KM:KL9F;=K AF GMJ =N=JQ<9Q DA>=  /@AK =PHGKMJ=KL=EK>JGE9N9JA=LQG>KGMJ;=KKM;@9KGMJ@GE=KOGJCHD9;=K>GG< O=;GFKME=<JM?L@=J9HA=KO=E9Q:=L9CAF?9F<L@JGM?@AF<MKLJA9DO9KL=K  /@=F==<LG9KKMJ=GMJK9>=LQ>JGE=PHGKMJ=LG;@=EA;9DKOAL@9<N=JK==>>=;LK J=IMAJ=K AF>GJE9LAGF 9:GML L@= LGPA;ALQ G> L@= FME=JGMK KM:KL9F;=K O= 9J= =PHGK=<LG@GO=N=JKM;@=N9DM9LAGFK9J=;MJJ=FLDQH=J>GJE=<L@JGM?@9FAE9D L=KLAF?9F<9J=N=JQ=PH=FKAN=  $F 9<<ALAGF LG L@AK EADDAGFK G> F=O <JM?K 9J= :=AF? KQFL@=KAR=< 9FFM9DDQ :Q H@9JE9;=MLA;9DAF<MKLJA=KOGJD<OA<=O@A;@<=E9F<KL@=MK=G>L@GMK9F<KG> 9FAE9DK >GJ L@= HMJHGK= G> K9>=LQ 9F< =>>A;9;Q =N9DM9LAGFK <MJAF? HJ=;DAFA;9D L=KLAF? !MJL@=JEGJ=L@=?DG:9DHJG<M;LAGFG>;@=EA;9DK:Q9?JA;MDLMJ9D:AG L=;@FGDG?A;9D ;@=EA;9D ;=E=FL >=JLADAR=J E=L9D HGDQE=JK L=PLAD= =L;  AF<MKLJA=K9EGMFLKMHLG:ADDAGFKG>LGFK 9K=<GF9N9AD9:D=AF>GJE9LAGFL@= LGPA;GDG?A;9D =N9DM9LAGFK G> L@=K= ;@=EA;9DK OGMD< F==< 9:GML  EADDAGF 9FAE9DK AF MJGH=9F 0FAGF 9F< 9:GML  EADDAGF 9FAE9DK AF L@= 0. <MJAF?L@AK<=;9<=>GJL=KLAF?HMJHGK=K /@MKD9J?=9FAE9DJ=KGMJ;=KLAE= =>>GJLK9F<;GKLKOADD:=J=IMAJ=<LGE==LL@=F==<G>LGPA;GDG?A;9D=N9DM9LAGFK G>9DDL@=K=;@=EA;9DK  /G K9N= 9FAE9DK 9F< KH==< MH L@= LGPA;GDG?A;9D =N9DM9LAGFK <=N=DGHE=FL G> F=OFGF 9FAE9D L=KLAF?E=L@G<K9F<EG<=DKAKJ=IMAJ=< /G<9L=E9FQFGF 9FAE9D L=KLAF? E=L@G<K 9J= 9N9AD9:D= 9EGF? O@A;@ ;GEHML9LAGF9D E=L@G<K L@9L ;9F J=D9L= KLJM;LMJ9D >=9LMJ=K G> ;@=EA;9D KM:KL9F;=K OAL@ 9 H9JLA;MD9J >MF;LAGFGJ:AGDG?A;9D9;LANALQ@9N=:==FK@GOFLG:=KH=;A9DDQHJGEAKAF?  "=F=J9DDQ 9 KLJM;LMJ= 9;LANALQ J=D9LAGFK@AH .- EG<=D ;9F := <=N=DGH=< MKAF?L@=>GDDGOAF?KL=HK  GDD=;LAGFG>:AGDG?A;9D LGPA;GDG?A;9D<9L9>GJ9 ?JGMH G> ;@=EA;9DK   =K;JAHLAGF G> ;@=EA;9DK AF L@= >GJE G> FME=JA;9D J=HJ=K=FL9LAGFK A = AFL@=>GJEG><=K;JAHLGJK =JAN9LAGFG>9J=D9LAGFK@AH :=LO==FL@=<=K;JAHLGJK9F<:AGDG?A;9D LGPA;GDG?A;9D=F<HGAFLKL@JGM?@.  .

(71) 9HHDA;9LAGFG>E9L@=E9LA;9D KL9LAKLA;9DL=;@FAIM=K  $F L@AK L@=KAK J=D9LAGFK@AHK :=LO==F ;@=EA;9D KLJM;LMJ= 9F< :AGDG?A;9D 9;LANALQ @9N=:==F=PHDGJ=<MKAF?;GEHML9LAGF9DLGGDKLG<=N=DGHE9L@=E9LA;9DEG<=DK ;GJJ=D9LAF?KLJM;LMJ=9F<9;LANALQOAL@L@=?G9DG>HJG<M;AF??=F=J9DEG<=DK>GJ HJ=<A;LAF?9;ML=LGPA;ALQ9F<;@JGFA;LGPA;ALQ9KO=DD9K9EG<=D>GJA<=FLA>QAF? LGPA;ALQ <=JAN=< >JGE @ME9F L@=J 9 ?G ?G -=D9L=< "=F= @ -" <=JAN=< ;9J<AGLGPA;ALQ  .

(72) .

(73)  . - 1$/$*). (  . & 1 

(74)  # - ! " "#. @ -" # .. $0+ C )) '

(75)  '* ' (. (* ) * ) - )* ' *  + ,.- -/ - # -/ . .- .($' . /<+. :KGJHLAGFAKLJA:MLAGF(=L9:GDAKE9F< P;J=LAGF @=EA;9D:KLJ9;LK.=JNA;= @=EAKLJQ=N=DGHE=FL&AL JGKK19DA<9LAGF >>=;LAN=GF;=FLJ9LAGF

(76)  MJGH=9F@=EA;9D?=F;Q JJGJ-9L= !GG<9F<JM?<EAFAKLJ9LAGF "=F=LA;D?GJAL@E "DG:9DDQ#9JEGFAR=<.QKL=E #ME9F L@=J à ?G ?G -=D9L=<"=F= #9R9J< N9DM9LAGF.MHHGJL.QKL=E $FL=JF9LAGF9D0FAGFG>+MJ=9F<HHDA=<@=EAKLJQ C )=9J=KL)=A?@:GJ '=L@9DGK=

(77)  'GO=KL*:K=JN=< >>=;L'=N=D (GD=;MD9J=KK.QKL=E (G<=*>;LAGF )=O F=J?Q9F<$F<MKLJA9D/=;@FGDG?Q=N=DGHE=FL*J?9FAR9LAGF )GF JJGJ-9L= )G*:K=JN=< >>=;L'=N=D *J?9FAK9LAGF>GJ ;GFGEA;G GH=J9LAGF9F<=N=DGHE=FL +JAF;AH9DGEHGF=FLF9DQKAK ,M9FLAL9LAN=.LJM;LMJ= ;LANALQ-=D9LAGFK@AH -=H=9L=<GK=/GPA;ALQ -=?AKLJ9LAGF N9DM9LAGFML@GJAR9LAGF9F<-=KLJA;LAGFG>@=EA;9DK -=?AKLJQG>/GPA; >>=;LKG>@=EA;9D.M:KL9F;=K .LJM;LMJ= ;LANALQ-=D9LAGFK@AH .AEHDA>A=<(GD=;MD9J $FHML'AF= FLJQ.QKL=E /GJK9<=K=+GAFL=K.  .

(78) #+/ - $)/-*0/$*) /@=:AGDG?A;9D9;LANALA=KG>D9J?=FME:=JKG>;@=EA;9DKKM;@9KH@9JE9;=MLA;9D <JM?K>GG<KM:KL9F;=K=FNAJGFE=FL9D9F<9?JA;MDLMJ9DHJG<M;LK9F<AF<MKLJA9D ;@=EA;9DK F==<LG:= <=L=JEAF=<LG=FKMJ= K9>=LQ /@AKAK9LAE=;GFKMEAF? 9F< =PH=FKAN= 9;LANALQ L@9L @9K L@= HGL=FLA9D LG := AEHJGN=< :Q 9<GHLAF? ;GEHML9LAGF9DL=;@FAIM=K .=N=J9D AFKADA;GLGPA;GDG?Q9F<H@9JE9;GDG?QLGGDK 9J=:=AF?<=N=DGH=<>GJ9KKAKLAF?AFL@=HJG>ADAF?G>LGPA;KM:KL9F;=KAFGJ<=JLG AFN=KLA?9L= L@=AJ EG<= G> 9;LAGF (* JAKC 9KK=KKE=FL 9F< K9>=LQ KLM<A=K  /@= OGJC HJ=K=FL=< @=J= >G;MK=K GF <A>>=J=FL ;GEHML9LAGF9D 9HHJG9;@=K >GJ 9KK=KKAF? :AGDG?A;9D LGPA;GDG?A;9D 9;LANALQ  /@AK L@=KAK J=N=9DK IM9FLAL9LAN= KLJM;LMJ= 9;LANALQ J=D9LAGFK@AH ,.- KLM<A=K >GJ L@= HJ=<A;LAGF G> 9;ML= LGPA;ALQ ;@JGFA; LGPA;ALQ 9F< @ME9F L@=J 9 ?G ?G -=D9L=< "=F= @ -" <=JAN=<LGPA;ALQ . AGDG?A;9D9;LANALQL=KLAF? #ME9FK 9J= =PHGK=< LG E9FQ ;@=EA;9DK 9F< KM:KL9F;=K L@JGM?@GML L@=AJ DA>=KH9F  /@=K= =PHGKMJ=K G;;MJ >JGE <A>>=J=FL KGMJ;=K DAC= L@= >GG< KMHHDQ <JM? L@=J9HQ DANAF? =FNAJGFE=FL OGJCHD9;=K 9F< AF<MKLJA9D O9KL=K  /@MK L@=J=AK9F==<LG=FKMJ=@ME9FK9>=LQ>JGE9DDL@=;@=EA;9DK O@A;@@ME9FK 9J= =PHGK=< LG /GE==L L@AK <=E9F< AL AK F=;=KK9JQ LG H=J>GJE 9 :AGDG?A;9D =N9DM9LAGF G> 9DD L@=K= ;@=EA;9DK 9F< KM:KL9F;=K  /@= E9BGJ 9AE G> L@= :AGDG?A;9D =N9DM9LAGFG> ;@=EA;9DK AK LG A<=FLA>Q L@=AJ (* HGL=F;Q 9F< LG EGFALGJ@ME9F9F<=FNAJGFE=FL9DK9>=LQ  AG9KK9QAK9E=L@G<L@9L<=L=JEAF=KL@=:AGDG?A;9D9;LANALQG>9KM:KL9F;=:Q E=9KMJAF?ALK=>>=;LGF9FGJ?9FAKE9F<;GEH9JAF? ALOAL@L@=9;LANALQG>9F 9?J==<KL9F<9J<  $FGL@=JOGJ<K :AG9KK9QAK<=>AF=<9K L@==KLAE9LAGFG>L@= ;GF;=FLJ9LAGF GJ HGL=F;Q G> 9 KM:KL9F;= :Q E=9KMJ=E=FL G> L@= :AGDG?A;9D J=KHGFK=L@9LALHJG<M;=K /@=E9AFMK=KG>:AG9KK9Q9J= ¾ LG=KLAE9L=L@=H@9JE9;GDG?A;9D9;LANALQG>9KM:KL9F;= ¾ LG=PHDGJ=L@=>MF;LAGFKG>=F<G?=FGMKE=<A9LGJK9F<. .

(79)   ¾ LG <=L=JEAF= L@= LGPA;GDG?A;9D 9F< MFO9FL=< =>>=;LK G> F=O GJ MF<=>AF=< KM:KL9F;=K >GJ EGFALGJAF? @ME9F GJ =FNAJGFE=FL9D K9>=LQ. +@9JE9;=MLA;9D<JM?L=KLAF? /@GMK9F<K G> F=O <JM?K 9J= KQFL@=KAR=< Q=9JDQ :Q H@9JE9;=MLA;9D AF<MKLJA=K OGJD<OA<=AFGJ<=JLG>AF<:J=9CL@JGM?@K>GJE9FQ<A>>=J=FL @ME9F<AK=9K=K 9F<<AKGJ<=JK DDL@=K=<JM?KF==<LGMF<=J?GJA?GJGMK9FAE9DL=KLAF?:=>GJ= :=AF? 9HHJGN=< >GJ @ME9F LJA9DK  FAE9D L=KLAF? AFNGDN=K H@9JE9;GCAF=LA; 9F< H@9JE9;G<QF9EA; HJG>ADAF? G> <JM?K  +@9JE9;GCAF=LA;K AFN=KLA?9L= F=O <JM?K’ 9:KGJHLAGF <AKLJA:MLAGF E=L9:GDAKE 9F< =P;J=LAGF (  O@AD= H@9JE9;G<QF9EA;KAFN=KLA?9L=L@=AJ:AGDG?A;9D9;LANALQ=>>A;9;Q9F<LGPA;ALQ  /@= =>>A;9;Q L=KLAF? G> 9 F=O <JM? AK AFN=KLA?9L=< :Q L=KLAF? ALK KLJ=F?L@ AF ;MJAF?L@=AF<M;=<ADDF=KKG>AFL=J=KLAFL@=L=KL9FAE9D /@AKHJ=;DAFA;9DKLM<Q AFNGDN=K L@= 9;ML= KM: 9;ML= 9F< ;@JGFA; LGPA;ALQ L=KLAF?  /@= 9;ML= LGPA;ALQ L=KLAF? AK H=J>GJE=< LG KLM<Q J9HA< HGAKGFAF? O@=J=9K KM: 9;ML= LGPA;ALQ L=KLAF?AFN=KLA?9L=KO@=L@=J9FQLGPA;E=L9:GDAL=G>L@=<JM?@9K:==F>GJE=< GN=J LAE=  /@= ;@JGFA; LGPA;ALQ AFN=KLA?9L=K L@= LGPA; =>>=;L G> 9 <JM? GN=J HJGDGF?=< MH LG L@= LGL9D DA>=KH9F G> L@= L=KL 9FAE9D 9F< J=H=9L=< =PHGKMJ=K  >L=J KM;;=KK>MDDQ H9KKAF? HJ=;DAFA;9D L=KLK L@= <JM? MF<=J?G=K @ME9F LJA9DK  $F L@AK KL=H L@= <JM? AK 9<EAFAKL=J=< AF 9 <GM:D= :DAF< ;GFLJGDD=<LJA9DL@9L=F9:D=K;DAFA;9DHJ9;LALAGF=JKLG<=L=JEAF=L@==>>=;LG>L@= <JM?9F<ALK<GK= J=KHGFK=J=D9LAGFK@AH

(80) .  FNAJGFE=FL9DLGPA;9FLL=KLAF? FNAJGFE=FL9D LGPA;9FLK 9J= <=>AF=< 9K ;@=EA;9D GJ H@QKA;9D 9?=FLK J=D=9K=< AFLG L@= ?=F=J9D =FNAJGFE=FL L@9L ;9F HJG<M;= 9<N=JK= @=9DL@ =>>=;LK 9EGF? D9J?=FME:=JKG>H=GHD=  FNAJGFE=FL9DLGPA;9FLK9>>=;LL@=J=KHAJ9LGJQLJ9;L L@= ?9KLJGAFL=KLAF9D LJ9;L <=JE9D LAKKM=K 9F<GL@=J GJ?9FK 9F< L@=J=:Q ;9MK= LAKKM= <9E9?= 9F< GJ DGKK G> KLJM;LMJ= 9F< >MF;LAGF G> NAL9D GJ?9FK  /@= AFL=FKALQG>KM;@LGPA;=>>=;LKAK:9K=<MHGFL@=<GK=KG>LGPA;9FLK "=F=J9DDQ L@=K= LGPA;9FLK ;9F ;9MK= 9;ML= LGPA;ALQ >GJ K@GJL L=JE =PHGKMJ= :ML EGKL G>L=FL@=Q;9MK=;@JGFA;=>>=;LK<M=LGDGF? L=JE=PHGKMJ=K  PHGKMJ= J=KHGFK=J=D9LAGFK@AHK;9F:==KL9:DAK@=<>GJ@ME9FK@GO=N=JL@=J= 9DJ=9<Q=PAKLE9FQLGPA;ALQKLM<A=KMKAF?9FAE9DK /@=J=>GJ=KGE=J=K=9J;@=JK @9N=J=;GEE=F<=<=PLJ9HGD9LAF?@ME9FLGPA;ALA=KG>=FNAJGFE=FL9DLGPA;9FLK >JGE 9FAE9D :AG9KK9Q <9L9  /GPA;ALQ L=KLAF? AF 9FAE9DK AK <=H=F<=FL MHGF <AJ=;L LGPA;ALQ 9KK=KKE=FL AF O@GD= GJ?9FAKEK O@=J= L@= GJ?9FAKEK 9J= =PHGK=< LG L@= LGPA;9FLK G> AFL=J=KL 9F< G:K=JN=< >GJ 9FQ KA?F G> 9<N=JK= @=9DL@=>>=;LK /@=<MJ9LAGFG>L@==PHGKMJ=<=H=F<KMHGFL@=LQH=G>LGPA;ALQ :=AF? =P9EAF=<  /@= KLM<Q <MJ9LAGF >GJ K@GJL L=JE 9;ML= =>>=;LK ;9F :=  @GMJK LG  <9QK KM: ;@JGFA; =>>=;LK >JGE >=O O==CK LG >=O EGFL@K 9F<.  .

(81) ;@JGFA; =>>=;LK 9J= MKM9DDQ KLM<A=< >GJ 9 KA?FA>A;9FL HGJLAGF G> the organism’s O@GD=DA>=KH9F . !GG<K9>=LQL=KLAF? /@=>GG<L@9L;GFKLALML=KGMJ<9ADQ<A=L;GFKAKLKG>E9FQ<AJ=;LDQGJAF<AJ=;LDQ 9<<=< KM:KL9F;=K O@A;@ E9Q GJ E9Q FGL := LGPA; >GJ @ME9F @=9DL@  EGF? L@= <AJ=;LDQ 9<<=< ;GFKLALM=FLK 9J= ;@=EA;9DK HMJHGK=>MDDQ MK=< AF >GG< HJG<M;LAGFHJG;=KKAF?9F<KLGJ9?=L@9L>AF9DDQ=F<MHAFL@=<A=LO@AD=GL@=J ;@=EA;9DK9J=MK=<>GJ=F@9F;AF?L@=L9KL=;GDGMJ9F<>D9NGMJG>>GG< EGF? L@= AF<AJ=;LDQ 9<<=< KM:KL9F;=K 9J= L@= J=KA<M=K G> <JM?K @GJEGF=K 9F< GJ >==< 9<<ALAN=K MK=< AF 9FAE9D 9F< EADC HJG<M;LAGF H=KLA;A<=K MK=< >GJ N=?=L9:D= HJG<M;LAGF F9LMJ9D LGPAFK 9F< ;@=EG HJ=N=FLAN=K AF HD9FLK EA;JG:A9D LGPAFK :9;L=JA9D >MF?9D AF >GG< 9F< E9L=JA9DK MK=< AF >GG< HJG;=KKAF? 9F< H9;C9?AF?  /@= ;GFK=IM=F;=K G> L@=K= KM:KL9F;=K 9J= L@= @=9DL@ HJG:D=EK L@9L ;9F 9JAK= KM;@ 9K 9DD=J?A=K ;9F;=J F=MJGLGPA;ALQ @=H9LGLGPA;ALQAF>=JLADALQ9F<9;ML=9F<;@JGFA;HGAKGFAF?  /G =KL9:DAK@ >GG< K9>=LQ D9J?= =>>GJLK @9N= :==F E9<= :Q LGPA;GDG?AKLK 9F< =HA<=EAGDG?AKLK  /@= =KLAE9LAGF G> L@= ;GF;=FLJ9LAGF G> >GG< 9<<ALAN=K 9F< 9<MDL=J9FLKAK;GEEGFDQMK=<LGA<=FLA>QL@=AJD=N=DG><9ADQAFL9C=O@AD=L@= A<=FLA>A;9LAGF G> LGPAFK 9F< ;9J;AFG?=FA; ;GFKLALM=FLK AF >GG< AK MKM9DDQ 9;;GEHDAK@=< :Q L@= :AG9KK9Q E=L@G< 9F< 9HHDA;9LAGF G> L@= <GK= J=KHGFK= ;MJN= . GK= J=KHGFK=J=D9LAGFK@AHK /@= :AGDG?A;9D 9;LANALQ ;9F := <=K;JA:=< ?J9H@A;9DDQ OAL@ J=KH=;L LG L@= ;GF;=FLJ9LAGF G> L@= <JM? GJ LGPA;9FL  /@= E9?FALM<= G> KM;@ 9 :AGDG?A;9D =>>=;L;9F:=<AKHD9Q=<9K9>MF;LAGFG>L@=<JM?;GF;=FLJ9LAGFAFL@=>GJEG>9 <GK= J=KHGFK= ;MJN=   /@MK L@= <GK= J=KHGFK= ;MJN= AK 9F AEHGJL9FL <=K;JAHLGJAFMF<=JKL9F<AF?<JM?9;LANALQ .  #GO<JM?K9;L ;;GJ<AF? LG :9KA; HJAF;AHD=K G> H@9JE9;GDG?Q 9 <JM? EMKL =P=JL KGE= ;@=EA;9D=>>=;LGF L@=;=DD;GFKLALM=FLKG> 9:G<QLAKKM=AFGJ<=JLG=P@A:AL 9 H@9JE9;GDG?A;9D LGPA;GDG?A;9D9;LAGF .M;@9;@=EA;9D=>>=;L;9FGFDQG;;MJA> 9 <JM? :AF<K LG KH=;A>A; ;=DD ;GFKLALM=FLK  /@= :AF<AF? KAL=K G> <JM?K 9J= CFGOF9KL9J?=LK /@=H@9JE9;GDG?A;9DJ=K=9J;@AKE9AFDQ<JAN=F:QL@=9;LG> A<=FLA>QAF? L@= E=;@9FAKEK :Q O@A;@ 9 <JM? 9KKG;A9L=K OAL@ ALK L9J?=L LG =P@A:AL 9 H@QKAGDG?A;9D J=KHGFK=  /@= :9KA; ;GF;=HL G> L@= <JM? J=;=HLGJ AFL=J9;LAGFAK<=K;JA:=<:QL@=“lock and key” mechanismAFO@A;@9J=;=HLGJ @9K9;GEHD=E=FL9JQ:AF<AF?KAL=>GJ9<JM?!A?MJ=   $>9<JM?@9KL@= ;GJJ=;LK@9H=LG>ALAFL@=J=;=HLGJ:AF<AF?KAL=L@=F9C=Q<JM?;9FGH=F9 DG;CJ=;=HLGJ . .

(82)  . . !A?MJ=   'G;C9F<C=Q E=;@9FAKEG><JM?9;LAGF 9<JM?AFL=J9;LKOAL@ 9 J=;=HLGJ LG. >GJE9<JM? J=;=HLGJ;GEHD=P .  JM?L9J?=LK /@=J= 9J= K=N=J9D LQH=K G> :G<Q ;GEHGF=FLK L@9L K=JN= 9K <JM? L9J?=LK  /@= term ‘receptor’ is often used to refer to any target molecule with which a drug ;9FAFL=J9;LLG=DA;ALH@9JE9;GDG?A;9D9;LAGF  . . !A?MJ=   -=;=HLGJK  F?AGL=FKAF ;GFN=JLAF? =FRQE=  +@G H@GKH@9L=. LJ9FKHGJL=J  @ -" & ;@9FF=D  / JA;@ ) <=;9E=J AF .  9MJ=MK OAL@ ))S :AK6   L=LJ9@Q<JGHQJAEA<AF  QDH@=FQD7:AH@=FQD S <A;9J:GP9EA<= 9F<  )=MJ9EAFA<9K=AF;JAEKGFJ=<GFL@=KMJ>9;=G>L@=AF>DM=FR9NAJMK $E9?=K9F< 9J=HJG<M;=<>JGE+>AD=K % '9F<0MKAF? <M+QEGD 

(83) $E9?=J=HJ=K=FLK AF @GMK=@GEGDG?QEG<=DG>+@GL@=AE9?=@9K:==FHJG<M;=<MKAF?1(  $E9?=  AKG:L9AF=<>JGE1AJGDG?Q:DG?OAL@L@=CAF<H=JEAKKAGFG>+JG> 1AF;=FL-9;9FA=DDG .  .

(84) /@=<GH9EAF=J=;=HLGJKAFL@=:J9AF9J=J=;=HLGJK>GJ9FLAHKQ;@GLA;<JM?KL@= & ;@9FF=DK 9J= J=;=HLGJK >GJ 9FLA9JJ@QL@EA; <JM?K = ?  <G>=LADA<= L@= 9F?AGL=FKAF ;GFN=JLAF? =FRQE= AK L@= J=;=HLGJ >GJ N9KG<AD9LGJK = ?  ;9HLGHJAD L@= HJGLGF ;GMHD=< H@GKH@9L= LJ9FKHGJL=J E9C=K E9BGJ ;GFLJA:MLAGFLGH@GKH@9L=LJ9FKHGJLAF. ;=J=NAKA9=)K9J=J=;=HLGJK>GJ <A>>=J=FL;GEHGMF<K= ? 9F/ JA;@)<=;9E=JAF. 9MJ=MKAKAF@A:AL=< :Q:AK9EA<AF=;GEHGMF<K9F<KMJ>9;=?DQ;GHJGL=AFDAC=F=MJ9EAFA<9K=AK9 J=;=HLGJ>GJ9FLANAJ9D<JM?K= ? R9F9EANAJ!A?MJ=   .  JM? J=;=HLGJAFL=J9;LAGF (GKL <JM?K @9N= 9 @A?@ KH=;A>A;ALQ LGO9J<K :AF<AF? LG 9 H9JLA;MD9J J=;=HLGJ  GFN=JK=DQ 9 J=;=HLGJ L@9L 9;LK 9K 9 <JM? L9J?=L G>L=F @9K 9 @A?@ <=?J== G> K=D=;LANALQ >GJ 9 H9JLA;MD9J<JM? /@AK ;GEHD=E=FL9JQ KH=;A>A;ALQ G> 9 J=;=HLGJ :AF<AF?KAL=LGO9J<K9DA?9F<<=>AF=KL@=EGD=;MD9JJ=;G?FALAGFHJGH=JLQG>9 J=;=HLGJ ML9FQ;@9F?=AF9J=;=HLGJKM;@9KJ=EGN9DG>GF=GJEGJ=9EAFG 9;A<K ;9F 9DL=J L@= K@9H= G> L@= ;GEHD=E=FL9JQ :AF<AF? KAL= 9F< E9C= L@= J=;=HLGJAF9;LAN=>GJ9H9JLA;MD9JDA?9F< . . !A?MJ=  AKLAF;LAGF:=LO==F9>>AFALQ9F<=>>A;9;Q & 9F<& 9J=J9L=;GFKL9FLK>GJL@= :AF<AF?J=9;LAGFO@AD=Į9F<ȕ9J=J9L=;GFKL9FL>GJL@=J=;=HLGJ9;LAN9LAGFJ=9;LAGF . "=F=J9DDQ L@= :AF<AF? G> 9 DA?9F< LG 9 J=;=HLGJ E9Q D=9< LG 9 H@QKAGDG?A;9D J=KHGFK=L@JGM?@L@=9;LAN9LAGFGJAF9;LAN9LAGFG>L@9LJ=;=HLGJ /@=:AF<AF? of a drug to a receptor is known as ‘<JM?affinity’;O@AD=A>9J=;=HLGJ=DA;ALK9 LAKKM=J=KHGFK=MHGF<JM?:AF<AF?ALis known as ‘<JM?efficacy’!A?MJ=   . .

(85)   F 9?GFAKL AK 9 <JM? L@9L @9K @A?@ 9>>AFALQ 9K O=DD 9K @A?@ =>>A;9;Q 9F< 9F 9FL9?GFAKL@9K@A?@9>>AFALQ:MLR=JG=>>A;9;Q .  GK= J=KHGFK=;MJN= /@= :AF<AF? ;9H9;ALQ G> 9<JM? ;9F := E=9KMJ=<<AJ=;LDQ :ML LG =KLAE9L= ALK :AGDG?A;9DJ=KHGFK=KM;@9K;GFLJ9;LAGFGJJ=D9P9LAGFG>EMK;D=K9;LAN9LAGFGJ AF@A:ALAGF G> =FRQE=K GJ ;@9F?=K AF E=E:J9F= HGL=FLA9D GJ @=9JL J9L= L@= H@9JE9;GDG?AKLKG>L=FF==<LGKLM<QL@=<JM?OAL@L@=@=DHG>9<GK= J=KHGFK= ;MJN=   <GK= J=KHGFK= ;MJN= AK 9 KAEHD= HDGL L@9L J=D9L=K L@= 9EGMFL G> L@= <JM? GJ LGPA;9FL HGDDML9FL KLJ=KKGJ J9<A9LAGF OAL@ L@= :AGDG?A;9D J=KHGFK=  /@=<GK=G>9<JM?AKG>L=FHDGLL=<GFL@=3 9PAKO@AD=ALKJ=KHGFK=AKHDGLL=< GFL@=4 9PAK  .  !A?MJ=  GK= J=KHGFK=;MJN=<=K;JA:AF?L@=L@=J9H=MLA;=>>=;LG>9<JM? .  !A?MJ=  K@GOK9LQHA;9D<GK= J=KHGFK=;MJN=O@A;@AKHDGLL=<GF9K=EA DG?9JAL@EA; K;9D= 9F< AK ;@9J9;L=JAKLA;9DDQ KA?EGA<9D AF K@9H=   ;MJN= AK ;@9J9;L=JAR=<:Q9L@J=K@GD<E9PAE9D=>>=;L9F< KM: E9PAE9D=>>=;L 

(86)   /@=>AJKLHGAFLL@9LAF<A;9L=KL@9L9J=KHGFK=JAK=K9:GN=L@=R=JGJ=KHGFK=D=N=D AKCFGOF9KL@=L@J=K@GD<;GF;=FLJ9LAGF /@=<=KAJ=<J=KHGFK=G>9<JM?;9F := K==F 9K 9:GN= L@= L@J=K@GD< ;GF;=FLJ9LAGF D=N=D  /@= E9PAE9D 9LL9AF9:D= J=KHGFK=G>9<JM?AKCFGOF9KL@=;=ADAF?=>>=;L /@=<JM?;GF;=FLJ9LAGFL@9L =DA;ALK9@9D>O9QJ=KHGFK=:=LO==FL@=:9K=DAF=9F<L@=;=ADAF?=>>=;LAKJ=>=JJ=< LG9KL@= 

(87)  >>=;LAN=GF;=FLJ9LAGF

(88)  . .LJM;LMJ= 9;LANALQJ=D9LAGFK@AHK /@=:AGDG?A;9D=>>=;LAK?GN=JF=<:QL@=KLJM;LMJ=G>9DA?9F< /@=J=D9LAGFK@AH :=LO==F L@= KLJM;LMJ= G> L@= DA?9F<9F< L@= :AGDG?A;9D 9;LANALQ AK L=JE=< 9K9 KLJM;LMJ= 9;LANALQJ=D9LAGFK@AH.- /@=HJ=K=F;=G>C=QKLJM;LMJ9D=D=E=FLK.  .

(89) KM;@ 9K 9 H9JLA;MD9J >MF;LAGF9D ?JGMH >J9?E=FL GJ KM: KLJM;LMJ= AF 9 DA?9F< G>L=FJ=KMDLKAF9;=JL9AFN9JA9LAGFAFALK:AGDG?A;9D9;LANALQ /@=9F9DQKAKG>L@= .- G> 9 D=9< ;GEHGMF< 9F< ALK 9F9DG?K E9Q := MK=< LG <=L=JEAF= L@= KA?FA>A;9F;= G> H9JLK G> L@= KLJM;LMJ= G> L@= D=9< ;GEHGMF< L@9L J=KMDL AF AF;J=9K=<:AGDG?A;9D9;LANALQ9F<J=<M;=<MFO9FL=<KA<==>>=;LK /@=.-;9F :=MK=<LG<=N=DGH9F=O<JM?OAL@AEHJGN=<9;LANALQ9F<D=KKKA<==>>=;LK  .-K 9J= MKM9DDQ AFN=KLA?9L=< :Q ;J=9LAF? KDA?@L EG<A>A;9LAGFK LG L@= D=9< ;GEHGMF< LG HJG<M;= F=O 9F9DG?K 9F< E=9KMJAF? L@= AF>DM=F;= G> L@=K= KLJM;LMJ9D ;@9F?=K GF L@=AJ :AGDG?A;9D9;LANALQ  /@=K= KLJM;LMJ9D ;@9F?=K ;9F :=:JGM?@L9:GMLAF9FME:=JG>O9QK9KHJ=K=FL=<:=DGO .  $FLJG<M;LAGFG>:AG AKGKL=J=K )=O 9F9DG?K LG =PAKLAF? D=9< ;GEHGMF<K ;9F := <=JAN=< :Q J=HD9;AF? 9F =PAKLAF?KLJM;LMJ9DEGA=LQG>9D=9<OAL@9F=OEGA=LQ /@=;@GA;=G>L@=F=O EGA=LQ AK G>L=F :9K=< GF L@= ;GF;=HL G> AKGKL=J=K  $KGKL=J=K 9J= L@= ;@=EA;9D ?JGMHKL@9L=P@A:ALKAEAD9JALQAFKGE=G>L@=AJ;@=EA;9DGJH@QKA;9DHJGH=JLA=K 9K9J=KMDLG>@9NAF?L@=K9E=FME:=JG>LGL9DGJN9D=F;==D=;LJGFKO@AD=FGL F=;=KK9JADQ@9NAF?L@=K9E=FME:=JG>9LGEK!A?MJ=   $LAK=PH=;L=<L@9L :=AF?KAEAD9JAFKLJM;LMJ=KAKGKL=J=KG>L=F=P@A:ALKAEAD9JH@9JE9;GCAF=LA;9F< H@9JE9;G<QF9EA;HJGH=JLA=K@GO=N=JAKGKL=JA;9F9DG?K<GFGLK@GO9KAEAD9J LQH=G>9;LANALQAF=N=JQ;9K= .  !A?MJ=   P9EHD=KG>AKGKL=J=KO@=J==9;@JGOJ=HJ=K=FLKAKGKL=JA;?JGMHK . .

(90)   .  (G<A>A;9LAGFG>KAR=9F<K@9H= /@=KLJM;LMJ9D;@9F?=KAF9D=9<;GEHGMF<;9F:=:JGM?@L9:GMLL@JGM?@L@= EG<A>A;9LAGF G> KAR= 9F< K@9H= G> L@= D=9<  !GJ =P9EHD= L@AK ;9F := 9;;GEHDAK@=< :Q 9DL=JAF? L@= ;@9AF D=F?L@ GJ KAR= G> L@= JAF? ;@9F?AF? L@= FME:=JG><GM:D=9F<LJAHD=:GF<K9F<9<<AF?GJJ=EGNAF?JAF?EGA=LA=K .  $FLJG<M;LAGFG>F=OKM:KLALM=FLK  F=O KM:KLALM=FL ;9F := AFLJG<M;=< AF GJ<=J LG J=HD9;= HJ=NAGMKDQ =PAKLAF? KM:KLALM=FLKGJLGG;;MHQHJ=NAGMKDQMFKM:KLALML=<HGKALAGFKAF9D=9< /@JGM?@ L@= AF;GJHGJ9LAGF G> 9 F=O KM:KLALM=FL L@= H@9JE9;GCAF=LA; 9F< H@9JE9;G<QF9EA; HJGH=JLA=K G> L@= D=9< ;GEHGMF< ;9F := AEHJGN=<  /@= ;@GA;=G>KM:KLALM=FLAKE9AFDQ<JAN=F:Q L@=G:B=;LAN=G>=F@9F;AF?9;=JL9AF HJGH=JLQ G> 9F 9F9DG? GN=J 9 D=9< ;GEHGMF< KM;@ 9K AEHJGNAF? KGDM:ADALQ =F@9F;AF? H=JE=9LAGF 9;JGKK ;=DD E=E:J9F=K GJ J=<M;AF? L@= J9L= G> E=L9:GDAKE . ,M9FLAL9LAN=KLJM;LMJ= 9;LANALQJ=D9LAGFK@AHK K<AK;MKK=<HJ=NAGMKDQKLJM;LMJ9DN9JA9LAGFAF>DM=F;=KL@=:AGDG?A;9D9;LANALQG> 9H9JLA;MD9J;D9KKG>;@=EA;9DKO@A;@HJGNA<=K9H9L@LG<AK;GN=JL@=KLJM;LMJ= 9;LANALQ J=D9LAGFK@AHK  .LM<QAF? L@=K= J=D9LAGFK@AHK KL9LAKLA;9DDQ 9F< ;GEHML9LAGF9DDQ @9K H9N=< L@= H9L@ >GJ L@= =KL9:DAK@E=FL G> ;GEHML9LAGF9D HJ=<A;LAN=L=;@FAIM=K,.-AKGF=KM;@L=;@FAIM=  ,.-AK9E9L@=E9LA;9DJ=D9LAGFK@AH:=LO==F L@=:AGDG?A;9D9;LANALQ9F<L@= H@QKA;G;@=EA;9D H9J9E=L=JK AF L@= >GJE G> 9F =IM9LAGF  "=F=J9DDQ L@=K= H9J9E=L=JK 9J= J=HJ=K=FL9LAN= G> HJGH=JLA=K KM;@ 9K DAHGH@ADA;ALQ =D=;LJGFA; =>>=;LKKL=JA;=>>=;LK9F<;@=EA;9D;GEHGKALAGFK /@=K=HJGH=JLA=K9J=<=>AF=< AF L@= >GJE G> FME=JA;9D <9L9 9DKG CFGOF 9K <=K;JAHLGJK L@9L 9J= G:L9AF=< =PH=JAE=FL9DDQ GJ ;9F := ;9D;MD9L=< MKAF? ;GEHML=J HJG?J9EK  !GJ 9 K=L G> ;GEHGMF<KKM;@HJGH=JLA=K;9F:=E=9KMJ=<GJ;9D;MD9L=<9F<;9F:=J=D9L=< OAL@ L@=AJ :AGDG?A;9D 9;LANALQ :Q E=9FK G> E9L@=E9LA;9D =IM9LAGFK MKAF? <A>>=J=FLKL9LAKLA;9DE=L@G<KKM;@9KJ=?J=KKAGF9F9DQKAK .  JA=>@AKLGJQ $F JME JGOF9F<!J9K=JHJGHGK=<L@9LL@=KLJM;LMJ9DEG<A>A;9LAGFAF 9K=JA=KG>HGAKGFKD=<LGKA?FA>A;9FL<A>>=J=F;=KAFL@=AJ9;LAGFK 9K=<GFL@AK L@=QHGKLMD9L=<L@9LL@=H@QKAGDG?A;9D9;LAGFG>9EGD=;MD=AK9>MF;LAGFG>ALK ;@=EA;9D;GFKLALMLAGF9F<;9F:==PHJ=KK=<9K +@QKAGDG?A;9D9;LAGF>MF;LAGF;@=EA;9D;GFKLALMLAGF.   .  .

(91) #9FK;@ =L 9D  AF  HM:DAK@=< 9 KLM<Q G> L@= AF>DM=F;= G> L@= #9EE=LL ;GFKL9FL 9F< @Q<JGH@G:A;ALQ GF L@= KLJM;LMJ= 9;LANALQ J=D9LAGFK@AH G> HD9FL ?JGOL@J=?MD9LGJK  'G? 9DG?+:DG?+;ı+ …+ k .   . 2@=J= ;GF;=FLJ9LAGFL@9LHJG<M;=K:AGDG?A;9D=>>=;L +G;L9FGD O9L=J;G=>>A;A=FL 9:;;G=>>A;A=FLK C4 AFL=J;=HL ı=D=;LJGFA;#9EE=LL;GFKL9FL /@=!J== 2ADKGF9HHJG9;@HJGHGK=<L@9LFGH@QKA;G;@=EA;9DH9J9E=L=JK9J= J=IMAJ=<>GJL@=KLJM;LMJ=9;LANALQJ=D9LAGFK@AHAFKL=9<L@=;GFLJA:MLAGFG>=9;@ KLJM;LMJ9D >=9LMJ= O9K G> AFL=J=KL 

(92)  /@= KAEHD=KL =IM9LAGF >GJ 9 :AGDG?A;9D J=KHGFK=->GDDGOK . log BR = ∑(substituent contributions) + contribution from base molecule      . MJJ=FLKL9L=G>,.-9F<J=D9L=<L=;@FAIM=K /G<9QSK,.-EG<=DK9J=EM;@EGJ=9<N9F;=<9KL@=Q9J=9M?E=FL=<OAL@ <A>>=J=FL ?J9H@A;9D 9HHJG9;@=K @A?@ <AE=FKAGF9D <=K;JAHLGJK 9F< KLM<A=< AF L9F<=E OAL@ 9<N9F;=< E9L@=E9LA;9D =IM9LAGFK 9F< @A?@ ;GEHMLAF? J=KGMJ;=K  .   NGDMLAGFG>EGD=;MD9J<=K;JAHLGJK /@= <=K;JAHLGJK J=HJ=K=FL KLJM;LMJ=K G> EGD=;MD=K 9F< L@=AJ H@QKA;G;@=EA;9D HJGH=JLA=K  /@= EGD=;MD9J <=K;JAHLAGF G> 9 ;GEHGMF< AK =PHD9AF=< :Q ;GFKLALMLAGF9D <=K;JAHLGJK  /@=  <=K;JAHLGJK 9J= <=JAN=< :Q ;GMFLAF? KLJM;LMJ9D >J9?E=FLK KM;@ 9K L@= FME:=J G> HJAE9JQ 9F< K=;GF<9JQ ;9J:GF 9LGEK9EAF=9F<FALJG?JGMHK=L; /@=EGJ=9<N9F;=<<=K;JAHLGJKDAC=  9F< J=HJ=K=FLLGHGDG?A;9D?=GE=LJA;9DKL=JAG;@=EA;9D9F<GL@=JCAF<G> AF>GJE9LAGF 9:GML ;GEHGMF<K  /GHGDG?A;9D <=K;JAHLGJK =PHD9AF L@= :GF<AF? ;GDD=;LAGFAF9EGD=;MD= "=GE=LJA;9D<=K;JAHLGJKJ=N=9DAF>GJE9LAGFJ=D9L=<LG L@= K@9H= 9F< KAR= G> ;GEHGMF<K O@AD= =D=;LJGKL9LA; <=K;JAHLGJK =PHD9AF AF>GJE9LAGF 9:GML = ?  EGD=;MD9J ;@9J?= HGD9JAR9:ADALQ 9F< LGHGDG?A;9D HGD9J KMJ>9;= 9J=9  N=F EGJ= ;GEHD=P 9F< ;GEHML9LAGF9DDQ =PH=FKAN= <=K;JAHLGJK KM;@ 9K IM9FLME ;@=EA;9D <=K;JAHLGJK <=K;JA:= AF>GJE9LAGF J=D9L=< LG L@= =D=;LJGFA;KLJM;LMJ= -=;=FLDQ 9F< <=K;JAHLGJK@9N=:==F<=N=DGH=<  /@=K= <=K;JAHLGJK @9N= ;GFKA<=J=< 9F =FK=E:D= G> ;GF>GJE9LAGFK >GJ =9;@ DA?9F< 9F< AF<M;=< >AL H@=FGE=F9 L@9L L9C= HD9;= 9>L=J DA?9F< J=;=HLGJ AFL=J9;LAGFJ=KH=;LAN=DQ . 

(93) .

(94)  . 

(95)  90&3/'3 )&  $". #& $,"33*'*&% #"3&% /. 6"2*/53 '"$4/23 35$) "3 4)& -"4)&-"4*$",4&$).*15&53&%&(2&(2&33*/.".%$,"33*'*$"4*/. 4)&02/0&249 4/ #& *.6&34*("4&% &( 15".4*4"4*6& 3425$452&02/0&249 2&,"4*/.3)*0 (2/50 #"3&%  '2"(-&.4#"3&%  ".% 0)"2-"$/0)/2&3*-*,"2*49#"3&%  ".% 490& /' %&3$2*04/23 53&% &(    ".%-5,4*%*-&.3*/.",. 

(96)  "4&344&$).*15&3'/2%&6&,/0*.( "2*/53 4&$).*15&3 "2& 53&% '/2 4)& $/.3425$4*/. /' 3 /--/. &8"-0,&3 /' 2&(2&33*/. ".",93*3 *.$,5%& 0"24*", ,&"34 315"2&  -5,4*0,& ,*.&"22&(2&33*/. +.&"2&34.&*()#/2+2&(2&33*/.".%02*.$*0", $/-0/.&.4 2&(2&33*/.  && 3&$4*/.   /--/. &8"-0,&3 /' $,"33*'*$"4*/. -&4)/%3 "2& 4)& +.&"2&34 .&*()#/2 + $,"33*'*$"4*/. %*3$2*-*.".4 ".",93*3 ".% $,"33*'*$"4*/. 42&&3 )& -/34 $/--/.,9 53&%   4&$).*15&3 "2& $/-0"2"4*6& -/,&$5,"2 '*&,% ".",93*3  ".% $/-0"2"4*6& -/,&$5,"2 3*-*,"2*49 *.%&8 ".",93*3  . 4)& .&7,9 &-&2(*.( '*&,% /' -5,4*%*-&.3*/.",  -5,4*0,& $/.'/2-"4*/.3 /' 4)& ,*(".% *.%5$&% '*4 &''&$43 /' 4)& ,*(".%2&$&04/2 $/-0,&8 ".% 4)& 3/,6"4*/. &.&2(9 /' 4)& ,*(".%2&$&04/2 *.4&2"$4*/. )"6& #&&. *.$/20/2"4&% 4/ $,/3&,9 -*-*$ #*/,/(*$", 02/$&33&3 80&24 3934&-3 $/-0/3&% /' " (2/50 /' %*6&23& -/%&,3 $/-#*.&% 7*4) %*''&2&.4 %"4"#"3&3 &( 4)&   -/%&, %"4"#"3&)"6&#&&.2&$&.4,9%&6&,/0&% . 

(97)  "4&34"00,*$"4*/.3/'3 )&2& "2& " ,"2(& .5-#&2 /' "00,*$"4*/.3 /'  -/%&,3 7*4)*. *.%53429 "$"%&-*"".%(/6&2.-&.4",".%2&(5,"4/29"(&.$*&3

(98)  '&7/'4)&53&3"2& ,*34&%#&,/7 ¾ 2&%*$4*/. /' $)&-*$", 02/0&24*&3 &( -&,4*.( 0/*.4 #/*,*.( 0/*.4,/( ¾ 2&%*$4*/. /' #*/,/(*$", "$4*6*49 &( *.)*#*4/29 $/.$&.42"4*/. /' %25(

(99) 

(100) 

(101) -54"(&.*$*49 ¾ 2&%*$4*/./'02/0&24*&3/'.&7$)&-*$",&.4*4*&3*.%25( %*3$/6&29 ¾ *3+-"."(&-&.4/'$)&-*$",3#92&(5,"4/29"54)/2*4*&3 ¾ 2&%*$4*/./'4)&&.6*2/.-&.4",#&)"6*/52/'/2(".*$0/,,54".43 ¾ 2&%*$4*/./'4)&$)2/-"4/(2"0)*$2&4&.4*/./'"$)&-*$", ¾ 33&33-&.4/'4)&."./4/8*$*49/'."./-"4&2*",3. 

(102) 

(103) )&-*$",$"4&(/29"002/"$) )& 52/0&". )&-*$", (&.$9  (5*%".$& %/$5-&.4 /. ./. ".*-",4&34*.( "002/"$)&3 )"3 &.%/23&% 4&$).*15&3 35$) "3 $)&-*$", (2/50*.(2&"%"$2/33".%7&*()4/'&6*%&.$&!",/.(7*4)3'/2.  .

(104) =PL=F<AF? 9F< =D9:GJ9LAF? L@= =PAKLAF? AF>GJE9LAGF KG 9K LG AEHJGN= F=O L=KLAF? KLJ9L=?A=K >GJ HJ=<A;LAF? @ME9F @9R9J<K 9F< 9KK=KKAF? =FNAJGFE=FL9D JAKCK  ;@=EA;9D;9L=?GJQAK<=>AF=<9K9?JGMHG>;@=EA;9DKO@GK=LGPA;GDG?A;9DGJ H@QKA;G;@=EA;9D HJGH=JLA=K 9J= DAC=DQ LG := KAEAD9J :=;9MK= G> L@=AJ KLJM;LMJ9D KAEAD9JALQ The ‘similarity’ rationale;9F:=<=K;JA:=<9K;@=EA;9DKL@9L@9N= ¾ 9;GEEGF>MF;LAGF9D?JGMHK= ? =HGPA<=9D<=@Q<==L;  ¾ AF;J=E=FL9DGJ<=;J=E=FL9D;@9F?=KAFKLJM;LMJ== ? ;@9AFD=F?L@ ¾ ;GEEGF;@=EA;9D;D9KK=KGJ;GFKLALM=FLK ¾ 9;GEEGFHJ=;MJKGJGJE=L9:GDAL= 9L=?GJA=KG>;@=EA;9DK9J=G>L=F=KL9:DAK@=<OAL@L@=9KKMEHLAGFL@9L9K=JA=K G> ;@=EA;9DK OAL@ ;GEEGF KAEAD9J KLJM;LMJ9D >=9LMJ=K OADD =P@A:AL ;G@=J=FL LJ=F<KAFL@=AJH@QKA;G ;@=EA;9DHJGH=JLA=K9F<L@=J=:QAFL@=AJLGPA;GDG?A;9D =>>=;LK9F< GJ=FNAJGFE=FL9D>9L=HJGH=JLA=K /@=K=KG ;9DD=<;G@=J=FLLJ=F<K AF L@=AJ :=@9NAGMJ 9J= ?=F=J9DDQ 9KKG;A9L=< OAL@ 9 ;GEEGF MF<=JDQAF? E=;@9FAKEG>9;LAGF .  9L9?9H>ADDAF?9F<-=9< 9;JGKK -=9< 9;JGKKAK9L=;@FAIM=L@9LHJ=<A;LK=F<HGAFLAF>GJE9LAGFA = :AGDG?A;9D 9;LANALQ GJ ;@=EA;9D HJGH=JLQ G> 9 ;@=EA;9D :9K=< GF <9L9 >JGE L@= K9E= =F<HGAFL >JGE 9FGL@=J ;@=EA;9D L@9L AK KAEAD9J AF KGE= 9KH=;LK DAC= 9 KLJM;LMJ9DKAEAD9JALQ9F< GJKAEAD9JHJGH=JLA=K . . !A?MJ=   9L9 ?9H >ADDAF? MKAF? J=9< 9;JGKK AFL=JHGD9LAGF GJ =PLJ9HGD9LAGF >JGE GF= L=KL=<;@=EA;9DLG9FMFL=KL=<;@=EA;9D . !A?MJ=   AK 9F =P9EHD= G> 9 <9L9 ?9H >ADDAF? >GJ L@= DG?+ HJGH=JLQ G> 9 ;@DGJG 9DC9F= OAL@AF L@= <=>AF=< ;9L=?GJQ G> ;@DGJG 9DC9F=K  /@=J= AK 9F AF;J=E=FL9D;@9F?=AFL@=;@9AFD=F?L@G>E=E:=JKO@A;@AK9KKG;A9L=<OAL@ 9F AF;J=E=FL9D ;@9F?= AF L@=AJ DG?+  /@= J=9< 9;JGKK L=;@FAIM= ;9F := =>>=;LAN=DQMK=<@=J=>GJL@=<9L9?9H>ADDAF?  $FL=JHGD9LAGFAKL@==KLAE9LAGFG>9F=F<HGAFL>GJ9;@=EA;9DMKAF?E=9KMJ=< N9DM=K >JGE GL@=J E=E:=JK GF :GL@ KA<=K G> L@9L ;@=EA;9D OAL@AF L@= ?AN=F. .

(105)   ;9L=?GJQ J9F?=  PLJ9HGD9LAGF J=>=JK LG L@= =KLAE9LAGF G> 9F =F<HGAFL >GJ 9 ;@=EA;9DL@9LAKF=9JGJ9LL@=:GMF<9JQG>9?AN=F;9L=?GJQ:QMKAF?E=9KMJ=< N9DM=K>JGEAFL=JF9D;9L=?GJQE=E:=JK . .L9LAKLA;9DE=L@G<K9F<LGGDK 19JAGMK EMDLAN9JA9L= 9F9DQKAK E=L@G<K 9J= MK=< >GJ =KL9:DAK@AF? KLJM;LMJ= >MF;LAGF J=D9LAGFK@AHK  /@=J= 9J= LOG LQH=K G> EMDLAN9JA9L= 9F9DQKAK 9N9AD9:D= KMH=JNAK=<('-C ))=L; 9F<MFKMH=JNAK=<+<=;AKAGFLJ===L;  .  (MDLAHD=DAF=9JJ=?J=KKAGF('- J=?J=KKAGF9F9DQKAK;GJJ=D9L=KAF<=H=F<=FLN9JA9:D=K3OAL@L@=<=H=F<=FL N9JA9:D=4O@=J=L@=linear relationship is established between ‘m’ number of 3K 9F< L@= :AGDG?A;9D 9;LANALQ <9L9 4 L@JGM?@ 9 DAF=9J =IM9LAGF  .M;@ 9 J=D9LAGFK@AH;9F:==PHJ=KK=<OAL@9EMDLAHD= DAF=9J=IM9LAGF 4:

(106) : 3 :3+ …+ bE3E. .  . . 2@=J= :

(107) 4AFL=J;=HL : =KLAE9L=<KDGH=G>9J=?J=KKAGFG>4GF3  /@=('-L=;@FAIM=AK?=F=J9DDQMK=<LGA<=FLA>QL@=:=KLK=LG>AF<=H=F<=FL N9JA9:D=K KG L@9L 9 <=H=F<=FL N9JA9:D= ;9F := HJ=<A;L=< 9K 9;;MJ9L=DQ 9K HGKKA:D=  /@= :=KL AF<=H=F<=FL N9JA9:D=K ;9F := A<=FLA>A=< OAL@ L@= @=DH G> KL9LAKLA;9D9HHJG9;@=KKM;@9K:9;CO9J<=DAEAF9LAGFGJKL=HOAK=K=D=;LAGF 

(108)  /@= :9;CO9J< =DAEAF9LAGF 9HHJG9;@ J=EGN=K L@= D=9KL KA?FA>A;9FL N9JA9:D=K >JGE 9 EG<=D 9F< J=>ALK L@= EG<=D  /@AK HJG;=<MJ= ;GFLAFM=K MFLAD L@= ‘stopping criteria’ 9J=E=LKM;@9KL@=@A?@=KLJGJDGO=KL=JJGJO@=J=9KL@= KL=HOAK=K=D=;LAGF:=?AFKOAL@GML9FQN9JA9:D=KAF9EG<=D9F<L@=F;GFKLJM;LK L@=EG<=D9<<AF?F=ON9JA9:D=KMFLADFG>MJL@=JN9JA9:D=K9J=KA?FA>A;9FL $LAK HGKKA:D= LG =P;DM<= 9 N9JA9:D=K L@9L AK D=KK KA?FA>A;9FL L@9F L@= N9JA9:D=K 9<<=<D9L=J  .  +JAF;AH9D;GEHGF=FL9F9DQKAK+ +JAF;AH9D ;GEHGF=FL 9F9DQKAK AK 9 E=L@G< >GJ ;GFN=JLAF? HGKKA:DQ ;GJJ=D9L=< N9JA9:D=K AFLG 9 K=L G> MF;GJJ=D9L=< N9JA9:D=K :Q E=9FK G> GJL@G?GF9D LJ9FK>GJE9LAGF /@=:9KA;A<=9:=@AF<+AKLGJ=<M;=L@=<AE=FKAGF9DALQG> <9L9L@9L;GFL9AFK9D9J?=FME:=JG>AFL=JJ=D9L=<N9JA9:D=KO@AD=J=L9AFAF?9K EM;@ N9JA9LAGF 9K HGKKA:D= AF L@= <9L9  /@AK AK 9;@A=N=< :Q LJ9FK>GJEAF? L@= <9L9 AFLG 9 F=O K=L G> N9JA9:D=K CFGOF 9K HJAF;AH9D ;GEHGF=FLK +K  /@= +K 9J= MF;GJJ=D9L=< 9F< 9J= AF GJ<=J A =  L@= >AJKL + ;GFKAKLK G> @A?@=KL AF>GJE9LAGF =PHD9AF=<  N9JA9F;= L@9F L@= K=;GF< + 9F< KG GF !A?MJ=.   .  .

(109) /@=+O9KMF<=JL9C=FL@JGM?@9F9DQKAF?G>K;GJ=9F<DG9<AF?HDGLK /@= K;GJ= HDGL =F9:D=K AFL=JHJ=L9LAGF G> J=D9LAGFK@AHK 9EGF? L@= K9EHD=K  $> L@= K9EHD=K9J=;DGK=L@=FL@=Q9J=KAEAD9JLG=9;@GL@=J9F<NA;= N=JK9 *FL@= GL@=J@9F<L@=DG9<AF?KHDGL=F9:D=KAFL=JHJ=L9LAGFG>L@=J=D9LAGFK@AHK9EGF? N9JA9:D=K  /@= K9EHD=K HD9;=< GF L@= JA?@L KA<= G> L@= K;GJ= HDGL 9J= ;@9J9;L=JAR=<:Q@9NAF?@A?@N9DM=KG>N9JA9:D=KHJGB=;L=<GFL@=JA?@LKA<=G> L@=DG9<AF?KHDGL9F<NA;= N=JK9 .  !A?MJ=   /@= J=HJ=K=FL9LAGF G> HJAF;AH9D ;GEHGF=FLK G> + AF L@J== <AE=FKAGF9D. KH9;=  +  AK 9 N=;LGJ L@9L :=KL >ALK L@= <9L9 AF L@= <AJ=;LAGF G> E9PAEME N9JA9F;= O@=J=9K +  HJGB=;LK H=JH=F<A;MD9JDQ LG +  AF L@= <AJ=;LAGF G> L@= K=;GF< E9PAEME N9JA9F;=AFL@=<9L9K=L .  C F=9J=KLF=A?@:GJC )).   9;C?JGMF<. C ))AK9FGF H9J9E=LJA;E=L@G<MK=<>GJJ=?J=KKAGF9F<;D9KKA>A;9LAGF $LAK GF= G> L@= KAEHD=KL 9F< >9KL=KL AFKL9F;= :9K=< D=9JFAF? L=;@FAIM=K 9F< AK 9EGF?L@=>AJKL;@GA;=K>GJ9;D9KKA>A;9LAGFEG<=D<=N=DGHE=FLO@=FL@=J=AK DALLD=GJFGHJAGJAF>GJE9LAGFJ=?9J<AF?L@=<AKLJA:MLAGFG>L@=<9L9 /@=C )) 9D?GJAL@EO9KGJA?AF9DDQHJGHGK=<:Q!AP9F<#G<?=KAF  .   =>AFALAGFK9F<L@=GJQ  C ))EG<=DAK<=>AF=<:Q9K=LG>K9EHD=K>GJO@A;@J=KHGFK=N9JA9:D=K9J= CFGOF  9;@ K9EHD= >JGE 9 <9L9K=L ;GFKAKLK G> AF<=H=F<=FL 9F< <=H=F<=FL N9JA9:D=KL@9L9J=;GFLAFMGMKGJ;9L=?GJA;9DAFF9LMJ= /@=EG<=DK;GFKLJM;L=< >GJ;GFLAFMGMK9F<;9L=?GJA;9DLQH=KG><=H=F<=FLN9JA9:D=K9J=J=>=JJ=<LG9K C ))J=?J=KKAGF9F<C ));D9KKA>A;9LAGFJ=KH=;LAN=DQ  KLAE9LAF?L@=GML;GE=>GJ9?AN=FMFCFGOFK9EHD=IM=JQ;9F:=9;@A=N=< :Q ;9D;MD9LAF? L@= <AKL9F;= E9LJAP 9F< L@=J=:Q >AF<AF? C K9EHD=K L@9L 9J= ;DGK=KL AF <AKL9F;= LG L@= IM=JQ K9EHD=  $F L@= C )) J=?J=KKAGF EG<=D L@= GML;GE=>GJ9IM=JQK9EHD=;9F:=>GMF<:Q9N=J9?AF?L@=GML;GE=KG>ALK C F=A?@:GJKO@AD= AFL@=C ));D9KKA>A;9LAGF:Q9KKA?FAF?L@=;9L=?GJQG> L@= E9BGJALQG>ALKC F=A?@:GJK!A?MJ=   . .

(110)  . . !A?MJ=  $>9<AKL9F;=E9LJAP>GJ9?AN=FK=LG>>AN=K9EHD=K9F<9IM=JQAK9K9:GN=L@=F. >GJCL@=HJ=<A;L=<;D9KKG>L@=IM=JQOADD:=L@=;D9KKG>E9BGJALQG>ALKL@J==F=A?@:GJK A = ;D9KK O@AD=L@=HJ=<A;L=<=F<HGAFLN9DM=OADD:=L@=9N=J9?=G>=F<HGAFLN9DM=KG>9DD ALKL@J==F=A?@:GJK . !GJ9?AN=FIM=JQHGAFLL@= C ))9D?GJAL@EE9C=KHJ=<A;LAGFK:9K=<GFL@= GML;GE= G> L@= C F=A?@:GJK F=9J=KL LG L@9L HGAFL  /@MK LG H=J>GJE KM;@ 9 HJ=<A;LAGF9E=LJA;>GJ;9D;MD9LAF?L@=<AKL9F;=:=LO==FL@=IM=JQHGAFL9F< LJ9AFAF? <9L9K=L K9EHD=K F==<K LG := <=>AF=<  /OG O=DD CFGOF ;@GA;=K LG E=9KMJ= L@AK <AKL9F;= 9J= M;DA<=9F 9F< (9F@9LL9F  /@= <AKL9F;= E9LJAP ;9D;MD9LAGFK:QL@=K=E=L@G<K;9F:=H=J>GJE=<MKAF?>GDDGOAF?>GJEMD9K  ‫ ݁ܿ݊ܽݐݏ݈݅݀݊ܽ݁݀݅ܿݑܧ‬ൌ ටσ௞௜ୀଵሺ‫ݔ‬௜ െ ‫ݕ‬௜ ሻଶ        . ‫ ݁ܿ݊ܽݐݏ݅݀݊ܽݐݐ݄ܽ݊ܽܯ‬ൌ σ௞௜ୀଵȁ‫ݔ‬௜ െ ‫ݕ‬௜ ȁ.        /G <=L=JEAF= L@= GHLAE9D C N9DM= A =  L@= GHLAE9D FME:=J G> F=A?@:GJK K=N=J9DE=L@G<K;9F:=MK=<KM;@9K9JAKC>MF;LAGF=EHAJA;9DJMD=K9F<;JGKK N9DA<9LAGF /@=CN9DM=L@9L?AN=KL@=GHLAE9DEG<=DKL9LAKLA;KAK;@GK=F>GJL@= ;GFKLJM;LAGFG>9EG<=D  .   <N9FL9?=K9F<<AK9<N9FL9?=K /@= E9AF 9<N9FL9?= G> L@= C )) E=L@G< AK L@9L AL AK L@= KAEHD=KL G> 9DD E9;@AF= D=9JFAF? L=;@FAIM=K  $F 9<<ALAGF AL AK 9 KGH@AKLA;9L=< ;D9KKA>A;9LAGF L=;@FAIM=KAF;=ALAK9FGF DAF=9JFGF H9J9E=LJA;9HHJG9;@9F<AL;9F@9F<D= EMDLA ;D9KK <9L9K=L  /@= C )) AK 9 K=FKALAN= E=L@G< AF ALK 9HHDA;9LAGFK LG <AKL9F;=E9LJA;=K9F<K;9DAF?L=;@FAIM=K <AK9<N9FL9?=G>L@= C ))E=L@G< AKL@9LAL;9F:=9>>=;L=<:QL@=DG;9DKLJM;LMJ=G>L@=<9L9K=L  . .  .

(111)   .A?FA>A;9F;= /@=E9AFKA?FA>A;9F;=G>L@AKL=;@FAIM=AKL@9LALAKFGF H9J9E=LJA;E=9FAF? L@9L AL <G=K FGL >GJE 9FQ 9KKMEHLAGFK GF L@= MF<=JDQAF? <9L9 <AKLJA:MLAGF  /@AK AK AEHGJL9FL 9K AF EGKL G> L@= ;9K=K LJ9AFAF? <9L9 <G=K FGL >GDDGO L@= ;GEEGF L@=GJ=LA;9D 9KKMEHLAGFK = ?  DAF=9JDQ K=H9J9:D=  )GF H9J9E=LJA; 9D?GJAL@EK DAC= C )) 9J= N=JQ MK=>MD AF KM;@ ;9K=K  FGL@=J KA?FA>A;9F;= AK L@9LALAK9D9RQ9D?GJAL@EO@A;@E9C=K<=;AKAGFK:9K=<GFL@==FLAJ=LJ9AFAF? <9L9K=LA = AL<G=KFGL<AK;9J<9FQAF>GJE9LAGF>JGEL@=LJ9AFAF?<9L9 .   HHDA;9LAGFK /@=C ))9D?GJAL@E@9K:==FMK=<AF9FME:=JG>>A=D<K KM;@9K>GJ ¾ AGDG?A;9D LGPA;GDG?A;9D 9;LANALQ HJ=<A;LAGF G> :AG9;LAN= ;GEHGMF<K 9F<LGPA;9FLK  ¾ "=F==PHJ=KKAGF<9L9KLM<A=K  ¾ +JGL=AF HJGL=AFAFL=J9;LAGFKLM<A=K  ¾ +J=<A;LAF?KLJM;LMJ=KG>HJGL=AFK  . .

(112)  . *:B=;LAN=K /@= G:B=;LAN= G> L@AK L@=KAK O9K LG <=N=DGH HJ=<A;LAN= EG<=DK >GJ <A>>=J=FL LQH=K G> LGPA;ALA=K MKAF? ;GEHML9LAGF9D E=L@G<K  /@= OGJC AFNGDN=< L@= ;GFKLJM;LAGF G> ,.- EG<=DK >GJ 9;ML= '

(113)  J=H=9L=< <GK= '* ' 9F< @ -" <=JAN=<LGPA;ALA=K   $F H9H=J $ L@= G:B=;LAN= O9K LG AFN=KLA?9L= L@= HGKKA:ADALQ G> =KL9:DAK@AF? 9 ?DG:9D,.-EG<=D>GJL@=9;ML=LGPA;ALQ=F<HGAFL'

(114) :Q9HHDQAF?L@=C )) L=;@FAIM= MKAF? L@= (MFJG <9L9:9K= L@9L J=HJ=K=FLK 9 :JG9< K=L G> ;@=EA;9DKL@9L9J=<AN=JK=AFL=JEKG>L@=AJKLJM;LMJ=K9F<(*K   .=N=J9D (*K ;GFLJA:ML= LGO9J<K L@= 9;ML= LGPA;ALQ =F<HGAFL '

(115)  9F< KAEAD9JDQ E9FQ =>>=;LK 9J= KLM<A=< AF GJ<=J LG <=JAN= L@= ;@JGFA; LGPA;ALQ =F<HGAFL '* '  GL@ L@=K= =F<HGAFLK 9J= AF>DM=F;=< :Q EMDLAHD= LGPA;ALQ E=;@9FAKEK9F<9J==PHJ=KK=<AFL@=K9E=MFAL A = EADDA?J9EG>;@=EA;9DH=J CADG?J9EG>:G<QO=A?@LE? C? /@=J=>GJ=AFH9H=J$$L@=G:B=;LAN=O9KLG <=L=JEAF=O@=L@=JALAKHGKKA:D=LGHJ=<A;LL@=;@JGFA;LGPA;ALQ'* 'MKAF? 9;ML=LGPA;ALQ'

(116) <9L9:QE=9FKG>C ));D9KKA>A;9LAGF9F<L@=J=9< 9;JGKK 9HHJG9;@   $F H9H=J $$$ L@= G:B=;LAN= O9K LG AFN=KLA?9L= L@= HGKKA:ADALQ G> <=N=DGHAF? 9 ,.-EG<=D:Q9HHDQAF?L@=C ))9HHJG9;@LGL@=KLJM;LMJ9DDQ<AN=JK=K=LG> !=FA;@=D 9F< *;@=E <9L9:9K= ;GEHGMF<K L@9L =P@A:ALK 9 MFAIM= (* >GJ @ -"&;@9FF=D:DG;C9<=     .  .

(117) #+/ - 0/ /*3$$/4 +- $/$*)0.$)",.- / #)$,0  (ADDAGFK G> F=O ;@=EA;9D KM:KL9F;=K 9J= KQFL@=KAR=< 9FFM9DDQ AF L@= <JM? <AK;GN=JQ HJG;=KK=K G> L@= world’s E9FQ H@9JE9;=MLA;9D ;GEH9FA=K  /@= HJ=;DAFA;9DL=KLAF?G>L@=K=KM:KL9F;=K<=E9F<KL@=MK=G>L@GMK9F<KG>9FAE9DK >GJ L@= HMJHGK= G> K9>=LQ 9F< =>>A;9;Q =N9DM9LAGFK  (GJ=GN=J L@= ?DG:9D HJG<M;LAGF G> ;@=EA;9DK :Q = ?  9?JA;MDLMJ9D >=JLADAR=J ;@=EA;9D ;=E=FL E=L9D:AGL=;@FGDG?A;9DL=PLAD=9F<HGDQE=JAF<MKLJA=K9EGMFLKLG:ADDAGFKG> LGFK  ;;GJ<AF? LG 9 KLM<Q 9L 0FAN=JKALQ G> '=A;=KL=J 0& LGPA;GDG?A;9D =N9DM9LAGF G> L@=K= ;@=EA;9DK OGMD< F==< 9<<ALAGF9DDQ 9:GML  EADDAGF 9FAE9DK>GJL=KLAF?HMJHGK=K 

(118)  !MJL@=JEGJ=G>L@=;@=EA;9DKL@9L=PAKL AFL@=E9JC=LLG<9Q9J=FGLKM:B=;L=<LGK9>=LQ=N9DM9LAGFL=KLAF?9F<L=KLAF? G> L@=K= ;@=EA;9DK OGMD< F==< 9 D9J?= 9FAE9D J=KGMJ;=K LAE= =>>GJLK 9F< ;GKL 

(119)  /GK9N=9FAE9DK9F<KH==<MHL@=LGPA;GDG?A;9D=N9DM9LAGFL@=J=AK9F==<>GJ L@=<=N=DGHE=FLG>F=OFGF L=KLAF?E=L@G<K9F<EG<=DK .=N=J9DFGF L=KLAF? E=L@G<K 9J= 9N9AD9:D= 9F< 9EGF? L@=E ;GEHML9LAGF9D E=L@G<K 9J= L@= IMA;C=KL 9F< ;@=9H=KL  /@= ,.- L=;@FAIM= AK GF= G> L@= KM;;=KK>MD ;GEHML9LAGF9D E=L@G< L@9L @9K :==F 9HHDA=< =PL=FKAN=DQ  (GJ=GN=J L@= - #9FF=P3$@9K9DKG =FNAK9?=<L@=MK=G>,.-E=L@G<K>GJL=KLAF? ;@=EA;9DK /@=J=>GJ= :=LL=J,.-EG<=DK9J=F==<=<LG=N9DM9L= L@=K9>=LQ G> ;@=EA;9DK L@9L @9N= <AN=JK= KLJM;LMJ=K 9F< ;GE= >JGE 9 N9JA=LQ G> KGMJ;=K KM;@9K;@=EA;9DH@9JE9;=MLA;9D9?JA;MDLMJ9DHGDQE=J9F<>GG<AF<MKLJA=K  /@MKO=@9N=E9<=9F9LL=EHLLG<=N=DGH9?DG:9D,.-EG<=DMKAF?L@= (MFJG <9L9:9K= L@9L ;GFKAKLK G> 9 N9JA=LQ G> ;@=EA;9DK >JGE H@9JE9;=MLA;9D 9?JA;MDLMJ9D>GG<9F<=FNAJGFE=FL9D>A=D<KKGL@9LL@AKEG<=D;9F;GFLJA:ML= LGO9J<K L@= K;J==FAF? G> 9;ML= LGPA;ALQ G> MFCFGOF ;GEHGMF<K G> <AN=JK= KLJM;LMJ=K9F<GJA?AFK . .

(120)  .  -=K=9J;@IM=KLAGFK9F<G:B=;LAN=K /@=(MFJG<9L9:9K=J=HJ=K=FLK9OA<=;@=EA;9DD9F<K;9H=A = AL;GFKAKLKG> 9N9JA=LQG>H@9JE9;=MLA;9DK9?JA;MDLMJ9D9F<AF<MKLJA9D;@=EA;9DKKM:KL9F;=K MK=< AF >GG< HJG<M;LAGF 9F< ;@=EA;9DK L@9L @9N= 9F AEH9;L GF L@= =FNAJGFE=FL $FL@AKJ=KH=;L.LG;@=JJG=L9D @9N=J=>=JJ=<LGL@AK<9L9:9K=9K ‘9 world of chemicals’. ;;GJ<AF?DQ L@AK <9L9K=L AK G> H9JLA;MD9J AFL=J=KL >GJ ;GEHML=J EG<=D=JK <M= LG ALK <AN=JK= ;GFL=FL   >=O ;GEHML9LAGF9D 9HHJG9;@=K @9N= :==F 9HHDA=< GF L@= (MFJG <9L9:9K= KM;@ 9K L@= =KL9:DAK@E=FLG>9LGPA;GDG?A;9DL@J=K@GD<G>;GF;=JF//MKAF?9<=;AKAGF LJ==9HHJG9;@9F<L@=9HHDA;9LAGFG>+JAF;AH9DGEHGF=FLF9DQKAK+ *JL@G?GF9D A<AJ=;LAGF9D +JGB=;LAGFK LG '9L=FL .LJM;LMJ=K AK;JAEAF9FL F9DQKAK *+'.  9F< L@= ;DMKL=JAF? 9HHJG9;@ >GJ L@= HMJHGK= G> ;D9KKA>QAF?L@=(MFJG<9L9:9K=;@=EA;9DK9;;GJ<AF?LGLGPA;ALQ  /@=9:GN= E=FLAGF=<KLM<A=KAF;GJHGJ9L=<KM: ;@JGFA;LGPA;ALQ=F<HGAFL<9L9 )* ' >GJ L@= ;D9KKA>A;9LAGF G> L@= (MFJG <9L9:9K= O@AD= L@=J= 9J= FG HM:DAK@=< KLM<A=K MKAF? 9;ML= LGPA;ALQ =F<HGAFL <9L9 DAC= '

(121)  GF L@AK <9L9:9K=  .AF;= L@= <9L9:9K= ;GFKAKLK G> <AN=JK= ;@=EA;9DK 9 ;GEHML9LAGF9D EG<=D<=N=DGH=<OAL@KM;@9<9L9K=L;GMD<K=JN=9K9:=LL=JK;J==FAF?LGGD>GJ HJ=<A;LAF?LGPA;HGL=FLA9DKG>MFCFGOF;GEHGMF<KG>N9JA=<KAR=K9F<K@9H=K  (GJ=GN=JL@=:=F=>ALG>MKAF?9;ML=LGPA;ALQN9DM=KGN=J;@JGFA;LGPA;ALQN9DM=K AK L@9L L@=Q 9J= G:L9AF=< AF 9 K@GJL LAE= <MJ9LAGF  –  <9QK O@A;@ ;9F ;GFLJA:ML= LG ;GKL =>>A;A=F;Q 9F< OAL@ D=KK ;ME:=JKGE= =PH=JAE=FLK AF ;GEH9JAKGF LG L@GK= MK=< >GJ L@= KM: ;@JGFA; 9F< ;@JGFA; LGPA;ALQ N9DM=K O@A;@ ?=F=J9DDQ L9C= >JGE  <9QK LG  Q=9JK G> KLM<Q 9F< AFNGDN= @M?= 9EGMFLKG>EGF=Q9F<J=IMAJ=KA?FA>A;9FL=>>GJL  /G ;GFKLJM;L 9 EG<=D L@9L ;9F := MK=< >GJ L@= IMA;C K;J==FAF? G> MFCFGOF ;GEHGMF<KG><AN=JK=KLJM;LMJ=K9F<GJA?AFKAF9K;=JL9AFAF?L@=AJ9;ML=LGPA;ALQ :Q9HHDA;9LAGFG>L@="DG:9DDQ#9JEGFAR=<.QKL=EG>;@=EA;9D;D9KKA>A;9LAGF O=@9N=E9<=9F=>>GJLLG<=N=DGH9 C ));D9KKA>A;9LAGFEG<=D>GJL@=9;ML= LGPA;ALQ'

(122) =F<HGAFLMKAF?(MFJG<9L9:9K=;@=EA;9DK9F<L@AK;GFKLALML=K L@=E9AFG:B=;LAN=G>L@AKKLM<Q+9H=J$ .  .MEE9JQG>OGJC   (MFJG<9L9:9K= /@= (MFJG <9L9K=L ;GFL9AFK 9 LGL9D G>   ;@=EA;9DK 9DD L@=K= ;@=EA;9DK O=J= =P9EAF=< 9F< 9ML@=FLA;9L=< :9K=< GF L@= ;GJJ=;L KLJM;LMJ= L@= ;GJJ=;L $0+ F9E= 9F< L@= ;GJJ=;L . J=?AKLJQ FME:=J -)  /@= .($' . FGL9LAGFK >GJ =9;@ KLJM;LMJ= O=J= ;9J=>MDDQ ;@=;C=< AF @=E.HA<=J .A?E9D<JA;@9F<+M:@=E /@=.($' .>GJ;AK LJ9FKAKGE=JK9F<- . =F9FLAGE=JKO=J=;9J=>MDDQAFKH=;L=<9F<GFDQ;9FGFA;9D.($' .O=J=L9C=F.  .

(123) AFLG;GFKA<=J9LAGF .9DLK9F<EAPLMJ=KO=J=J=EGN=<>JGEL@=GJA?AF9D<9L9K=L 9K O=J= 9DKG <MHDA;9L= J=;GJ<K 9F< J=;GJ<K EAKKAF? =AL@=J KLJM;LMJ= GJ . J=?AKLJQFME:=J /@=;@=EA;9DK;GFL9AFAF?<A9RGGJ?M9FA<AF=>MF;LAGF9DALA=K O=J= <AK;9J<=< :=;9MK= G> L@= HJ=K=F;= G> J=KGF9F;= KLJM;LMJ=K L@9L J=KMDL AF <A>>=J=FL.($' .FGL9LAGFK>GJL@=K9E=;@=EA;9D /@=J=O=J=;@=EA;9DK L@9L @9< ;GJJ=;L . $0+ F9E= 9F< .($' . FGL9LAGFK  /@= '

(124)  N9DM=K >GJ 9DD L@=K= KGJL=< ;@=EA;9DK O=J= K=9J;@=< AF L@= /GPF=L 9F< -/ . O=:K=JN=JK  -=;GJ<K OAL@ EGJ= L@9F GF= N9DM= O=J= <AK;9J<=<  !AF9DDQ'

(125) N9DM=K>GJ ;@=EA;9DKO=J=J=LJA=N=<>JGEL@=/GPF=L9F<L@= -/ .O=:K=JN=JK .   (=L@G<GDG?Q    =K;JAHLGJ;9D;MD9LAGFK /OG <AE=FKAGF9D J9?GF EGD=;MD9J <=K;JAHLGJK O=J= MK=< >GJ L@= EG<=D ;GFKLJM;LAGF LGL9DG><=K;JAHLGJKO=J= ;GEHML=<>GJ=9;@G>L@=  ;@=EA;9DK MKAF? L@= J9?GF  KG>LO9J= 

(126)   >ADL=JAF? G> L@= <=K;JAHLGJK O9K 9;@A=N=<:Q<AK;9J<AF?<=K;JAHLGJKOAL@GF=GJEGJ=EAKKAF?N9DM=K9KO=DD9K ;GFKL9FL GJ F=9J ;GFKL9FL N9DM=K  .M:K=IM=FLDQ L@= <=K;JAHLGJK OAL@ 9 H9AJ ;GJJ=D9LAGFD9J?=J L@9F O=J= =P;DM<=<   J=<M;=< HGGD G> L@= <=K;JAHLGJ K=LO9KJ=F<=J=<OAL@

(127) <=K;JAHLGJK     =K;JAHLGJ>ADL=JAF?9F<GMLDA=J<=L=;LAGF +JAF;AH9D GEHGF=FL F9DQKAK + O9K =EHDGQ=< LG >ADL=J <=K;JAHLGJK AF GJ<=J LG J=EGN= AJJ=D=N9FL GF=K  /@= + KLM<Q O9K H=J>GJE=< GF 9DD   ;@=EA;9DK MKAF?

(128)  <=K;JAHLGJK  /@= <9L9 O9K9MLG K;9D=< :=>GJ= L@= + KLM<Q /=FHJAF;AH9D;GEHGF=FLKO=J=MF<=JL9C=F>GJL@=+KLM<Q /GKGJL L@= J=D=N9FL <=K;JAHLGJK 9 DG9<AF? K;GJ= L@J=K@GD< G>

(129)

(130)  O9K MK=< O@A;@ >ADL=J=<

(131) <=K;JAHLGJK /@=K=<=K;JAHLGJKO=J=;GFKA<=J=<>GJ>MJL@=JKLM<A=K 9F<GL@=JAJJ=D=N9FL<=K;JAHLGJKO=J=J=EGN=< $F9<<ALAGF>AN=;@=EA;9DKO=J= A<=FLA>A=< 9K HGL=FLA9D GMLDA=JK AF L@= + K;GJ= HDGL 9F< L@MK L@=Q O=J= =P;DM<=< >JGE >MJL@=J KLM<A=K  /@= >AF9D <9L9K=L ;GFKA<=J=< >GJ L@= EG<=D <=N=DGHE=FL;GEHJAK=<G>;@=EA;9DK9DGF?OAL@L@=AJ

(132) <=K;JAHLGJK     (G<=D<=N=DGHE=FL     D9KKA>A;9LAGFK;@=E= /@=IM9FLAL9LAN=LGPA;GDG?A;9DJ=KHGFK=O9K ;D9KKA>A=<AF9IM9DAL9LAN=;D9KKGF L@= :9KAK G> L@= "DG:9DDQ #9JEGFAR=< .QKL=E  /@J== ;D9KK=K O=J= >GJE=< D9KK $ '

(133)  ≤ 300 E? C? <9Q @A?@DQ LGPA; D9KK $$ 

(134)

(135)   '

(136)  ≤ 2000 E? C? <9Q AFL=JE=<A9L= LGPA; 9F< D9KK $$$ '

(137)  

(138)

(139)

(140) E? C? <9Q DGO LGFGF LGPA;  . 

(141) .

(142)         D9KKA>A;9LAGFEG<=D  ;D9KKA>A;9LAGF EG<=D AK L@= E9L@=E9LA;9D J=D9LAGFK@AH :=LO==F 9 K=L G> <=K;JAHLGJK 9F< J=KHGFK= N9JA9:D=K  /@= C )) ;D9KKA>A;9LAGF E=L@G< O9K =EHDGQ=< LG <AK;GN=J L@= 9HHJGHJA9L= J=D9LAGFK@AH :=LO==F EGD=;MD9J KLJM;LMJ=K9F<L@=LGPA;ALQG>;@=EA;9DK /@= C ))9D?GJAL@EAK:9K=<GFL@= C F=9J=KL F=A?@:GJ ;D9KKA>A;9LAGF JMD= 9K <=K;JA:=< :Q #9JL =L 9D   $F L@AK 9D?GJAL@E =9;@ IM=JQ ;@=EA;9D AK ;D9KKA>A=< 9;;GJ<AF? LG L@= ;D9KK=K G> ALK ;DGK=KL F=A?@:GJK  /@= ;DGK=KL F=A?@:GJK 9J= A<=FLA>A=< GF L@= :9KAK G> 9 <AKL9F;= E9LJAP  .=N=J9D E=L@G<K G> <AKL9F;= ;9D;MD9LAGFK :=LO==F ;@=EA;9DK GF L@= :9KAK G> :AF9JQ <9L9 =PAKL LG <9L=  We have selected the ‘Euclidean’ <AKL9F;=E=L@G<>GJ;9D;MD9LAGFG> L@=<AKL9F;=E9LJA;=K 2@AD=9HHDQAF?L@= C )) 9D?GJAL@E L@= GHLAE9D N9DM= G> C F==<K LG := <=L=JEAF=<  2= @9N= MK=< L@= ;JGKK N9DA<9LAGF E=L@G< >GJ L@AK HMJHGK=   K=JA=K G> C N9DM=K O=J= 9KKA?F=<>JGEC LG

(143) 9F<:9K=<GFL@=@A?@=KL) -FGF =JJGJJ9L=9F< DGO=KL;D9KK=JJGJ9FGHLAE9DCN9DM=O9KA<=FLA>A=<      =K;JAHLGJK=D=;LAGF:Q"=F=LA;D?GJAL@EK "=F=LA; 9D?GJAL@EK "K O=J= =EHDGQ=< >GJ A<=FLA>QAF? KA?FA>A;9FL <=K;JAHLGJK9F<>GJJ=EGNAF?FGF KA?FA>A;9FL<=K;JAHLGJK "K9J=H=J>GJE=< GF9J9F<GEHGHMD9LAGFG>;@JGEGKGE=KL@9L;GFKAKLG>EGD=;MD9J<=K;JAHLGJK  *HLAEAR9LAGF G> 9 <=>AF=< >ALF=KK >MF;LAGF AK 9;@A=N=< :Q KAEMD9LAF? L@= =NGDMLAGF9JQHJG;=KKO@A;@?AN=KF=O;@JGEGKGE=K:Q;GE:AF9LAGFKOAL@L@= ;@JGEGKGE=KG>L@=AFALA9DHGHMD9LAGF:QE=9FKG>?=F=LA;GH=J9LAGFKKM;@9K ;JGKKGN=J 9F< EML9LAGF  /@= "K O=J= GHLAEAR=< GF L@= :9KAK G> L@= FGF =JJGJ J9L= ) - O@A;@ J=HJ=K=FLK L@= HJ=<A;LAN= HGO=J G> L@= EG<=D AF ;GJJ=;LDQ;D9KKA>QAF?;@=EA;9DK  $FGJ<=JLG=PLJ9;L9DD

(144) <=K;JAHLGJK>GJ9DD;@=EA;9DK 9;;GEHDAK@L@= + KLM<Q 9F< LG ;GF<M;L <=K;JAHLGJ K=D=;LAGF :Q "=F=LA; D?GJAL@EK O= @9N= =EHDGQ=< L@= “ga_toolbox” and “pca” Matlab modules developed at (AD9FG @=EGE=LJA;K 9F< ,.- -=K=9J;@ "JGMH 0FAN=JKALQ G> (AD9FG A;G;;9(AD9F$L9DQ      (G<=DN9DA<9LAGF /@=  (MFJG <9L9K=L ;@=EA;9DK O=J= J9F<GEDQ KHDAL C==HAF? 

(145)  G> L@= ;@=EA;9DK >JGE =N=JQ ;D9KK AF L@= LJ9AFAF? K=L 9F< L@= J=E9AFAF? 

(146)  AF L@= L=KLK=L/9:D=     /GK=D=;LL@=EGD=;MD9J<=K;JAHLGJK9F<LG;GFKLJM;L L@= ;D9KKA>A;9LAGF EG<=DK L@= LJ9AFAF? K=L O9K MK=<  /@= HJ=<A;LAN= HGO=J G> L@AKEG<=DO9K=N9DM9L=<:Q=EHDGQAF?L@=L=KLK=L;@=EA;9DK  .   .

(147)   /9:D=     D9KKA>A;9LAGFKG>L@=;@=EA;9DKAFL@=LJ9AFAF?9F<L=KL K=LKHJAGJLGL@=C ))EG<=D;GFKLJM;LAGF . . D9KK . D9KK. D9KK. /GL9D. /J9AFAF?K=L /=KLK=L /GL9D.   

(148) .    .   .   . /@= LJ9AFAF? K=L G>  ;@=EA;9DK OAL@ 

(149)  <=K;JAHLGJK O9K 9KKA?F=< >GJ N9JA9:D=K=D=;LAGF:QL@=";GMHD=<OAL@ C ));D9KKA>A;9LAGF /@=AFL=JF9D N9DA<9LAGFG>EG<=DKO9K9KK=KK=<:Q >GD<;JGKKN9DA<9LAGF>GMJ?JGMHKO=J= MK=<>GJL=KLAF?L@=;D9KKE=E:=JK@AHG>L@=GEALL=<?JGMHO@=J=L@=;D9KKG> L@=E9BGJALQG>C F=A?@:GJKO9K9KKA?F=<LGL@=E=E:=JG>L@=GEALL=<?JGMH  /@=:=KLEG<=D;GFKLJM;L=<GFL@=LJ9AFAF?K=LL@9L@9<9@A?@=J) -;N9F< L@= DGO=KL ;D9KK =JJGJ O9K KM:B=;L=< >GJ =PL=JF9D N9DA<9LAGF  /@= L=KL K=L ;GEHJAK=< G>  ;@=EA;9DK O9K MK=< >GJ =PL=JF9D N9DA<9LAGF  !AF9DDQ H=J>GJE9F;= G> L@= :=KL EG<=D O9K9KK=KK=< :Q E=9FKG> H9J9E=L=JK KM;@9K FGF =JJGJJ9L=) -K=FKALANALQKH=;A>A;ALQHJ=;AKAGF9F<=JJGJJ9L= -  /G;D9KKA>Q;@=EA;9DKAFLGL@=LJ9AFAF?9F<L=KLK=LK9KO=DD9KLGH=J>GJEL@= " ;GMHD=< C )) ;D9KKA>A;9LAGF the “classification toolbox” Matlab module <=N=DGH=<9L(AD9FG@=EGE=LJA;K9F<,.-J=K=9J;@?JGMHO9KMK=< .   -=KMDLK9F<<AK;MKKAGF    "=F=LA;D?GJAL@EGML;GE= /@="KLJ9L=?QO9K9HHDA=<AFGJ<=JLGK=D=;LKA?FA>A;9FL<=K;JAHLGJKO@A;@ D9L=J O=J= MK=< >GJ L@= ;GFKLJM;LAGF G> L@= C )) EG<=D  /@= :=KL C )) EG<=D G:L9AF=< >JGE L@= " KLJ9L=?Q ;GFKAKL=< G>  <=K;JAHLGJK 9F< O9K 9KKG;A9L=< OAL@ 9 ) -;N G>

(150)  9F< ) ->AL GF L@= LJ9AFAF? K=L G>

(151)  K== /9:D=   .   /@= C K=D=;LAGF OAL@  >GD< ;JGKK N9DA<9LAGF HJGNA<=< 9F GHLAE9D C N9DM= G>  A =  GFDQ L@= ;DGK=KL ;@=EA;9D O9K MK=< LG HJ=<A;L L@= ;D9KKG>=9;@L9J?=L;@=EA;9D /@=<=K;JAHLGJKL@9LO=J=MK=<AFL@= C )) ;D9KKA>A;9LAGF9J=K@GOFAF/9:D=  .  /9:D=   .  *N=JNA=O G> L@=  <=K;JAHLGJK <=JAN=< :Q L@= ?=F=LA; 9D?GJAL@E;GMHD=<OAL@C ));D9KKA>A;9LAGF  FLJQ. . )9E=. . (/. =. . .H(8K. =K;JAHLAGF (GJ9F9MLG;GJJ=D9LAGFG>D9?  O=A?@L=<:Q .9F<=JKGF=D=;LJGF=?9LANALQ .H=;LJ9DE=9F9:KGDML=. /QH= 9MLG;GJJ=D9LAGFK. E9LJAP :9K=<.

(152)  . . .H+GK8H. . (/. N. . (A. . . . .H(8E. . "/. H. 

(153) . 

(154)  .$

(155) . . F. . .$ . . /.=. . +81.8(-8. . '.8

(156) . . F'. . %8R5. . .(8K. . "/. N. 

(157) . %"$.  . +81.8A8. <=NA9LAGF>JGEMJ<=FE9LJAP O=A?@L=<:Q$ .L9L= )GJE9DAR=<KH=;LJ9DHGKALAN= KME>JGEMJ<=FE9LJAP O=A?@L=<:QHGD9JAR9:ADALQ (GJ9F9MLG;GJJ=D9LAGFG>D9?  O=A?@L=<:QN9F<=J299DK NGDME= (=9F>AJKLAGFAR9LAGFHGL=FLA9D K;9D=<GF 9J:GF9LGE (=9FAF>GJE9LAGFAF<=PGF 9LGEA;;GEHGKALAGF .H=;LJ9DE=9F9:KGDML= <=NA9LAGF>JGEMJ<=FE9LJAP O=A?@L=<:QE9KK "=9JQ9MLG;GJJ=D9LAGFG>D9?  O=A?@L=< :QHGD9JAR9:ADALQ - 3 - .LJM;LMJ9D$F>GJE9LAGFGFL=FL AF<=PF=A?@:GJ@GG<KQEE=LJQ G>

(158) GJ<=J )ME:=JG><GM:D=:GF<K .LJM;LMJ9D$F>GJE9LAGFGFL=FL AF<=PF=A?@:GJ@GG<KQEE=LJQ G> GJ<=J JGLG (GJ=9M9MLG;GJJ=D9LAGF G>D9?DG?>MF;LAGFO=A?@L=< :Q.9F<=JKGF=D=;LJGF=?9LANALQ +81. DAC=GF(GD9J -=>J9;LANALQ:AF (G<A>A=<<JM? DAC=K;GJ=>JGE *HJ=9=L9D JMD=K )ME:=JG>@DGJAF=9LGEK 9D9:9F DAC=AF<=P>JGE9JQKR E9LJAPO=A?@L=<:Q9LGEA; FME:=J .H=;LJ9DEGE=FLG>GJ<=J >JGEMJ<=FE9LJAPO=A?@L=< :Q$ .L9L= "=9JQ9MLG;GJJ=D9LAGFG>D9?  O=A?@L=<:QN9F<=J299DK NGDME= (=9FLGHGDG?A;9D;@9J?=AF<=P G>GJ<=J +81. DAC=GFAGFAR9LAGF. <=K;JAHLGJK E9LJAP :9K=< <=K;JAHLGJK 9MLG;GJJ=D9LAGFK. GFKLALMLAGF9D AF<A;=K $F>GJE9LAGFAF<A;=K E9LJAP :9K=< <=K;JAHLGJK 9MLG;GJJ=D9LAGFK. LGE ;=FLJ=<>J9?E=FLK. $F>GJE9LAGFAF<A;=K. GFKLALMLAGF9D AF<A;=K $F>GJE9LAGFAF<A;=K. 9MLG;GJJ=D9LAGFK. +81. DAC= <=K;JAHLGJK JM? DAC=AF<A;=K GFKLALMLAGF9D AF<A;=K E9LJAP :9K=< <=K;JAHLGJK E9LJAP :9K=< <=K;JAHLGJK 9MLG;GJJ=D9LAGFK. 9MLG;GJJ=D9LAGFK +81. DAC=.  .

(159) . + . . 

(160) 6. +7. . 

(161) 6 .7. . '/!. HGL=FLA9D:AF 3 +3H@GKH@9L=. <=K;JAHLGJK LGE ;=FLJ=< >J9?E=FLK LGE+9AJK. +J=K=F;= 9:K=F;=G>. +9L LGHGDG?A;9D<AKL9F;=  +J=K=F;= 9:K=F;=G> .9L LGHGDG?A;9D<AKL9F;= 1=J@99J!AK@:9K= DAF=LGPA;ALQ >JGE('*"+EEGD D. LGE+9AJK (GD=;MD9J HJGH=JLA=K.  G>  LJ9AFAF? K=L ;@=EA;9DK O=J= ;GJJ=;LDQ HJ=<A;L=< :Q L@= C )) ;D9KKA>A;9LAGF EG<=D  The sensitivity describes the model’s ability to correctly A<=FLA>Q L@= ;D9KK >GJ 9 ;@=EA;9D  !GJ LJ9AFAF? K=L HJ=<A;LAGF L@= C )) ;D9KKA>A;9LAGF ;JGKK N9DA<9L=< EG<=D <AKHD9Q=< K=FKALANALA=K G>

(162) 

(163)  9F<

(164)  >GJ ;D9KK=K $ $$ 9F< $$$ J=KH=;LAN=DQ /9:D=   .   .H=;A>A;ALQ ;@9J9;L=JAR=K L@= 9:ADALQ G> L@= H9JLA;MD9J ;D9KK LG J=B=;L L@= ;@=EA;9DK G> 9DD GL@=J ;D9KK=K  /@= C )) ;D9KKA>A;9LAGF ;JGKK N9DA<9L=< EG<=D <AKHD9QK @A?@ KH=;A>A;ALQ N9DM=K >GJ 9DD  ;D9KK=K A =  L@= EG<=D ;9F HJ=<A;L @A?@DQ LGPA; ;@=EA;9DK;D9KK$OAL@9KH=;A>A;ALQJ9L=G>

(165)  9F<

(166) 9F<

(167) >GJ;D9KK=K $$ 9F< $$$ J=KH=;LAN=DQ  !GJ L@= =PL=JF9D N9DA<9LAGF L@= EG<=D @9K K@GOF K=FKALANALA=K G>

(168) 

(169)  9F<

(170)  >GJ ;D9KK=K $ $$ 9F< $$$ J=KH=;LAN=DQ 9F< ;GJJ=KHGF<AF? KH=;A>A;ALA=K G>

(171) 

(172)  9F<

(173)   /@= EG<=D ;GJJ=;LDQ A<=FLA>A=<;D9KK=KG>GMLG>;@=EA;9DK>JGEL@==PL=JF9DK=L  /9:D=  .  D9KKA>A;9LAGFH9J9E=L=JKG>L@=C ));D9KKA>A;9LAGFEG<=D  ) -  .    !ALLAF? 1 PL=JF9D.

(174) 

(175) 

(176) . .=FKALANALQ D9KK $$ $$$. $

(177) 

(178) 

(179) .

(180) 

(181) 

(182) .

(183) 

(184) 

(185) . $. .H=;A>A;ALQ D9KK $$ $$$.

(186) 

(187) 

(188)  

(189) .

(190) 

(191) 

(192) .

(193) 

(194) 

References

Related documents

Moreover, we have sorted all queries based on correct class prediction by the Estate fingerprints based k-NN model (refer Table S3 for predicted class information);

These statements are supported by Harris et al (1994), who, using MBAR methods, find differ- ences in value relevance between adjusted and unadjusted German accounting numbers.

16 No relevant Risk-phrase related to human toxicity was assigned to this substance and a toxicity value corresponding to the lower end of the criteria for risk-phrase R22 was

pedagogue should therefore not be seen as a representative for their native tongue, but just as any other pedagogue but with a special competence. The advantage that these two bi-

With regard to our research questions, we show that support for tool usage greatly a↵ects student satisfaction (RQ1), that a good experience with the tool has a positive influence

In the first paper we study polaron dynamics in highly ordered molecular crystals and in particular the transition from adiabatic to nonadiabatic transport across the region

This is the step “Add regulation on hubs”, which separates the ensemble Yeast Topology corresponding to networks with identical structure to the inferred network, but random-

Rather than stating, as she does, that events are being produced by ‘grounds’, or that the event is a figuration of a rupture that inexplicably appears against a background of