• No results found

Anomaly Detection and Revenue Loss Estimation in Accounting Data

N/A
N/A
Protected

Academic year: 2022

Share "Anomaly Detection and Revenue Loss Estimation in Accounting Data"

Copied!
74
0
0

Loading.... (view fulltext now)

Full text

(1)

IN

DEGREE PROJECT ENGINEERING PHYSICS, SECOND CYCLE, 30 CREDITS

STOCKHOLM SWEDEN 2020,

Anomaly Detection and Revenue Loss Estimation in Accounting Data

GUSTAV EDHOLM

KTH ROYAL INSTITUTE OF TECHNOLOGY

SCHOOL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE

(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
(15)
(16)
(17)

d R Rd

x

(18)
(19)

Anomaly Normal

!!

!"

[0, 1] [1, 0]

(20)

(x, y) y

x x

i x

y xi yi

j xj

R

x ∈ Rd y ∈ Rk

y y x

ˆ

y y

y x y

t |y − ˆy| |y − ˆy| > t y

ˆ y > y

LR(x) =

Ax + b A k × d b ∈ Rk

mse(y, ˆy) =

!k

i=1(yi− ˆyi)2

k .

(21)

0 2 4 6 8 10 x

0 1 2 3 4 5 6 7 8 9 10

y

x ∈ R y ∈ R

(x, y) y = bxˆ

x yˆ f (x) = ˆy

x ˆ

y x

x yˆ

N {(xj, yj)}Nj=1

xi

y σy {(xj, yj)}kj=1

(22)

!!

!" !#

!"

$

!"

%

!"

&

!"

'

!

!! < c"

!! ≥ %"

!# < %$

!# ≥ %$

!% ≥ %& !% < %&

x ci

ˆ yi

{(xj, yj)}Nj=k+1

ˆ yl

l {yj}nj=m

ˆ yl=

!n j=myj

n− m + 1 = µly.

σlyly t

K

{fk}Kk=1 = φ yˆ

{ˆyj}Kj=1

(23)

L

ˆ

y y

(x, f (x))

∂f

∂x η

xt+1= xt− η∂f

∂x(xt)

"K k=1

Ω(fk)

(24)

Ω(fk) = γTk+ 1

2λ||wk||2,

L(φ) = l(y, ˆy) +

"K k=1

Ω(fk)

l y yˆ Tk

k wk γ

λ

t ft

L(t) =

"N i=1

l(yi, ˆyi,(t−1)+ ft(xi)) + Ω(ft)

ˆ

yi,(t−1) yi

t− 1

L(t) $

"N i=1

[l(yi, ˆyi,(t−1)) + gift(xi) + 1

2hift2(xi)] + Ω(ft)

gi hi l

gi = ∂yˆi,(t−1)l(yi, ˆyi,(t−1)),

hi = ∂y2ˆi,(t−1)l(yi, ˆyi,(t−1)).

t

L(t) $

"N i=1

[gift(xi) + 1

2hift2(xi)] + Ω(ft).

(25)

η < 1

x

F (x, θ) = ˆy hi

ˆ y

Hi θi

Mi Ai

Hi hi x

h1 = H1(x),

h2 = H2(h1), ...

ˆ

y = Hl+1(hl).

l

hi = Hi(hi−1) = Ai(Mihi−1)

Mi Ai

x L(ˆy)

ˆ y

y x

y yˆ

(26)

Hidden layers Input layer

Output layer

Weights

!

" #$

{xi}ni=1 n

{F (xi, θ)}ni=1 {ˆyi}ni=1

{L(ˆyi)}ni=1

θ

∂L

∂θi

η

θi ← θi − η∂θLi

argminθ[L(F (x, θ))]

(27)

x y

h

ReLU (hi) = max(0, hi)

hl

sigmoid(hl) = ehl ehl+ 1. k

sof tmax(hl) = ehl

!k j=1ehlj,

(28)

CE(ˆy, y) =

"k i=1

yilog(ˆyi).

ˆ y = hl

θ

(29)

x

k k

k

{mi}ki x

mi mi

mi

Si ={xp;||xp− mi||2 ≤ ||xp− mj||2,∀j, 1 ≤ j ≤ k},

mi

mi = 1

|Si|

"

xj∈Si

xj.

k = 2

k

(30)

P (x) =

z d

k

k

Z = XW

Z W k X!X

X x

P#(z) = ˆx

x ˆ

x = P#(P (x))

1 d

"d i=1

(xi− ˆxi)2.

z k

(31)

Encoder Decoder

#"

"

%

z = E(x) x = D(z)ˆ

x

x z x xˆ

T

(32)

T P R = T P T P + F N.

F P R = F P F P + T N.

P recision = T P T P + F P.

N

F P R = 0 N F P R = 1 N

T P R > F P R

(33)

T P R ≈ F P R

0.5

> 0.5

(34)

k

(35)

z

k

α

k

k

(36)

k

y ˆ

y

(37)

k

(38)
(39)

k

(40)
(41)

Data

Acquisition Data

Pre-processing Model

Selection Model

Training Model Testing

(42)

t

t

pc t

pt

T pt− pc

pt > T

T

pt

(43)

Rc Rt

Rc =

"N i=1

picti

Rt=

"N i=1

pitti.

Rt− Rc = 8.5

pc > pt

(44)

d = 1473 N = 9813

(45)
(46)

F (x) = ˆy

(47)

[0, 1]

pc pt

(48)

n

ˆ

y pc

ˆ y > pc

|pc− ˆy| > T T

ˆ y < pc

max(F P R)× max(T P R)

(49)

Rp Rp−Rc

ˆ y t

Rp =

"N i=1

ˆ yiti.

(50)

ˆ

y t

µyˆ =

!N i=1yˆ N

σyˆ =

 !N

i=1yi− µyˆ)2

N .

t yˆ t

p = ˆyt

µp = µyˆµt

ˆ y

t σx2 = V ar(x) µx = E(x).

(51)

σp2 = V ar(ˆyt)

= E(ˆy2t2)− (E(ˆyt))2

= V ar(ˆy)V ar(t) + V ar(ˆy)(E(t))2+ V ar(t)(E(ˆy))2

= σy2ˆσt2+ σy2ˆµ2t + σt2µ2ˆy

= (σy2ˆ+ µ2yˆ)(σt2+ µ2t)− µ2yˆµ2t p

p

µRp = N µp =

"N i=1

pi,

σRp =»

N σp2 = σp

N .

(52)

n

(53)

pt

ˆ

y∈ [0, 1]

(54)

pt

pc

(55)

pc

(56)

λ = 0.1 α = 5000

x d

d d = 1

d = 1

ˆ

ySGBDT yˆN N β γ

ˆ

yens = β ˆySGBDT + γ ˆyN N.

β γ β = 0.3

γ = 0.7

(57)

n n =

(58)

0.5 0.6 0.7 0.8 0.9 1 AUC score

RLT Ensemble RLT SGBDT regression RLT Random Forest regression RLT NN regression ELT NN regression RLT PCA ELT Binary NN classifier

(59)
(60)
(61)

0 5 10 15 20 25 30 35 40 0

0.05 0.1 0.15 0.2 0.25 0.3 0.35

RLT NN Regression predicted revenue distribution RLT RF and SGBDT predicted revenue

Charged Revenue Target Revenue

(62)
(63)
(64)

t

t

(65)

t t

(66)
(67)
(68)
(69)
(70)
(71)
(72)
(73)
(74)

www.kth.se

TRITA-EECS-EX-2020:929

References

Related documents

Även i Moderaternas fall gjordes en jämförelse mellan 1994 och 2007 års statistik kring industriarbetarnas val av parti, vilket ledde till samma resultat som

Eftersom föregå- ende simuleringarna visar på att t-testet inte bevarar signifikansnivån för denna typ av feltermer, är testet inte giltigt och bör därför inte användas, trots

The estimator is semiparametric in the sense that the regression function is nonparametric while the distributions used for the maximum likelihood estimators are parametric..

Varsågoda att frukostmingla, eller hitta ert bord och presentera er för varandra.. Förslag på

Syftet var dels att få en inblick i vilka förutsättningar blivande lärare på Lärarprogrammet på Campus Norrköping får med sig för att lyckas med arbetet inom området

Johan Holmgren and Pernilla Ivehammar, Making headway towards a better understanding of service frequency valuations: a study of how the relative valuation of train service frequency

I Om tv˚a tillst˚and kommunicerar tv˚asidigt ¨ar antingen b˚ada tillst˚anden best¨andiga eller b˚ada obest¨andiga.. I Om alla tillst˚and kommunicerar tv˚asidigt med varandra

If the probability that a (20-year old or older) pine is damaged is independent of its age, then the point process of old and damaged pines is a PPP with intensity. λ old = λP(B) = λ