Improving the accuracy of estimates for complex sampling in auditing 1 . Y. G. Berger 1 P. M. Chiodini 2 M. Zenga 2 1 University of Southampton (UK) 2 University of Milano-Bicocca (Italy) 14-06-2017 1The research leading to these results has received support under the European Commissions 7th Framework Programme (FP7/2013-2017) under grant agreement n312691, InGRID Inclusive Growth Research Infrastructure Diffusion. Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 1 / 26
Overview Audit Sampling 1 Definitions 2 Simulation study 3 Conclusions and Future research 4 Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 2 / 26
AUDIT SAMPLING In auditing the goal is to verify if the values of the accounts reported by the company are not materially misstated To determine the total error in the amount reported by the company auditors should audit all accounts in the population of accounts. This is not possible as it is too costly and time expensive! In practice auditors verify only a sample of accounts to estimate the error of the total population of accounts (total amount error & error rate) Common statistical methods to select an audit sample are by without replacement or by probability proportional to size (PPS) Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 3 / 26
SAMPLING PLANS IN AUDITING For the auditor the only interest is to verify that the error rate is within a pre-assigned value, if the observed rate is greater than the pre-assigned value a census is made. Practically speaking all the sampling methods are reliable for audit sampling, by the way Monetary Unit Sampling (MUS) - a particular case of the PPS - is the most popular! This sampling method directs efforts towards high-valued items which contain the greatest potential of large overstatement Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 4 / 26
SAMPLING PLANS IN AUDITING The way the auditors make their choices in terms of sampling strategies is frequently based on personal experience! In general: book values distributions are highly positively skewed with different percentage of errors Two main scenarios can be met: - a population with relatively large number of small accounts combined with high rate of mistakes - a population with relatively large number of accounts combined with a small rate of mistakes Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 5 / 26
DEFINITIONS An accounting population consist of N line items with book (or recorded) values, y 1 , y 2 , , y N and total book amount T y defined by: N � T y = y i . i =1 The audited (true) amount of the N line items in the population is denoted by x 1 , x 2 , ..., x N and the total audited amount is: N � T x = x i . i =1 Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 6 / 26
DEFINITIONS The error in item i , is z i = y i − x i , 1 ≤ i ≤ N . When z i > 0, the i-th item is said to be overstated and when z i < 0, it is understated. When z i = 0, the account is said to be error free. The total error amount is defined as: N � T z = z i . i =1 For y i � = 0, t i = z i y i = y i − x i is called the fractional error or taint. The y i values ( x 1 , x 2 , ..., x N ) are unknown before sampling, whereas ( y 1 , y 2 , ..., y N ) are known. It is assumed that the amount of any overstatement does not exceed the stated recorded value. Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 7 / 26
DEFINITIONS The purpose of the audit is to estimate the total error amount T z : n n � � T z = z i = t i × y i i =1 i =1 obtained by the examination of a sample of n items of the account. Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 8 / 26
Stringer Bound (Stringer, 1963) Let T 1 , ..., T n be the independent random variables which represent the taintings. The distribution of these taintings is some unknown mixture of distributions on the interval [0; 1], so that Pr (0 ≤ T i ≤ 1) = 1. Let denote µ = E ( T i ) and let 0 ≤ t 1: n ≤ t 2: n ≤ ... ≤ t n : n ≤ 1 be the ordered statistics of ( T 1 , T 2 , ..., T n ). For α ∈ (0; 1) and i = 0 , 1 , ..., n − 1 let p = p n ( i ; 1 − α ) be the unique solution of: i � n � p k (1 − p ) n − k = α � k k =0 with p n ( n ; 1 − α ) = 1. Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 9 / 26
Stringer Bound (Stringer, 1963) The Stringer method for obtaining an upper bound for the total overstatement error can be obtained by combining the upper limits for the sample error rates with the taints: n ˆ � T z = T y p n (0 , 1 − α ) + T y [ p n ( i , 1 − α ) − p n ( i − 1 , 1 − α )] t n − i +1: n i =1 then p n ( i , 1 − α ) is the (1 − α ) upper confidence limit for the binomial parameter when i errors are observed in a sample of size n . Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 10 / 26
Stringer Bound (Stringer, 1963) Equivalently for a given α , n and number of errors i , it is possible to find the value p n ( i , 1 − α ) that satisfies: i � n � [ p n ( i , 1 − α )] j [1 − p n ( i , 1 − α )] n − j = α. � j j =0 The Stringer bound is sometimes calculated using the Poisson approximation for obtaining the upper confidence limits p n ( i , 1 − α ). Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 11 / 26
Empirical Likelihood estimator EL is a non-parametric likelihood. Hartley and Rao in 1968 first introduced it in the context of survey sampling as scale-load approach. From early 2000 the EL approach has been introduced also in survey sampling literature. EL approach provides non-parametric confidence intervals similar to the parametric likelihood ratio intervals. Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 12 / 26
Empirical Likelihood estimator The shape and the orientation of the EL intervals are completely determined by the data. Chen et al. (2003) obtained EL intervals on the population mean for populations containing many zero values thats the case of audit sampling. Parametric likelihood ratio intervals based on parametric mixture distributions perform better than the standard normal theory intervals in terms of coverage, but EL intervals perform better under deviations from the assumed mixture model, by providing non-coverage rate below lower bound closer to the nominal value and also larger lower bound. Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 13 / 26
EL approach (Berger, 2016) Let U be a finite populations of N units. θ 0 is the unique solution � G ( θ ) = 0 , G ( θ ) = g i ( θ ) i ∈ U The empirical log-likelihood function is defined � � l ( m ) = log ( m i ) = log ( m i ) (1) i ∈ S i ∈ S m i is estimated by the value ˆ m i which maximize l ( m ) subject m i ≥ 0 � m i c = C i ∈ S Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 14 / 26
EL approach (Berger, 2016) The solution is given by m i = { ( t + η ) T c i } − 1 = ( π i + η T c i ) − 1 ˆ where π i = np i . Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 15 / 26
Maximum EL estimator The empirical log-likelihood ratio function is defined by m ∗ , θ ) } ˆ r ( θ ) = 2 { l ( ˆ m ) − l ( ˆ where l ( ˆ m ) = � log ( ˆ m i ) i ∈ S l ( ˆ m ∗ , θ ) = � log ( ˆ m ∗ i ( θ )) i ∈ S m ∗ ˆ i ( θ ) is the value that maximize (1) subject to - m i ≥ 0 i = C ∗ with c ∗ i , 0) T and C ∗ = ( C T , 0) T i = ( c T - � m i c ∗ i ∈ S Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 16 / 26
HOW TO EVALUATE AUDITING RISK Previous studies has already demonstrated that different sampling plans such as Systematic Sampling or Probability Proportional to Size plans (such as Unrestricted Random Sampling, Lahiri Sampling, ...) are all reliable even for all sample size! We evaluate the efficiency of EL Bound respect the Stringer Bound. The parametr of interest is the ratio of error-per-Euro (Taint) Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 17 / 26
SIMULATION STUDY We simulated data using a real accounting population of credit invoices of an audited society. 40,0 30,0 Frequency 20,0 10,0 0,0 0 1000 2000 3000 4000 5000 Value Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 18 / 26
SIMULATION STUDY X ∼ LogNormal (7 . 001 , 1 . 71). N = 10000. m = 1000 samples. Error randomly associated to the X i (no fraudulent hypothesis) Error rates : + 5%, + 10% + 20%. Taint simulated values: - 0.1-0.3; - 0.5-0.7; - 0.2-0.7; Sample fractions: * 0.05 * 0.1 Berger, Chiodini, Zenga Improving the accuracy of estimates 14-06-2017 19 / 26
Recommend
More recommend