ROBUST MEAN–VARIANCE PORTFOLIO SELECTION USING CLUSTER ANALYSIS: A COMPARISON BETWEEN KAMILA AND WEIGHTED K-MEAN CLUSTERING  

La Gubu1--- Dedi Rosadi2+---Abdurakhman3

1Department of Mathematics, Gadjah Mada University, Yogyakarta, Indonesia; Department of Mathematics, Haluoleo University, Kendari, Indonesia.
2,3Department of Mathematics, Gadjah Mada University, Yogyakarta, Indonesia.

ABSTRACT

This study presents robust portfolio selection using cluster analysis of mixed-type data. For this empirical study, the daily price data of LQ45 index stocks listed on the Indonesia Stock Exchange were employed. First, six stocks clusters are formed by using the KAMILA algorithm on a combination of continuous and categorical variables. For comparison purposes, weighted k-means cluster analysis was also undertaken. Second, stocks that were representative of each cluster, those with the highest Sharpe ratios, were selected to create a portfolio. The optimum portfolio was determined through classic (non-robust) and the robust estimation methods of fast minimum covariance determinant (FMCD) and S estimation. Using a robust procedure enables the best-performing portfolio to be created efficiently when selecting assets from a large number of stocks, especially as the results are largely unaffected in the presence of outliers. This study found that the performance of the portfolio developed with the KAMILA clustering algorithm and robust FMCD estimation outperformed those created by other methods.

Keywords:KAMILA clustering, Weighted k-means clustering, Robust estimation, FMCD estimation, S estimation, Outliers, Portfolio optimization.

JEL Classification: G11.

ARTICLE HISTORY: Received:30 June 2020, Revised:4 August 2020, Accepted:1 October 2020, Published:19 October 2020

Contribution/Originality: This study presents robust portfolio selection using cluster analysis of mixed-type data, with the possible presence of outliers. The results reveal the portfolio developed with the KAMILA clustering algorithm and robust FMCD estimation outperformed portfolios created by other methods.

1. INTRODUCTION

Portfolio management is one issue that attracts the interest and attention of financial researchers, mainly in relation to developing portfolios of the best securities to maximize investors’ profits. The fundamental theory of portfolio optimization can be traced back to the Markowitz (1952), who advised the selection and allocation of investments based on mean–variance analysis.

More recently, practitioners seek efficient time and cost management in creating the optimum portfolio, which can be achieved by first applying such techniques as cluster analysis to selecting securities. This technique has been undertaken by several researchers (e.g., Guan and Jiang, 2007; Tola, Lillo, Gallegati, and Mantegna, 2008; Chen and Huang, 2009; Nanda, Mahanty, and Tiwari, 2010; Long, Wisitpongphan, Meesad, and Unger, 2014), using different methods of cluster analysis and for selecting securities, although all adopting the Markowitz mean–variance (MV) portfolio model. All the studies reported that cluster analysis to be very efficient in creating an optimum portfolio when there is a large number of securities are available for selection.
However, the MV portfolio model depends on mean vectors and variance–covariance matrices being estimated from highly volatile data, while there were a range of parameter estimation techniques available, which inevitably contain estimation errors. As an important factor in forming the MV portfolio model, estimation errors will therefore affect portfolio optimization significantly. Several studies have been conducted on the relationship between estimation errors and portfolio optimization (Best and Grauer, 1991; Broadie, 1993; Chopra and Ziemba, 1993; Bengtsson, 2004; Ceria and Stubbs, 2006), concluding that although the MV portfolio model had a strong theoretical basis and was relatively easy to compute, it contained weaknesses. For instance, the optimum portfolios created were not well diversified but concentrated within a specific sector. In addition, the model is highly sensitive to changes in mean vectors and variance–covariance matrices.

Therefore, some researchers have investigated robust portfolios, which reduce the effect of outliers on the estimated vector means and the variance–covariance matrices. Several studies have adopted the standard robust estimation approach to developing an optimum robust portfolio is (Victoria-Feser, 2000; Lauprete, 2001; Vaz De Melo and Camara, 2003; Zhou, 2006; Welsch and Zhou, 2007; DeMiguel and Nogales, 2008; Hu, 2012; Kusch, 2012; Supandi, 2017). These studies have used different robust estimates to optimize the portfolio, but all reported that robust portfolios perform better than the classical one in the presence of outliers. However, there has been no research into combining cluster analysis and robust estimation in creating an optimum portfolio. In addition, cluster analysis uses only continuous data, stock return data, excluding categorical data such as sector, listing board, and market capitalization of stocks.

This study bridges this gap in the literature, therefore, by applying the KAMILA algorithm to cluster analysis of mixed data: continuous and categorical, the results of which are combined with robust estimation to create the optimum portfolio. Fast minimum covariance determinant (FMCD) and S estimation were used to calculate the mean and covariance of the mixed data.

The rest of this paper is organized as follows: Section 2 we explains concepts necessary to the discussion; Section 3 presents the empirical results; Section 4 discusses these results; and Section 5 details the conclusions reached.

2. MATERIALS AND METHOD

Following the literature review, it became evident that the problem of selecting the best securities can be resolved more efficiently by clustering stocks and then selecting stock clusters to optimize the portfolio.

In the current study, the KAMILA algorithm was first applied to a combination of continuous—stock prices and trading volumes—and categorical—sector, listing board, and market capitalization of stocks—data to produce several clusters of stocks. Thereafter, the returns and risk are calculated for each stocks cluster from the historical data, followed by the Sharpe ratio to determine the performance of every stock within each cluster. Those stocks with the highest Sharpe ratio are then selected to represent each cluster in creating the optimum portfolio. Finally, FMCD and S estimation were used to determine the weighting of each stock in the portfolio by providing a robust estimation of the mean and variance of those stocks.

A comparison was undertaken between the performances of the portfolios produced by both the KAMILA algorithm and weighted k-means clustering, which both employed the same robust estimation method, to determine the advantages of the former.

The subsequent subsections summarized other concepts related to this study.

2.1. Mean-Variance Portfolio

Markowitz's portfolio theory is based on the mean–variance approach, in which the mean measures the level of the expected return and variance measures the level of risk (Markowitz, 1952); thus, Markowitz's mean–variance (MV) was produced, emphasizing the maximization of expected returns and minimization of risk in developing the optimum portfolio. According to Supandi (2017), the mean–variance portfolio can be created by resolving the following optimization problems:

Taking the expected value of Equation 5, the following approximation can be calculated:

Based on the Kuhn–Tucker theorem (Winston & Goldberg, 2004), the necessary conditions for the optimal of Equation 9 are:

2.2. Cluster Analysis

Cluster analysis is a statistical method that groups together data objects sharing similar characteristics, aiming for within-group homogeneity, or at least as small a variation between the data objects as possible. A brief overview of the two types of cluster analysis, or segmentation, used in this study is now provided.

2.2.1. KAMILA Algorithm

KAMILA is actually the acronym for k-means for mixed large data sets, meaning that the KAMILA algorithm is a development of the k-means clustering method for use with a mixture of continuous and categorical data. There are several other clustering methods capable of dealing with mixed-type data, but there are drawbacks with each: strong parametric assumptions, such as the Gaussian multinomial mixture models (Foss and Markatou, 2018); cannot minimize or maximize the contribution of individual variables, such as Modha–Spangler weighting (Foss, Markatou, Ray, and Heching, 2016); or based on arbitrary weighting to determine the contribution of continuous and categorical variables, such as dummy coding and Gower's distance (Foss and Markatou, 2018).

The KAMILA algorithm, however, combines the two best clustering methods for large mixed-type data: k-means clustering method and Gaussian multinomial mixture models. KAMILA and k-means are similar in not needing parametric assumptions for continuous data; however, KAMILA holds the advantage because it can deal with different levels of overlap for individual variables, unlike 𝐾-means that relies on arbitrary weighting to calculating the Euclidean distance at different overlap levels. 

2.2.1.1. Model

2.2.1.2. Radial Kernel Density Estimation

Univariate kernel density estimation avoids the drawbacks of being too expensive to calculate and tending to overfit data points (Foss & Markatou, 2018) found in multivariate kernel density estimation (Scott, 1992).

2.2.1.3. Algorithm Description  

KAMILA introduces an iterative process to estimate the unknown parameters in Equation 16 (Foss et al., 2016). Foss and Markatou (2018) explained that the iterative process consisted of two stages: partition and estimation. Partition assigns each observation to a cluster, while estimation re-estimates the parameters of interest using these new clusters.

The KAMILA R package applies the simple rule of stopping once group membership remains unchanged between iterations.

2.2.2. Weighted K-Means Clustering

The weighted k-means and k-means algorithms are the same, as follows (Kerdprasop et al., 2005):

  1. Determine the desired number of k clusters.
  2. Initialize k cluster centroids.
  3. Allocate data objects to the nearest cluster based on the centroid produced by Equation 21:
  4. Reallocate each data point after every iteration until the centroid no longer changes  (i.e., Equation 20 is satisfied), at which point cluster analysis is complete.

2.3. Sharpe Ratio

Once the clusters are formed, the performance of every stock within each cluster is assessed using the Sharpe ratio, also known as the Sharpe index, which measures excess return (or risk premium) per unit risk in an asset (Sharpe, 1994): it characterizes how well asset returns compensate investors for the risks they have taken.

In general, the higher the value of a stock’s Sharpe ratio, the better the stock’s performance.

2.4. Outlier Detection 

Outliers are data points that are a significant distance from the majority of other observations and the cluster centroid, or deviate from the general pattern of data in some way. The higher the value of an outlier, the greater its distance from the centroid—outliers typically lie at large distances (Filzmoser, Garrett, & Reimann, 2005). However, not only the distance between an observation and the centroid but also the overall shape of the data should be considered with multivariate data. 

The shape and size of multivariate data are quantified by the covariance matrix, which the Mahalanobis distance measure accommodates.

2.5. Portfolio Selection Using Robust Estimation

In this study, the robust FMCD and robust S estimation methods were used to determine the weighting of the stocks selected for the optimum portfolio. Both methods will now be briefly described.

2.5.1. Robust FMCD Estimation

2.5.2. Robust S-Estimation

3. RESULTS AND DISCUSSION

3.1. Stocks Clustering

The current study used the daily prices of all stocks in the LQ45 index for the Indonesia Stock Exchange (see e.g., https://finance.yahoo.com). For the weighted k-means clustering, the weightings of the continuous and categorical variables was varied: first, combining (0.8, 0.5, 0.2) for the continuous variables, and then, (0.2, 0.5, 0.8) for the categorical variables. The kamila and wkmeans functions in the R package (R Core Team, 2020) revealed that the LQ-45 index stocks could be grouped into six clusters, which are presented in Tables 1–4.

Table-1. Stocks clusters using KAMILA algorithm.
Cluster
Stocks
1
AALI AKRA BMTR LPPF LSIP MNCN SSMS      
2
ADRO ANTM BSDE BUMI INCO MYRX PTBA PWON WSKT  
3
BBCA GGRM UNTR UNVR            
4
ASII EXCL HMSP ICBP JSMR PGAS TLKM      
5
ADHI BBNI BBRI BBTN BJBR BMRI LPKR PPRO PTPP SCMA
SMRA SRIL WIKA              
6
BRPT INDF INTP KLBF SMGR          

Table-2. Stocks clusters using weighted k-means 0.8 : 0.2.
Cluster
Stocks
1
LPKR AKRA AALI MNCN PPRO LSIP ADHI PTPP SSMS WIKA
SMRA LPPF BMTR SRIL BJBR          
2
BUMI TLKM PWON MYRX            
3
BBTN INDF KLBF BSDE SMGR JSMR INTP EXCL SCMA PGAS
WSKT BRPT                
4
UNVR GGRM UNTR              
5
ASII BMRI BBNI ICBP BBRI HMSP BBCA      
6
ADRO ANTM PTBA INCO            

Table-3. Stocks clusters using weighted k-means 0.5 : 0.5.
Cluster
Stocks
1
LPKR AKRA AALI MNCN PPRO LSIP ADHI PTPP SSMS WIKA
SMRA LPPF BMTR SRIL BJBR          
2
BUMI PWON MYRX              
3
INDF KLBF                
4
BSDE SMGR JSMR INTP EXCL SCMA PGAS BBTN WSKT BRPT
ADRO                  
5
ASII TLKM UNVR BMRI BBNI ICBP BBRI GGRM HMSP UNTR
BBCA                  
6
ANTM PTBA INCO              

Table-4. Stocks clusters using weighted k-means 0.2 : 0.8.
Cluster
Stocks
1
LPKR AKRA AALI MNCN PPRO LSIP ADHI PTPP SSMS WIKA
SMRA LPPF BMTR SRIL BJBR          
2
PWON MYRX                
3
INDF KLBF BSDE JSMR EXCL SCMA PGAS BBTN WSKT ADRO
4
SMGR INTP BRPT              
5
ASII TLKM UNVR BMRI BBNI ICBP BBRI GGRM HMSP UNTR
BBCA                  
6
BUMI ANTM PTBA INCO            

3.2. Representative Stocks from Clusters

Following the formation of the six clusters, the Sharpe ratio was calculated for every stock in each cluster, using the latest Bank Indonesia rate of 5.25% per year as the risk return free rate. Based on these calculations, stocks that were representative of each cluster were identified to create the optimum portfolio, as presented in Tables 58.

Table-5. Representative stocks from clusters using KAMILA algorithm.
Cluster
Representation
Return
Risk
Sharpe Ratio
1
BMTR
-0.000164
0.029596
-0.010416
2
INCO
0.002580
0.026729
0.091156
3
BBCA
0.000878
0.012855
0.057126
4
HMSP
0.000664
0.021462
0.024240
5
BBRI
0.000311
0.018511
0.009023
6
BRPT
0.000425
0.022963
0.012221

Table-6. Representative stocks from clusters using weighted k-means 0.8 : 0.2.
Cluster
Representation
Return
Risk
Sharpe Ratio
1
BJBR
0.000225
0.022459
0.003619
2
MYRX
0.000428
0.032915
0.008622
3
BRPT
0.000425
0.022963
0.012221
4
UNTR
0.000805
0.022303
0.029621
5
BBCA
0.000878
0.012855
0.057126
6
INCO
0.002580
0.026729
0.091156

Table-7. Representative stocks from clusters using weighted k-means 0.5 : 0.5.
Cluster
Representation
Return
Risk
Sharpe Ratio
1
BJBR
0.000225
0.022459
0.003619
2
MYRX
0.000428
0.032915
0.008622
3
KLBF
-0.000857
0.018216
-0.054946
4
ADRO
0.000490
0.028072
0.012363
5
BBCA
0.000878
0.012855
0.057126
6
INCO
0.002580
0.026729
0.091156

Table-8. Representative stocks from clusters using weighted k-means 0.2 : 0.8.
Cluster
Representation
Return
Risk
Sharpe Ratio
1
BJBR
0.000225
0.022459
0.003619
2
MYRX
0.000428
0.032915
0.008622
3
ADRO
0.000490
0.028072
0.012363
4
BRPT
0.000425
0.022963
0.012221
5
BBCA
0.000878
0.012855
0.057126
6
INCO
0.002580
0.026729
0.091156

3.3. Detection of Outliers in Representative Stocks from Clusters 

The Mahalanobis distance, introduced in Section 2.4, was used to determine the outliers in in the representative stocks. The mahalanobis and qchisq functions in the R package were used, with a 97.5% threshold for the distance, on 260 daily return data of the LQ45 index stocks for the period August 2017–July 2018. Table 9 presents the number of outliers for the representative stocks identified by the four clustering methods, while Figure 1 depicts the outliers from KAMILA clustering. 

Table-9. Number of outliers in representative stocks from clusters.
Clustering Method
Number of Outliers
Percentage
KAMILA
16
6.15
Weighted k-means 0.8 : 0.2
15
5.77
Weighted k-means 0.5 : 0.5
18
6.92
Weighted k-means 0.2 : 0.8
18
6.92

Figure-1. Outliers of Stocks Representation with KAMILA Clustering.

3.4. Portfolio Weightings and Comparison of Portfolio Performance

Table-10. Portfolio weight of MV classic and robust portfolio using KAMILA algorithm.


Table-11. Portfolio weight of MV classic and robust portfolio using weighted k-means 0.8 : 0.2.


Table-12. Portfolio weight of MV classic and robust portfolio with weighted k-means 0.5 : 0.5.

Table-13. Portfolio weight of MV classic and robust portfolio with weighted k-means 0.2 : 0.8.

Based on the portfolio weightings, as well as the mean vectors and covariance matrices, the Sharpe ratio was calculated for the three portfolio models, as presented in Table 14.

Table-14.  The Sharpe ratio of MV classic portfolio and robust portfolios with KAMILA algorithm and weighted k-means.

4. DISCUSSIONS

Having formed six stocks clusters from the LQ45 index through KAMILA and weighted k-means clustering algorithms, shown in Tables 1–4, the Sharpe ratio was calculated for every stock in each cluster to identify the stocks that were representative of each cluster, presented in Tables 5. Those representative stocks with the highest Sharpe ratios, those identified as the best performing in the cluster, were selected for inclusion in the portfolio. Thus, following KAMILA clustering algorithm,  Table 5 shows that BMTR stocks, with a Sharpe ratio of –0.010416, were chosen from Cluster 1, and INCO, BBCA, HMSP, BBRI, and BRPT stocks for Clusters 2, 3, 4, 5, and 6, respectively. The same process was followed for the weighted k-means clusterings, the results of which can be seen in Tables 6–8.

Table 9, and Figure 1 in relation to KAMILA clustering, indicates the number of outliers found for each clustering method: 16 (6.15%) using KAMILA clustering, and 15 (5.77%), 18 (6.92%), and 18 (6.92%) for weighted k-means 0.8 : 0.2, 0.5 : 0.5, and 0.2 : 0.8 clustering, respectively. Based on these results, it is reasonable to employ a robust estimation method to create optimum portfolios.

All the clustering methods produced stocks with negative returns (e.g., Table 10 shows negative weighting for BMTR stock), denoting short selling, in all four portfolio models, across every risk aversion value γ (see Tables 1113). In contrast, stocks with large returns are always shown with positive weightings in all four portfolio models (e.g., INCO stocks in Table 10). Table 10 also illustrates that the greater the  value, the smaller the weightings of the stocks.

When assessing portfolio performance, though, the risks taken by the investor should also be considered. One way to measure both returns and risks is to calculate the Sharpe ratio, which is shown in Table 14 for all four portfolio models developed through KAMILA and weighted k-means clusterings. These results reveal that the MV portfolio based on KAMILA clustering and robust FMCD estimation outperformed the other alternatives in this empirical study.

5. CONCLUSIONS

The current study demonstrated how to integrate clustering techniques for mixed-type data—continuous and categorical—into portfolio management and create an optimum portfolio. To establish the advantages of this proposed method, the performance of the optimum portfolio was compared to that of the optimum portfolio created through weighted k-means clustering, which can also be applied to mixture-type data. The final results revealed that the best portfolio performance was obtained by combining KAMILA clustering with robust FMCD estimation.

Funding: This study received no specific financial support.  

Competing Interests: The authors declare that they have no competing interests.

Acknowledgement: The first author wishes to thank the Indonesia Endowment Fund for Education (Lembaga Pengelola Dana Pendidikan; LPDP) from the Indonesian Ministry of Finance for the scholarship funding of his Doctoral Program at the Faculty of Mathematics and Natural Sciences of Gadjah Mada University. The authors also acknowledge the financial support from PDD 2020 Program from Deputy of Research and Development, Ministry of Research and Technology / National Agency for Research and Innovation of Republic of Indonesia.

REFERENCES

Bengtsson, C. (2004). The impact of estimation error on portfolio selection for investor with constant relative risk aversion. Working Paper. Department of Economics. Lund University.

Best, M. J., & Grauer, R. R. (1991). On the sensitivity of mean-variance-efficient portfolios to changes in asset means: Some analytical and computational results. The Review of Financial Studies, 4(2), 315-342. Available at: https://doi.org/10.1093/rfs/4.2.315 .

Broadie, M. (1993). Computing efficient frontiers using estimated parameters. Annals of Operations Research, 45(1), 21-58. Available at: https://doi.org/10.1007/bf02282040 .

Ceria, S., & Stubbs, R. A. (2006). Incorporating estimation errors into portfolio selection: Robust portfolio construction. Journal of Asset Management, 7(2), 109-127. Available at: https://doi.org/10.1057/palgrave.jam.2240207 .

Chen, L.-H., & Huang, L. (2009). Portfolio optimization of equity mutual funds with fuzzy return rates and risks. Expert Systems with Applications, 36(2), 3720-3727. Available at: https://doi.org/10.1016/j.eswa.2008.02.027 .

Chopra, V. K., & Ziemba, W. T. (1993). The effects of errors in means, variances, and covariances on optimal portfolio choice. Journal of Portfolio Management, 19(2), 6-11.

Davies, P. L. (1987). Asymptotic behaviour of $ S $-estimates of multivariate location parameters and dispersion matrices. The Annals of Statistics, 15(3), 1269-1292. Available at: https://doi.org/10.1214/aos/1176350505 .

DeMiguel, V., & Nogales, F. J. (2008). Portfolio selection with robust estimation. Journal of Operation Research, 57(3), 560 - 577.

Filzmoser, P., Garrett, R. G., & Reimann, C. (2005). Multivariate outlier detection in exploration geochemistry. Computers & Geosciences, 31(5), 579-587. Available at: https://doi.org/10.1016/j.cageo.2004.11.013 .

Foss, A. H., & Markatou, M. (2018). kamila: clustering mixed-type data in R and Hadoop. Journal of Statistical Software, 83(1), 1-44.

Foss, A., Markatou, M., Ray, B., & Heching, A. (2016). A semiparametric method for clustering mixed data. Machine Learning, 105(3), 419-458. Available at: https://doi.org/10.1007/s10994-016-5575-7 .

Gnanadesikan, R. (1977). Methods for the statistical data analysis of multivariate observations. New York: Wiley.

Guan, H. S., & Jiang, Q. S. (2007). Cluster financial time series for portfolio. Paper presented at the Int. Conf. on Wavelet Analysis and Pattern Recognition, ICWAPR, Beijing, China.

Hardin, J. S. (2000). Multivariate outlier detection and robust clustering with minimum covariance determinant estimation and s-estimation. Dissertation. University of California.  

Hu, J. (2012). An empirical comparison of different approaches in portfolio selection. Project Report 2012:7. Uppsala Universitet.

Kerdprasop, K., Kerdprasop, N., & Sattayatham, P. (2005). Weighted K-means for density-biased clustering. In: Tjoa A.M., Trujillo J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2005. Lecture Notes in Computer Science (Vol. 3589). Berlin, Heidelberg: Springer.

Kusch, P. (2012). Commodities or not commodities. Portfolio optimization with robust mean-variance and mean-conditional value at risk strategies. Project Report. Umea University Department of Economics.

Lauprete. (2001). Portfolio risk minimization under departures from normality. Disertation. Massachusetts Institue of Technology.

Long, N. C., Wisitpongphan, N., Meesad, P., & Unger, H. (2014). Clustering stock data for multi-objective portfolio optimization. International Journal of Computational Intelligence and Applications, 13(02), 1-13. Available at: https://doi.org/10.1142/s1469026814500114 .

Lopuhaa, H. P. (1989). On the relation between S-estimators and M-estimators of multivariate location and covariance. The Annals of Statistics, 17(4), 1662-1683. Available at: https://doi.org/10.1214/aos/1176347386 .

Markowitz, H. M. (1952). Portfolio selection. Journal of Finance, 7(1), 77-91.

Nanda, S., Mahanty, B., & Tiwari, M. (2010). Clustering Indian stock market data for portfolio management. Expert Systems with Applications, 37(12), 8793-8798. Available at: https://doi.org/10.1016/j.eswa.2010.06.026.

R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Retrieved from: https://www.R-project.org/ .

Rousseeuw, P. J., & Driessen, K. V. (1999). A fast algorithm for the minimum covariance determinant estimator. Technometrics, 41(3), 212-223. Available at: https://doi.org/10.1080/00401706.1999.10485670 .

Rousseeuw, P., & Yohai, V. (1984). Robust regression by means of S-estimators. In Robust and Nonlinear Time Series Analysis. Lecture Notes in Statistics (Vol. 26, pp. 256 - 272). New York: Springer.

Scott, D. W. (1992). Multivariate density estimation: Theory, practice, and visualization. New York: John Wiley & Sons Inc.

Sharpe, W. F. (1994). The sharpe ratio. Journal of Portfolio Management, 21(1), 49-58. Available at: https://doi.org/10.3905/jpm.1994.409501 .

Supandi, E. D. (2017). Developing of mean-variance portfolio modeling using robust estimastin and robust optimization method. Ph.D. Dissertation. Mathematics Department Gadjah Mada University, Indonesia. 

Tola, V., Lillo, F., Gallegati, M., & Mantegna, R. N. (2008). Cluster analysis for portfolio optimization. Journal of Economic Dynamics and Control, 32(1), 235-258.

Vaz De Melo, B., & Camara, R. P. (2003). Robust multivariate modelling in finance. Working Paper Series 355, Federal University at Rio de Janeiro, Rio de Janeiro, Brazil.

Victoria-Feser, M. P. (2000). Robust portfolio selection. Research Report. Universite of Geneve.

Welsch, R. E., & Zhou, X. (2007). Application of robust statistics to asset allocation models. REVSTAT–Statistical Journal, 5(1), 97-114.

Winston, W. L., & Goldberg, J. B. (2004). Operations research: Applications and algorithms (4th ed.). Canada: Thomson Learning Inc.

Zhou, X. (2006). Application of  robust statistics to asset allocation models. Thesis. Massachusetts Institute of Technology.  

Views and opinions expressed in this article are the views and opinions of the author(s), Asian Economic and Financial Review shall not be responsible or answerable for any loss, damage or liability etc. caused in relation to/arising out of the use of the content.