Performance of Classification Methods to Evaluate Groundwater (Case Study: Shoosh Aquifer)

  • Publication year: 1393 (Solar Hijri calendar)
  • Published in: ECOPERSIA quarterly, Volume 2, Issue 2
  • COI code: JR_ECOPER-2-2_007
  • Article language: English

Authors

Mohamad Sakizadeh

Assistant professor, Faculty of Sciences, Shahid Rajaee Teacher Training University, Tehran, Iran

Abstract

The objective of this study was to classify the Shoosh Aquifer in Khuzestan Province, Iran, into several zones of differing water quality. To this end, the performance of two classification methods (discriminant function and cluster analysis) was evaluated for classifying groundwater by pollution level, with an emphasis on the problem of over-fitting to the training data. An over-fitted model generally has poor predictive performance because it can exaggerate minor fluctuations in the data. Cluster analysis (CA) was adopted to explain spatially the similarity of sampling stations with respect to the measured parameters. Three methods of variable selection were used: regularized discriminant analysis, principal component analysis, and Wilks' lambda. The best variable-selection algorithm was Wilks' lambda, which reduced the generalization error on the test sample to 0.1 for both leave-one-out and 4-fold cross-validation. The second-best algorithm was the regularized discriminant function, with misclassification errors of 0.167 and 0.133 for leave-one-out and 4-fold cross-validation, respectively. Principal component analysis did not prove to be a promising algorithm for variable selection in these classification methods.
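
The misclassification errors quoted in the abstract are cross-validation estimates. The following is a minimal sketch of how leave-one-out and 4-fold errors can be computed; it is not the study's code. It assumes a nearest-centroid classifier as a simple stand-in for the discriminant function, and the two-class toy data, the function names, and the labels "low" and "high" are all hypothetical.

```python
import random


def nearest_centroid_fit(X, y):
    """Return the mean vector of each class (stand-in for a discriminant function)."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for j, value in enumerate(row):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def nearest_centroid_predict(centroids, row):
    """Assign the row to the class whose centroid is closest (squared Euclidean)."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(row, centroids[c]))
    return min(centroids, key=sq_dist)


def kfold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and deal the indices into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cv_error(X, y, k, seed=0):
    """Fraction of samples misclassified when each fold is held out in turn."""
    n = len(X)
    wrong = 0
    for fold in kfold_indices(n, k, seed):
        held_out = set(fold)
        train = [i for i in range(n) if i not in held_out]
        model = nearest_centroid_fit([X[i] for i in train], [y[i] for i in train])
        wrong += sum(nearest_centroid_predict(model, X[i]) != y[i] for i in fold)
    return wrong / n


def make_toy_data(n_per_class=15, seed=1):
    """Hypothetical two-parameter samples for two pollution classes."""
    rng = random.Random(seed)
    X, y = [], []
    for label, centre in (("low", 0.0), ("high", 3.0)):
        for _ in range(n_per_class):
            X.append([rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)])
            y.append(label)
    return X, y


X, y = make_toy_data()
loo_error = cv_error(X, y, k=len(X))   # leave-one-out: one sample per fold
four_fold_error = cv_error(X, y, k=4)  # 4-fold cross-validation
print(loo_error, four_fold_error)
```

Setting `k` equal to the number of samples gives leave-one-out, so the same routine covers both schemes named in the abstract. Because every sample is predicted by a model that never saw it, the resulting error is less prone to the optimism of an over-fitted training error.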

Keywords

Cluster Analysis, Discriminant function, groundwater quality, Over-fitting, Variable selection
