Social media based digital file size estimation method using sampling technique with \alpha control chart in big data
- سال انتشار: 1403
- محل انتشار: مجله آنالیز غیر خطی و کاربردها، دوره: 15، شماره: 9
- کد COI اختصاصی: JR_IJNAA-15-9_029
- زبان مقاله: انگلیسی
- تعداد مشاهده: 167
نویسندگان
Department of Computer Science and Applications, Dr. Harisingh Gour Vishwavidyalaya (M.P.), India
Department of Mathematics and Statistics, Dr. Harisingh Gour Vishwavidyalaya (M.P.), India
چکیده
Due to the emergence of social networking platforms, a large number of users around the world are being part and partial of this platform. At a fraction of the time users on social media are communicating digital files in the form of text, video, images, voice and music which ultimately generates big data. The matter of interest is to estimate precisely the average file size at time duration (occasion). The time may hours or days or months. This paper presents a sample-based methodology to deal with mean size estimation of digital communication content spreading on a social media platform. An estimator is suggested using a random sample from big data and its properties are derived. A simulation method is suggested that computes the confidence interval (CI) for the prediction of précised range of digital file size. The proposed method produces an optimal confidence interval at the suitable choice of constant. These estimated confidence intervals can be used for developing \alpha-control charts for constant monitoring of the growth in file size in social media storage at the data centre. If the growth of mean digital file size crosses the upper limit then additional storage infrastructure is needed at the administration level of the social media site. One can generate machine learning algorithms proposed method for monitoring the growth of average digital file size over time duration.کلیدواژه ها
Big-Data, Sampling, estimation, Social media, Simulation, Confidence Interval (CI), Bias, MSE, Optimum Choice, Control Chart, α-Control Chartاطلاعات بیشتر در مورد COI
COI مخفف عبارت CIVILICA Object Identifier به معنی شناسه سیویلیکا برای اسناد است. COI کدی است که مطابق محل انتشار، به مقالات کنفرانسها و ژورنالهای داخل کشور به هنگام نمایه سازی بر روی پایگاه استنادی سیویلیکا اختصاص می یابد.
کد COI به مفهوم کد ملی اسناد نمایه شده در سیویلیکا است و کدی یکتا و ثابت است و به همین دلیل همواره قابلیت استناد و پیگیری دارد.