Multidimensional Spatiotemporal Economic Analysis of Fine Dust - 2019-2020 3rd SSK Networking Symposium (최상원)

Title: 2019-2020 3rd SSK Networking Symposium (최상원)

Posted: 2020-02-14

Research title: A machine learning prediction of PM2.5 (㎍/m³) in Seoul
Date: 2020.02.14
Venue: LW Convention
Research Background and Purpose
As quality of life has improved, people have begun to care more about environmental factors that affect health. In the same vein, particulate matter (PM, ㎍/m³) has emerged as a serious regional problem in Korea, and many research results on PM have been reported. Moreover, in 2018 the damage cost of fine dust in Korea reached 3.3 billion dollars. The government therefore needs PM reduction policies that minimize this damage, with the contributing variables ranked in order of priority. Accordingly, the purpose of this study is to select a PM prediction model and show which variables most strongly affect PM, so that the results can serve as a reference for PM reduction policy.
Research Method
The model is based on machine learning (ML) algorithms. Linear Regression, Decision Tree Regression, Support Vector Regression, and Random Forest Regression (a tree-based ensemble method) are used to predict PM2.5 in Seoul. The dependent variable is the monthly PM2.5 concentration in Seoul; a monthly series is used because the PM2.5 level in Seoul fluctuates by season (high in spring and winter, low in summer and autumn). In addition, many reports state that PM concentration in Korea is affected by environmental factors in China; the independent variables are therefore divided into two groups, those from Korea and those from China. The independent variables from China are the PM2.5 concentration levels in Shandong, Hebei, and Jiangsu. These regions show high PM2.5 concentrations from November to the following April and lower concentrations from May to October, a pattern similar to that of PM2.5 in Seoul. The independent variables from Korea focus on the number of cars and the monthly ratio of west wind. The number of cars is divided into 3 sectors (official, business, and personal vehicles), and each sector is divided into 4 types (passenger car, van, truck, and special vehicle). The trend in fuel consumption changes little over time, whereas the seasonal variation in the ratio of west wind is similar to that of PM2.5 in Seoul. All data cover January 2015 to October 2019, giving 58 monthly samples. The data set is split into 70% for training and 30% for testing. After each model is built, root mean square error (RMSE) is used for evaluation.
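The 70/30 split and the RMSE evaluation described above can be sketched as follows. This is a minimal illustration in plain Python, not the study's code: the helper names `split_train_test` and `rmse`, the random shuffle, and the seed are assumptions, since the abstract does not say whether the split was random or chronological.

```python
import math
import random


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    if len(actual) != len(predicted):
        raise ValueError("sequences must have the same length")
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def split_train_test(rows, train_ratio=0.7, seed=0):
    """Shuffle rows reproducibly, then split them into train and test lists.

    Assumption: a random split. A chronological split would skip the shuffle.
    """
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_train = int(len(shuffled) * train_ratio)
    return shuffled[:n_train], shuffled[n_train:]


# 58 monthly observations, January 2015 through October 2019,
# represented here only by their (year, month) labels.
months = [(2015 + i // 12, i % 12 + 1) for i in range(58)]

train, test = split_train_test(months)
print(len(train), len(test))  # prints: 40 18

# Toy values only, to show how the metric is applied to predictions.
print(rmse([20.0, 30.0, 25.0], [22.0, 27.0, 25.0]))
```

With 58 monthly samples, a 70% training share leaves 40 months for training and 18 for testing, which illustrates how little data each model sees.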
Research Results
Each model was trained in order to choose the best model for predicting PM2.5 in Seoul and to assess variable importance. The results show that Random Forest Regression is the best model for predicting PM2.5, and that the variables ‘Yangzhou city’, ‘ratio of west wind’, and ‘month’ have the highest importance for the prediction. However, this study has the following limitation. The sample is small because the Chinese government provides PM2.5 data for some cities only from January 2015. The 58 samples used are not enough to train machine learning models adequately. This leads to insufficient training and a high RMSE for each model, even after hyperparameter tuning. In follow-up research, changing the train-test split ratio and adding new variables to the model are expected to address the sample-size problem and improve model performance.
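
The abstract does not state how variable importance was computed. One model-agnostic option is permutation importance: shuffle one input column and measure how much the RMSE rises. The sketch below is an illustration under that assumption; the column names, the toy data, and `toy_model` (a stand-in for a fitted regressor) are all hypothetical.

```python
import math
import random


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )


def permutation_importance(predict, rows, targets, n_repeats=20, seed=0):
    """Mean increase in RMSE when one column is shuffled, per column."""
    rng = random.Random(seed)
    baseline = rmse(targets, [predict(r) for r in rows])
    importances = []
    for col in range(len(rows[0])):
        increases = []
        for _ in range(n_repeats):
            column = [r[col] for r in rows]
            rng.shuffle(column)
            # Rebuild each row with only the chosen column replaced.
            permuted = [r[:col] + (v,) + r[col + 1:] for r, v in zip(rows, column)]
            score = rmse(targets, [predict(r) for r in permuted])
            increases.append(score - baseline)
        importances.append(sum(increases) / n_repeats)
    return importances


# Hypothetical toy data: (upwind_pm25, west_wind_ratio, noise) for 58 months.
data_rng = random.Random(1)
rows = [
    (data_rng.uniform(20, 80), data_rng.uniform(0, 1), data_rng.uniform(0, 1))
    for _ in range(58)
]


def toy_model(row):
    # Stand-in for a fitted model; it ignores the third column by construction.
    return 0.4 * row[0] + 10.0 * row[1]


targets = [toy_model(r) for r in rows]
scores = permutation_importance(toy_model, rows, targets)
for name, score in zip(["upwind_pm25", "west_wind_ratio", "noise"], scores):
    print(f"{name}: {score:.3f}")
```

A column the model never uses, such as `noise` here, gets an importance of zero, while columns that drive the prediction raise the RMSE when shuffled. With only 58 samples, such rankings are noisy, so averaging over several shuffles (`n_repeats`) helps.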
