学位论文

【摘要】

As the demand for high-performance electronic devices has increased, the semiconductor manufacturing process is being developed centering on the production of multi-chip products. In multi-chip products, re-entrance occurs by repeating the process several times in the packaging line, and the setup change of equipment is frequently incurred. These are major factors that make the scheduling of the semiconductor packaging line difficult. The production environment frequently changes due to internal and external variabilities. In addition, since the calculation time required for scheduling is very important at the manufacturing site, prompt schedule generation is required. As the research of the semiconductor packaging line scheduling becomes active, the reinforcement learning based scheduling research aiming at the global optimization is increasing. In view of the utilization of scheduling research based on reinforcement learning, there is a need for a method capable of reacting to various production environment changes and obtaining a good schedule in a short time.This study aims at obtaining the robustness of the scheduling model based on deep reinforcement learning. We propose a regularzied training method for semiconductor packaging lines scheduling based on deep reinforcement learning without performance degradation and re-training when a new production environment is given as a test data. In order to apply reinforcement learning to flexible job-shop scheduling problem, we designed state, action and reward considering overall process and trained deep Q network which is a representative algorithm of deep reinforcement learning. The regularzied training method proposed in this study is divided into four stages and designed to train the generalities of the problems reflected in various production environment and the specificity of each problem. Experiments were conducted using scheduling problems of different complexity, and it was verified that the performance was superior to other scheduling models based on rule-based and deep reinforcement learning.This study is the first research that focuses on the robustness of the model in the reinforcement learning based scheduling. Moreover, the result of this study enhances the practicality of research in real factory application.

【预览】

附件列表
Files	Size	Format	View
정규화 학습을 이용한 심층 강화학습 기반 반도체 패키징 라인 스케줄링 기법의 강건성 향상	5916KB	PDF	download


정규화 학습을 이용한 심층 강화학습 기반 반도체 패키징 라인 스케줄링 기법의 강건성 향상
반도체 패키징 라인;유연 잡샵 스케줄링;강건성;심층 강화학습;정규화 학습;670.42

University:서울대학교 대학원
关键词: 반도체 패키징 라인; 유연 잡샵 스케줄링; 강건성; 심층 강화학습; 정규화 학습; 670.42;
Others : http://s-space.snu.ac.kr/bitstream/10371/161031/1/000000157767.pdf
美国\|英语
来源: Seoul National University Open Repository
PDF


	文献评价指标
	下载次数：0次	浏览次数：1次

【 摘 要 】

【 预 览 】

【摘要】

【预览】