SCI, SCI-E, SCOPUS Indexed Journals
(SICE) Yeon-Ji Park, Geun-Je Yang, Chae-Bong Sohn* and Soo Jun Park* “GPDminer: a tool for extracting named entities and analyzing relations in biological literature”, BMC Bioinformatics 25.101 (2024): 1-18. (IF 3.0 Q2)
Abstract:
Purpose: The expansion of research across various disciplines has led to a substantial increase in published papers and journals, highlighting the necessity for reliable text mining platforms for database construction and knowledge acquisition. This abstract introduces GPDMiner(Gene, Protein, and Disease Miner), a platform designed for the biomedical domain, addressing the challenges posed by the growing volume of academic papers.
Methods: GPDMiner is a text mining platform that utilizes advanced information retrieval techniques. It operates by searching PubMed for specific queries, extractingand analyzing information relevant to the biomedical field. This system is designed to discern and illustrate relationships between biomedical entities obtained from automated information extraction.
Results: The implementation of GPDMiner demonstrates its efficacy in navigatingthe extensive corpus of biomedical literature. It efficiently retrieves, extracts, and analyzes information, highlighting significant connections between genes, proteins,and diseases. The platform also allows users to save their analytical outcomes in various formats, including Excel and images.
Conclusion: GPDMiner offers a notable additional functionality among the arrayof text mining tools available for the biomedical field. This tool presents an effective solution for researchers to navigate and extract relevant information from the vast unstructured texts found in biomedical literature, thereby providing distinctive capabilities that set it apart from existing methodologies. Its application is expected to greatly benefit researchers in this domain, enhancing their capacity for knowledge discovery and data management.
(SCIE) Do-Yup Kim , Chae-Bong Sohn , and Hyun-Suk Lee “Dynamic Joint Scheduling of Anycast Transmission and Modulation in Hybrid Unicast-Multicast SWIPT-Based IoT Sensor Networks”, IEEE Sensors Journal 23.24 (2023): 31345-31358. (IF 4.3 Q1)
Abstract: Simultaneous wireless information and power transfer (SWIPT) technologies are vital in powering Internetof-Things (IoT) sensor networks. Despite their importance, the traditionally used separate receiver (SR) architecture with a time- or power-splitting (TS/PS) mode in SWIPT usually results in high energy consumption, especially during the information decoding (ID) process due to energy-intensive local oscillators and mixers. To overcome this, an integrated receiver (IR) architecture has been introduced, sparking the development of compatible SWIPT modulation schemes. However, the aspect of modulation scheduling for IR architecture in SWIPT-based IoT sensor networks appears to be little explored. This article bridges this research gap by proposing a joint unicast/multicast, IoT sensor, and modulation (UMSM) scheduling algorithm. We use mathematical modeling and optimization methods to maximize the weighted sum of average unicast service throughput and energy harvested by IoT sensors, while ensuring minimal average throughput for both unicast and multicast services, along with the minimum average harvested energy. Our simulation results demonstrate the effectiveness of our algorithm in improving energy harvesting (EH) and throughput performance while maintaining necessary constraints.
(SCIE) Yeon-Ji Park, Min-a Lee, Geun-Je Yang, Soo Jun Park* and Chae-Bong Sohn* “Web Interface of NER and RE with BERT for Biomedical Text Mining”, Applied Sciences 13.5163 (2023): 1-11. (IF 2.838 Q2)
Abstract: The BioBERT Named Entity Recognition (NER) model is a high-performance model designed to identify both known and unknown entities. It surpasses previous NER models utilized by text-mining tools, such as tmTool and ezTag, in effectively discovering novel entities. In previous studies, the Biomedical Entity Recognition and Multi-Type Normalization Tool (BERN) employed this model to identify words that represent specific names, discern the type of the word, and implement it on a web page to offer NER service. However, we aimed to offer a web service that includes Relation Extraction (RE), a task determining the relation between entity pairs within a sentence. First, just like BERN, we fine-tuned the BioBERT NER model within the biomedical domain to recognize new entities. We identified two categories: diseases and genes/proteins. Additionally, we fine-tuned the BioBERT RE model to determine the presence or absence of a relation between the identified gene-disease entity pairs. The NER and RE results are displayed on a web page using the Django web framework. NER results are presented in distinct colors, and RE results are visualized as graphs in NetworkX and Cytoscape, allowing users to interact
(SCIE) Dae-Yeol Kim, Soo-Young Cho, Kwangkee Lee* and Chae-Bong Sohn* “A Study of Projection-Based Attentive Spatial-Temporal Map for Remote Photoplethysmography Measurement”, Bioengineering 9.638 (2022): 1-14. (IF 5.046 Q2)
Abstract: The photoplethysmography (PPG) signal contains various information that is related to CVD (cardiovascular disease). The remote PPG (rPPG) is a method that can measure a PPG signal using a face image taken with a camera, without a PPG device. Deep learning-based rPPG methods can be classified into three main categories. First, there is a 3D CNN approach that uses a facial image video as input, which focuses on the spatio-temporal changes in the facial video. The second approach is a method that uses a spatio-temporal map (STMap), and the video image is pre-processed using the point where it is easier to analyze changes in blood flow in time order. The last approach uses a preprocessing model with a dichromatic reflection model. This study proposed the concept of an axis projection network (APNET) that complements the drawbacks, in which the 3D CNN method requires significant memory; the STMap method requires a preprocessing method; and the dyschromatic reflection model (DRM) method does not learn long-term temporal characteristics. We also showed that the proposed APNET effectively reduced the network memory size, and that the low-frequency signal was observed in the inferred PPG signal, suggesting that it can provide meaningful results to the study when developing the rPPG algorithm.
(SCIE) Yeon-Ji Park, Min-a Lee, Geun-Je Yang, Soo Jun Park* and Chae-Bong Sohn* “Biomedical Text NER Tagging Tool with Web Interface for Generating BERT-Based Fine-Tuning Dataset”, Applied Sciences 12.12012 (2022): 1-13. (IF 2.838 Q2)
Abstract: In this paper, a tagging tool is developed to streamline the process of locating tags for each term and manually selecting the target term. It directly extracts the terms to be tagged from sentences and displays it to the user. It also increases tagging efficiency by allowing users to reflect candidate categories in untagged terms. It is based on annotations automatically generated using machine learning. Subsequently, this architecture is fine-tuned using Bidirectional Encoder Representations from Transformers (BERT) to enable the tagging of terms that cannot be captured using Named-Entity Recognition (NER). The tagged text data extracted using the proposed tagging tool can be used as an additional training dataset. The tagging tool, which receives and saves new NE annotation input online, is added to the NER and RE web interfaces using BERT. Annotation information downloaded by the user includes the category (e.g., diseases, genes/proteins) and the list of words associated to the named entity selected by the user. The results reveal that the RE and NER results are improved using the proposed web service by collecting more NE annotation data and fine-tuning the model using generated datasets. Our application programming interfaces and demonstrations are available to the public at via the website link provided in this paper.
(SCIE) Dae-Yeol Kim, Kwangkee Lee* and Chae-Bong Sohn*, “Assessment of ROI Selection for Facial Video-Based rPPG”, Sensors 21.7923 (2021): 1-15. (IF 3.576 Q1)
Abstract: In general, facial image-based remote photoplethysmography (rPPG) methods use colorbased and patch-based region-of-interest (ROI) selection methods to estimate the blood volume pulse (BVP) and beats per minute (BPM). Anatomically, the thickness of the skin is not uniform in all areas of the face, so the same diffuse reflection information cannot be obtained in each area. In recent years, various studies have presented experimental results for their ROIs but did not provide a valid rationale for the proposed regions. In this paper, to see the effect of skin thickness on the accuracy of the rPPG algorithm, we conducted an experiment on 39 anatomically divided facial regions. Experiments were performed with seven algorithms (CHROM, GREEN, ICA, PBV, POS, SSR, and LGI) using the UBFC-rPPG and LGI-PPGI datasets considering 29 selected regions and two adjusted regions out of 39 anatomically classified regions. We proposed a BVP similarity evaluation metric to find a region with high accuracy. We conducted additional experiments on the TOP-5 regions and BOT-5 regions and presented the validity of the proposed ROIs. The TOP-5 regions showed relatively high accuracy compared to the previous algorithm’s ROI, suggesting that the anatomical characteristics of the ROI should be considered when developing a facial image-based rPPG algorithm
(SCIE) Soo-Young Cho, Dae-Yeol Kim, Su-Yeong Oh and Chae-Bong Sohn*, “Reducing System Load of Effective Video Using a Network Model”, Applied Sciences 11.9665 (2021): 1-18. (IF 2.679 Q2)
Abstract: Recently, as non-face-to-face work has become more common, the development of streaming services has become a significant issue. As these services are applied in increasingly diverse fields, various problems are caused by the overloading of systems when users try to transmit high-quality images. In this paper, SRGAN (Super Resolution Generative Adversarial Network) and DAIN (Depth-Aware Video Frame Interpolation) deep learning were used to reduce the overload that occurs during real-time video transmission. Images were divided into a FoV (Field of view) region and a non-FoV (Non-Field of view) region, and SRGAN was applied to the former, DAIN to the latter. Through this process, image quality was improved and system load was reduced.
(SCIE) Yoojin Jeong, and Chae-Bong Sohn*, “Readily Design and Try-On Garments by Manipulating Segmentation Images”, Electronics 9.1553 (2020): 1-11. (IF 2.412 Q2)
Abstract: Recently, fashion industries have introduced artificial intelligence to provide new services, and research to combine fashion design and artificial intelligence has been continuously conducted. Among them, generative adversarial networks that synthesize realistic-looking images have been widely applied in the fashion industry. In this paper, a new apparel image is created using a generative model that can apply a new style to a desired area in a segmented image. It also creates a new fashion image by manipulating the segmentation image. Thus, interactive fashion image manipulation, which enables users to edit images by controlling segmentation images, is possible. This allows people to try new styles without the pain of inconvenient travel or changing clothes. Furthermore, they can easily determine which color and pattern suits the clothes they wear more, or whether the clothes other people wear match their clothes. Therefore, user-centered fashion design is possible. It is useful for virtually trying on or recommending clothes.
(SCIE) Chan-Il Park, and Chae-Bong Sohn*, “Data Augmentation for Human Keypoint Estimation Deep Learning based Sign Language Translation”, Electronics 9.1257 (2020): 1-9. (IF 2.412 Q2)
Abstract: Deep learning technology has developed constantly and is applied in many fields. In order to correctly apply deep learning techniques, sucient learning must be preceded. Various conditions are necessary for sucient learning. One of the most important conditions is training data. Collecting sucient training data is fundamental, because if the training data are insucient, deep learning will not be done properly. Many types of training data are collected, but not all of them. So, we may have to collect them directly. Collecting takes a lot of time and hard work. To reduce this eort, the data augmentation method is used to increase the training data. Data augmentation has some common methods, but often requires dierent methods for specific data. For example, in order to recognize sign language, video data processed with openpose are used. In this paper, we propose a new data augmentation method for sign language data used for learning translation, and we expect to improve the learning performance, according to the proposed method.
(SCIE) Tegg Taekyong Sung, Jeongsoo Ha, Jeewoo Kim, Alex Yahja, Chae-Bong Sohn*, and Bo Ryu, “DeepSoCS: A Neural Scheduler for Heterogeneous System-on-Chip (SoC) Resource Scheduling”, Electronics 9.936 (2020): 1-16. (IF 2.412 Q2)
Abstract: In this paper, we present a novel scheduling solution for a class of System-on-Chip (SoC) systems where heterogeneous chip resources (DSP, FPGA, GPU, etc.) must be efficiently scheduled for continuously arriving hierarchical jobs with their tasks represented by a directed acyclic graph. Traditionally, heuristic algorithms have been widely used for many resource scheduling domains, and Heterogeneous Earliest Finish Time (HEFT) has been a dominating state-of-the-art technique across a broad range of heterogeneous resource scheduling domains over many years. Despite their long-standing popularity, HEFT-like algorithms are known to be vulnerable to a small amount of noise added to the environment. Our Deep Reinforcement Learning (DRL)-based SoC Scheduler (DeepSoCS), capable of learning the “best” task ordering under dynamic environment changes, overcomes the brittleness of rule-based schedulers such as HEFT with significantly higher performance across different types of jobs. We describe a DeepSoCS design process using a real-time heterogeneous SoC scheduling emulator, discuss major challenges, and present two novel neural network design features that lead to outperforming HEFT: (i) hierarchical job- and task-graph embedding; and (ii) efficient use of real-time task information in the state space. Furthermore, we introduce effective techniques to address two fundamental challenges present in our environment: delayed consequences and joint actions. Through an extensive simulation study, we show that our DeepSoCS exhibits the significantly higher performance of job execution time than that of HEFT with a higher level of robustness under realistic noise conditions. We conclude with a discussion of the potential improvements for our DeepSoCS neural scheduler.
(SCOPUS) Eunsu Goh, Daeyeol Kim, Suyeong Oh, and Chae-Bong Sohn*, “Automatic Effect Generation Method for 4D Films”, International Journal of Computing and Digital Systems, 9.2 (2020): 291-298.
Abstract: The 4D film is a technology that stimulates the viewer’s senses by using motion chairs and special equipment to increase immersion. 4D movies have recently gained enormous popularity by satisfying the five senses of users by using water spray and wind scent of motion chairs. Recently, efforts have been made to apply 4D systems to personal equipment such as mobile devices. However, to create 4D content that can be used on 4D devices, a large number of skilled workers have to make manual effects for several decades. In this paper, we propose a method of generating 4d effects by classifying audio signals and motion of important objects in video using 4D movie’s program stream.
(SCOPUS) Yoojin Jeong, Kyoung Chul Kim, Kwang-Chul Son, and Chae-Bong Sohn*, “A Hanbok Design and Improve the Results using GAN”, International Journal of Engineering Research and Technology, 12.12 (2020): 3038-3040.
Abstract: In this study, Generative adversarial network (GAN) was used to design Korean traditional clothes, Hanbok. Style transfer methods are used to create Hanbok images based on contour images of Hanbok by learning domain translation between color domain and edge domain with GAN algorithm. Among the Style transfer methods, DiscoGAN was used. Furthermore, CycleGAN and SRGAN were used to improve the resulted images of DiscoGAN.
(SCOPUS) Sooyoung Cho, Daeyeol Kim, Sinwoo Yoo, Kyunghak Lee, Chae-Bong Sohn. “Automatic Music Selection Algorithm Based on Background Image.” International Journal of Innovative Technology and Exploring Engineering (IJITEE) 8.8S2 (2019): 332-335.
Abstract Background/Objectives: Game music has the characteristic in which determined music is repeated according to the area in the game.
Methods/Statistical analysis: In this paper, we propose an algorithm in which various music is repeated in game. The game background is extracted to the image by utilizing the screen-shot function. First, gave the histogram of similar images. The classification of the background is determined using the learned histogram, and one of the music corresponding to the tag created by the user is reproduced.
Findings: For each image, a histogram was determined. RGB and lab histograms are represented through the table. As a result, you can see that game screenshots and other images were judged to be similar images when they were entered.
Improvements/Applications: It can be used for video processing and other editing functions. Learning through algorithms can be used in many ways.
(SCOPUS) Sang-Geun Choi, and Chae-Bong Sohn. “Detection of HGG and LGG Brain Tumors using U-Net.” Medico-Legal Update 19.1 (2019): 560-565.
Background/Objectives: Advancement in medical equipment has enabled accurate and quick diagnosis in medical field. However, an increase in the number of medical staff is slower than the rate of medical equipment development. It has resulted in increased risk of diagnostic misinterpretation. The purpose of this paper is to help diagnosis of medical staff through artificial neural network (ANN).
Methods/Statistical analysis: We selected U-Net among artificial neural networks. U-Net is highly accurate in medical imaging. The dataset for learning the network was obtained from the Brain Tumor Segmentation Challenge (BraTS). This dataset contains four classes of brain tumor data and it is suitable for learning variety of brain tumors. We used F-Score to measure the accuracy of the learned network.
Findings: In this paper, we compare the performance of the network by conducting two experiments. First, we checked the learning progress of the network. Second, we compared the results of learning with mixed and single datasets. In the first experiment, when allowing the network to learn for a total of 200 generations, it was confirmed that the results of 100 generations were the most accurate. In the second experiment, the network learned by three groups of datasets. The first group consisted of HGG data only, and the second group was composed of LGG data only, and the last group was made up of mixing HGG and LGG data. When comparing the results of the first group with the third group, the accuracy of HGG patient was 0.6696 and 0.6222, respectively. Subsequently, the results of the second and the third group were 0.6315 and 0.6228, respectively.
Improvements/Applications: In this experiment, we compared the results obtained when the datasets were mixed and when they were used singly. The results show similar accuracy. However, when using a mixture of datasets, the accuracy is lower, which is enough to assist the diagnosis of the medical staff. It is expected that this will help the development of the medical image processing field by confirming the position and size of the brain tumor accurately regardless of the data of any grade for brain tumor.
(SCOPUS) Sooyoung Cho, Daeyeol Kim, Sinwoo Yoo, and Chae-Bong Sohn. “Generative Adversarial Network-Based Face Recognition Dataset Generation.” International Journal of Applied Engineering Research 13.22 (2018):15734-15739
Abstract: Facial recognition technique has many advantages than other biometric recognition solutions and recent studies and outcomes of automation process level almost the same as a human does. Applying Deep-Learning solution in this region is very common these days, but there are many obstacles to put in. This paper deals one of them of which the preparation of a certain scale of a dataset by combining existing dataset and another dataset this paper suggests. Celeb A and 2nd version of VGG face dataset are the base dataset that the discriminator agent of Generative Adversarial Network can be trained, and the generator will refer the new dataset with thousands of western portraits we added. This suggested new dataset is tested with Deep-face network as the one of existing facial recognition solutions, and we confirmed that we can use this technique for other similar dataset preprocessing layers. There are some facts to need to consider when it applies to other targets, as analyzed differences between the real facial pictures and the ones was generated.
(SCOPUS) Minyeong Gwon, Eunsu Goh and Chae-Bong Sohn. “The VR Trip Simulator with Multi Networking of Rule-based Model.” International Journal of Applied Engineering Research 13.22 (2018):15754-15757
Abstract: Unity 3D tools (‘Unity’) can be used to develop VR applications that can simulate various environments. In this paper, we develop VR Trip Simulator (‘Simulator’) for the purpose of travel. The simulator to be introduced in this paper was developed based on Rule Based Model. Rule Based Engine is added to form a State – Rule – Action structure for various models. The NPC AI, which is developed by using it, takes various actions appropriate to the situation. Simulation is carried out, the information related to the destination is automatically provided to the user, and the practicality of the simulator is enhanced. In addition, by establishing networking in the TCP / IP communication environment, it communicates with various users in real time. This increases the expertise of network programming and makes up the funniest element in the simulator, not just information.
(SCIE) Sooyoung Cho, Daeyeol Kim, Changhyung Kim, Kyoung-Yoon Jeong & Chae-Bong Sohn. “360-degree video traffic reduction using cloud streaming in mobile.” Wireless Personal Communications 105.2 (2018): 635-654.
Abstract: Recently, commercialization of 360° video streaming service and various studies are being conducted in a mobile environment. It also makes 360° video streaming service through a mobile cloud available. The mobile cloud can easily provide characteristics of the terminal and existing features of the cloud service, such as contents and service of the application, in the mobile environment. Using this function, it is applied to 360° video streaming services in mobile environment. Unlike the conventional filming method which only shows the angle that camera operator intended, 360° videos can display the direction desired by the viewer in real time by recording a view in every direction at the same time. By displaying the real-time 3D image information to the user, the viewer can have more realistic contents and interactive experience. 360° video typically has a resolution of 4 k or more, which causes network load in mobile streaming. The adaptive HTTP streaming service currently provides 360 resolution video streams in proportion to their bandwidth. However, this method is not responsible for the quality of the video. Therefore, we propose a highquality video streaming method with low network load in mobile environment. 360 video is divided into FoV (field of view) and non-FoV, and the image is transmitted with high quality for FoV and low quality for non-FoV. In this paper, we propose a method of FoV on the background frame (FBF) differentiated from the existing Http adaptive streaming method. It is possible to view high resolution video in a mobile environment while maintaining a regular level of video quality even in a non-viewing area.
(SCOPUS) Tegg Taekyong Sung, Changhyung Kim, Kyunghak Lee and Chae-Bong Sohn. “Exploring Navigation using Deep Reinforcement Learning.” International Journal of Applied Engineering Research 13.19 (2018):14447-14450
Abstract: This paper discusses a navigation system with deep reinforcement learning approach. Reinforcement learning maximizes designed reward function and can be applied diverse domains, such as vision, language, or robotics. Especially, one of the methods, model-free learns how to maximize the objective without achieving any environment information as a trial-and-error. We review recent methodologies of navigation using reinforcement learning and discuss the impact of different observation spaces from the agent. Furthermore, we experiment the navigating robot using the model-free algorithm and a physical simulator.
(SCOPUS) Daeyeol Kim, Tegg Taekyong Sung, SooYoung Cho, Gyunghak Lee and Chae-Bong Sohn. “A Single Predominant Instrument Recognition of Polyphonic Music Using CNN-based Timbre Analysis.” International Journal of Engineering & Technology, 7 (3.34) (2018): 590-593
Abstract: Classifying musical instrument from polyphonic music is a challenging but important task in music information retrieval. This work enables to automatically tag music information, such as genre classification. In previous, almost every work of spectrogram analysis has been used Short Time Fourier Transform (STFT) and Mel Frequency Cepstral Coefficient (MFCC). Recently, sparkgram is researched and used in audio source analysis. Moreover, for deep learning approach, modified convolutional neural networks (CNN) widely have been researched, but many results have not been improved drastically. Instead of improving backbone networks, we have researched on preprocessing process.
In this paper, we use CNN and Hilbert Spectral Analysis (HSA) to solve the polyphonic music problem. The HSA is performed at the fixed length of polyphonic music, and a predominant instrument is labeled at its result. We have achieved the state-of-the-art result in IRMAS dataset and 3% performance improvement in individual instruments.
(SCOPUS) Sooyoung Cho, Sang-Geun Choi, Daeyeol Kim, Gyunghak Lee and Chae-Bong Sohn. “How to Generate Image Dataset based on 3D Model and Deep Learning Method.” International Journal of Engineering & Technology, 7 (3.34) (2018): 221-225
Abstract: Performances of computer vision tasks have been drastically improved after applying deep learning. Such object recognition, object segmentation, object tracking, and others have been approached to the super-human level. Most of the algorithms were trained by using supervised learning. In general, the performance of computer vision is improved by increasing the size of the data. The collected data was labeled and used as a data set of the YOLO algorithm. In this paper, we propose a data set generation method using Unity which is one of the 3D engines. The proposed method makes it easy to obtain the data necessary for learning. We classify 2D polymorphic objects and test them against various data using a deep learning model. In the classification using CNN and VGG-16, 90% accuracy was achieved. And we used Tiny-YOLO of YOLO algorithm for object recognition and we achieved 78% accuracy. Finally, we compared in terms of virtual and real environments it showed a result of 97 to 99 percent for each accuracy.
(SCOPUS) Jin Sol Choi, Daeyeol Kim, Sooyoung Cho, Sinwoo Yoo and Chae-Bong Sohn. “Visual Speech Recognition System with Deep Neural Networks.” International Journal of Applied Engineering Research 13.15 (2018): 12073-12076
Abstract: Recent artificial intelligence manufactures based on voice recognition cannot be used by the deaf. In order to solve this problem, we present ‘Visual Speech Recognition System’ using deep learning with lip movement. This system analyzes mouth shape and process time series data through the 3-dimensional convolutional neural network and gated recurrent unit. Our visual speech recognition system deals with Korean vocabulary, and creates subtitles based on oral movements of the subjects in the video. This system recognizes individual words rather than the whole sentences. We achieved 91.8% accuracy. This system could be applicable for someone who being deaf, having the difficulty of hearing, or anyone who requires communication without the voice.
(SCOPUS) Jin Sol Choi, Daeyeol Kim, Sooyoung Cho and Chae-Bong Sohn. “Deep Learning-Based Lip Analysis System.” JP Journal of Heat and Mass Transfer SP.1 (2018): 29-33
Abstract: Recent artificial intelligence manufactures based on voice recognition cannot be used by the deaf. In order to solve this problem, we present ‘Lip Analysis System’ using deep learning with lip movement. This system analyzes mouth shape and process time series data through the 3-dimensional convolution neural network and gated recurrent unit. Our Lip Analysis System deals with Korean vocabulary, and creates subtitles based on oral movements of the subjects in the video. This system recognizes individual words rather than the whole sentences. We achieved 91.8% accuracy. This system could be applicable for someone who being deaf, having the difficulty of hearing, or anyone who requires communication without the voice.
(SCOPUS) Tegg Taekyong Sung, Daeyeol Kim, Soo Jun Park, and Chae-Bong Sohn. “Dropout Acts as Auxiliary Exploration”, International Journal of Applied Engineering Research 13.10 (2018): 7977-7982
Abstract: Deep neural networks have successfully been used in machine learning field, and scientists have been experimented that one of its methods, reinforcement learning is corresponded to the functions of basal ganglia in the brain. One of the critical issues in reinforcement learning is performing the optimal action for an agent. Commonly, this can be achieved by balancing between exploitation and exploration. Recently, dropout, one of the stochastic regularization methods, can be worked for discovering exploration. In this paper, we extend dropout as an auxiliary exploration in reinforcement learning, especially in continuous action problems. This method can be easily applied to any algorithms involving function approximator. We have empirically found the optimal dropout rates and position from layers in neural networks. Comparing to standard networks, dropout applied layers achieved higher rewards in most control tasks. Moreover, we suggest a promising methodology for developing dropout method using the probabilistic switch. With its probabilistic behavior, this can be attached to neuromorphic chip to perform dropout.
(SCOPUS) Changhyung Kim and Chae-Bong Sohn. “Smart Home AMI Service by IoT in DTV Channel.” Far East Journal of Electronics and Communications 17.4 (2017): 801-806
Abstract: AMI and smart home service based on IoT are variously provided by using new information communication devices such as smart phone and Internet TV. However, it is costly and time consuming due to the inconvenience that a user has to learn how to use a new IT (information telecommunication) device. In this respect, TV is as stable, standard, and familiar household appliance as many people have used for a long time. Especially at home, TV is more utilized than any other IT devices in terms of users experience, penetration rate, and industry standard. However, traditional TV has many limitations in using IoT service. This paper suggests a system that uses IoT service on traditional TV like smart home service and AMI by transmission DTV (Digital Television) broadcast channels. Therefore, IoT service can be used in traditional TV channels.
(SCOPUS) Changhyung Kim, Tae Kyung Sung, Kyung Chul Kim, Kyung Yoon Jeong, Seong Jeong and Chae-Bong Sohn. “Low Delay Method for PSIP Information Converter and Transmission in ATSC Digital Broadcast.” Far East Journal of Electronics and Communications SP.2 (2017): 123-129
Abstract: Traditional broadcasters who had provided the analog broadcast services have moved to digital broadcasting services due to the economic breakthrough of digital broadcasting. However, local MSO (Multi-System Operator) needs various service methods, such as changing virtual channel or reconstructing PSIP, to retransmit the digital broadcast. MPEG-2 TS (Transport Stream) which is transferred from the terrestrial digital broadcasting has various PSIP (Program and System Information Protocol). In this paper, we suggest a new method to convert the PSIP information of the MPEG-2 TS with low delay by receiving a digital terrestrial broadcast via the PSIP (Program and System Information Protocol) analysis.
(SCOPUS) Jung-Ho Kim, Yong-Suk Choi, Soon-Chul Kwon, Kwang-Chul Son, Chae-Bong Sohn and Seung-Hyun Lee. “The Influence on Changes of Visual Function by Watching 3D Images – Focused on Blink Rate and Accommodative Response -.” INFORMATION 17.12(B) (2014): 6589-6597
(SCOPUS) Jung-Ho Kim, Soon Chul Kwon, Kwang Chul Son, Chae-Bong Sohn and Seung Hyun Lee2. “Effect of 2Dimesion and 3Dimension Images on Human Factors.” International Journal of Internet, Broadcasting and Communication 6.2 (2014): 13-16
(SCOPUS) Kwang-Chul Son, Soon-Chul Kwon, Hyung-Won Jung, Chae-Bong Sohn, “The Characteristics of the Crystal of CdSe thin films fabricated by electochemical techniques”, Life Science Journal, Vol. 11, No. 7s, 2014
(SCIE) Hye Jeong Cho, Chae-Bong Sohn and Seoung-Jun Oh, “Video Content-Based Bit Rate Estimation Scheme for Transcoding in IPTV Services”, KSII TIIS, Vol. 8, No. 3, 2014
Abstract: In this paper, a new bit rate estimation scheme is proposed to determine the bit rate for each subclass in an MPEG-2 TS to H. 264/AVC transcoder after dividing an input MPEG-2 TS sequence into several subclasses. Video format transcoding in conventional IPTV and Smart TV services is a time-consuming process since the input sequence should be fully transcoded several times with different bit-rates to decide the bit-rate suitable for a service. The proposed scheme can automatically decide the bit-rate for the transcoded video sequence in those services which can be stored on a video streaming server as small as possible without losing any subject quality loss. In the proposed scheme, an input sequence to the transcoder is sub-classified by hierarchical clustering using a parameter value extracted from each frame. The candidate frames of each subclass are used to estimate the bit rate using a statistical analysis and a mathematical model. Experimental results show that the proposed scheme reduces the bit rate by, on an average approximately 52% in low-complexity video and 6% in high-complexity video with negligible degradation in subjective quality.
(SCI) Soo Young Cho, Jin Choul Chai, Soo Jun Park, Hyemyung Seo, Chae-Bong Sohn, and Young Seek Lee, “EPITRANS: A Database that Integrates Epigenome and Transcriptome Data”, Molecules and Cells”, Vol. 36, No. 5, 2013
Abstract: Epigenetic modifications affect gene expression and thereby govern a wide range of biological processes such as differentiation, development and tumorigenesis. Recent initiatives to define genome-wide DNA methylation and histone modification profiles by microarray and sequencing methods have led to the construction of databases. These databases are repositories for international epigenetic consortiums or provide mining results from PubMed, but do not integrate the epigenetic information with gene expression changes. In order to overcome this limitation, we constructed EPITRANS, a novel database that visualizes the relationships between gene expression and epigenetic modifications. EPITRANS uses combined analysis of epigenetic modification and gene expression to search for cell function-related epigenetic and transcriptomic alterations (Freely available on the web at http://epitrans.org).
(SCIE) Sea-Nae Park, Dong-Gyu Sim, Seoung-Jun Oh, Chang-Beom Ahn, Yung-Lyul Lee, Hochong Park, Chae-Bong Sohn, and Jeongil Seo, “Residual Signal Compression Based on the Blind Signal Decomposition for Video Coding”, LNCS 4412, 2007
(SCIE) Su-Yeol Jeon, Chae-Bong Sohn, Ho-Chong Park, Chang-Beom Ahn, and Seoung-Jun Oh, “Spatial Interpolation Algorithm for Consecutive Block Error Using the JND Method”, LNCS 4319, 2006
(SCIE) Jun-Seong Hong, Jong-Hyun Choi, Chang-Beom Ahn, Chae-Bong Sohn, Seoung-Jun Oh, and Hochong Park, “Dual-Domain Quantization for Transform Coding of Speech and Audio Signals”, LNCS 3767, 2005
(SCIE) Sang-Jun Yu, Chae-Bong Sohn, Seoung-Jun Oh, and Chang-Beom Ahn, “Multimedia: An SIMD – Based Efficient 4×4 2DTransform Method”, LNCS 3480, 2005
International Conferences
(NIPS-2018) Aleksandra Malysheva, Tegg Taekyong Sung, Chae-Bong Sohn, Malysheva, Daniel Kudenko, Aleksei Shpilman. “Deep Multi-Agent Reinforcement Learning with Relevance Graphs.” Thirty-second Conference on Neural Information Processing Systems. arXiv preprint arXiv:1811.12557 (2018).
Abstract: Over recent years, deep reinforcement learning has shown strong successes in complex single-agent tasks, and more recently this approach has also been applied to multi-agent domains. In this paper, we propose a novel approach, called MAGnet, to multi-agent reinforcement learning (MARL) that utilizes a relevance graph representation of the environment obtained by a self-attention mechanism [17], and a message-generation technique inspired by the NerveNet architecture [18]. We applied our MAGnet approach to the Pommerman game [11] and the results show that it significantly outperforms state-of-the-art MARL solutions, including DQN, MADDPG, and MCTS.
(DTMBIO-KMH18) Tegg Taekyung Sung, Chae-Bong Sohn, Soo Jun Park, “GDMiner: Gene-Disease relation Miner system”, ACM 12th International Workshop on Data and Text Mining in Biomedical Informatics (DTMBio) October 22, 2018
Abstract: The numbers of articles and journals that are published are increasing at a considerable rate, and the published information is growing continuously and fast. Because of this, researches to acquire knowledge automatically have been carried out in the areas of information retrieval, information extraction and text mining. Information retrieval approaches are good for specific topics that the number of related articles is small. But, if the number is bigger, searching skill and knowledge acquisition ability are useless. Though many efforts have been made to extract information from literature, many approaches have concentrated on specific entities, such as proteins, genes and their interactions, and much information is still remained in unstructured text. So, we have developed a system that discovers relations between various categories of biomedical entities. Our system collects abstracts from PubMed by queries representing a topic and visualizes relationship from the collection by automatic information extraction.
(IEEE ICCE-2018) HyeonSu Kim, SangBum Nam, SangGeun Choi, ChangHyung Kim, Tegg TaeKyong Sung, and Chae-Bong Sohn. “HLS-based 360 VR using spatial segmented adaptive streaming.” 2018 IEEE international conference on consumer electronics. IEEE, 2018.
Abstract: Recently, by advances in VR (Virtual Reality) contents and HMD (Head Mounted Display), 360VR video related research and development have been actively progressed. Also, mostly recent VR contents are provided with ultra-high definition, over 4K (UHD) and 8K (SUHD). The transmit efficiency which using the most efficient video compression, H.265, to handle such 360VR videos can be effected due to overtransmitting unseen fields in network streaming service. In this paper, a server and a network load problem can be solved by extracting and utilizing information in userconcentrated FOV (Field of View). Regarding to this concept, we propose the Spatial Segmented Adaptive Streaming (SSAS) method. By transmitting original quality video in a currently concentrated field, while transmitting degraded quality video in other fields, network load can be reduced. However, this selectively transmit method has caused switching quality delay by FOV movement. Therefore, we propose the HLS-based real-time adaptive streaming method through video fields and preencoding per quality.
(IEEE ICCE-2012) Dae-Young Noh, Ji-Eun Kim, Chae-Bong Sohn, and Seoung-Jun Oh. “A Fast Luminance Intra 4×4 Pre diction Mode Decision Method by Statistical Analysis of Residual Data in H.264/AVC.” 2012 IEEE international conference on consumer electronics. IEEE, 2012.
Abstract: In H.264/AVC, intra prediction mode decision using rate distortion optimization (RDO) improves coding efficiency but requires high computational complexity. There is a close correlation between the best mode by RDO and the energy of the residual data. In this paper we propose a fast intra 4×4 block prediction mode decision method by statistical analysis of the relationship between RDO and residual data. The proposed method reduces the intra 4×4 block encoding time by about 57.4%, while decreasing coding gain by about 0.29%.
International Journals
Kyu Jung Choi and Chae-Bong Sohn. “AI Referee with Mask R-CNN”, European Journal of Advances in Engineering and Technology, Vol 7, No. 2, 2020
Abstract: In this paper, Object detection is a fundamental field of computer vision and has received much attention in recent years and has made great development. As the development progressed, there were many cases applied to various fields like Sports, surveillance, autonomous driving. This paper describes the algorithm of object detection and describes the papers to which it is applied. In particular, the three-second rule of basketball will be heard as an example. If the attacker or defender without a ball is in the paint zone for 3 seconds, the offense is 3 seconds.
Seung-Soo Jeong and Chae-Bong Sohn, “Temporal Error Concealment Algorithm Using Adaptive Multi-Side Boundary Matching Principle”, International Journal of Computer Science and Network Security, Vol. 8, No. 12, 2008
Sang-Jun Yu and Chae-Bong Sohn, “Enhanced Transform Domain Intra Prediction for MPEG-2 to H.264/AVC Transcoding”, International Journal of Computer Science and Network Security, Vol. 7, No. 12, 2007
Chae-Bong Sohn, and Hye-Jeong Cho, “An Efficient SIMD-based Quarter-Pixel Interpolation Method for H.264/AVC”, International Journal of Computer Science and Network Security, Vol. 6, No. 11, 2006
Domestic KCI Indexed Journals
박연지, 양근제, “한국어 BERT 모델을 활용한 청각 정보 기반 광고 영상 분류 방법론”, 디지털콘텐츠학회논문지, 25.1 (2024): 121-131
나준영, 이광기, 고은수, 김대열, 손채봉, “마이데이터 환경에서 개인의 민감 데이터 주권확보를 위한 비대칭 키 암호화 기반 원격진료시스템”, 33.6 (2023): 485-494
이민아, 박연지, 나준영, 손채봉, “KoBERT, KoGPT-2, KoBART 활용 및 하이퍼파라미터 최적화를 진행한 리뷰 감성분석 애플리케이션 구현”, 디지털콘텐츠학회논문지, 24.11 (2023): 2831-2840
문종현, 손채봉, “드론 환경에서 실시간 객체 탐지를 위한 딥러닝 네트워크 기술 동향”, 선진국방연구, 6.2 (2023): 181-196
정서영, 손채봉, 유정호, “콘크리트 균열 깊이 추정에 유의미한 이미지 특성 변수에 관한 연구”, 한국퍼실리티매니지먼트학회지, 16.2 (2021): 43-51
박연지, 정유진, 손채봉, “딥러닝을 이용한 군 내외 거수자 행동 인식: 키포인트 2D 스케일링을 중심으로”, 선진국방연구, 4.1 (2021): 43-59
최규정, 오수영, 손채봉, “지능형 감시 정찰 시스템 구축을 위한 OpenPose와 Deep Learning 기술 적용방안 연구”, 선진국방연구, 3.3 (2020): 113-132
최진솔, 김경철, 손채봉, “사운드 디자인을 위한 K-POP 음악의 Wave-U-Net 및 주파수 분석을 통한 자동 Bass line 표기”, 한국디자인리서치, 4.3 (2019): 159-168
정유진, 김경철, 손채봉, “Generative Adversarial Network을 이용한 한복 디자인 DiscoGAN, CycleGAN, Munit을 중심으로”, 한국디자인리서치, 4.3 (2019): 22-29
정성, 손채봉. “DS3와 ARIA 알고리즘을 이용한 인터페이스 다중 연동 보안장치의 구현.” 대한전자공학회논문지 55.8 (2018): 127-133
조수영, 김대열, 김문석, 손채봉. “얼굴 인식 데이터 세트 생성에 관한 연구.” 한국디자인리서치 3.1 (2018): 85-93
최진솔, 최상근, 김문석, 손채봉. “CNN과 OpenPose 라이브러리를 활용한 실시간 수화 통역기.” 한국디자인리서치 3.1 (2018): 94-101
조수영, 손채봉, 김문석. “Generative adversarial nets를 이용한 빈센트 반 고흐 이미지 생성 시스템.” 한국디자인리서치 2.3 (2017): 85-92
김대열, 손채봉, 김문석. “개인 맞춤형 광고 제작 및 송출에 관한 연구.” 한국디자인리서치 2.3 (2017): 18-25
Dae Yeol Kim, Soo Young Cho, Chan Hyeong Park, Chae-Bong Sohn. “Action Game with Automatic Background Music Generation Using Genetic Algorithm.” Korean Society For Computer Game 29.2 (2016): 99-106
이기웅, 손채봉. “컴퓨터 게임을 위한 음악 기호의 변화에 강인한 악보인식 시스템.” 한국컴퓨터게임학회논문지 28.4 (2015): 17-26
김남훈, 정형원, 손채봉, 손광철. “온라인 게임 서비스 영속성을 위한 다중 연결 시스템.” 한국컴퓨터게임학회논문지 27.3 (2014): 17-26
전성하, 전현무, 신성관, 손채봉, 양훈기. “이동 물체의 태깅을 위한 디지털 빔포밍 기반 RFID 시스템.” 한국정보통신학회논문지 18.7 (2014): 1713-1720
김성일, 손채봉. “ISDB-T 시스템을 위한 SNR 추정기 구현.” 방송공학회논문지 18.6 (2013): 927-934
손채봉, 손광철, 정형원. “증강현실 기반 문화재 학습 게임 프로토타입 설계.” 한국컴퓨터게임학회논문지 26.3 (2013): 119-124
손채봉, 박수준, 오승준, 안창범, 박호정, 심동규. “u-피트니스 시스템 기술.” 한국통신학회지 (2009): 14-18
조혜정, 김지은, 손채봉, 정광수, 오승준. “통계적 분석 기반 불법 복제 비디오 영상 감식 방법.” 방송공학회논문지 14.6 (2009): 661-675