The previous research of speech separation has significantly improved separation performance based on the time-domain method: encoder, separator, and decoder. Most research has focused on revising the architecture of the separator. In contrast, a single 1-D convolution layer and 1-D transposed convolution layer have been used as encoder and decoder, respectively. This study proposes deep encoder and decoder architectures, consisting of stacked 1-D convolution layers, 1-D transposed convolution layers, or residual blocks, for the time-domain speech separation. The intentions of revising them are to improve separation performance and overcome the tradeoff between separation performance and computational cost due to their stride by enhancing their mapping ability. We applied them to Conv-TasNet, the typical model in the time-domain speech separation. Our results indicate that the better separation performance is archived as the number of their layers increases and that changing the number of their layers from 1 to 12 results in more than 1 dB improvement of SI-SDR on WSJ0-2mix. Additionally, it is suggested that the encoder and decoder should be deeper, corresponding to their stride since their task may be more difficult as the stride becomes larger. This study represents the importance of improving these architectures as well as separators.
{"title":"Deep encoder and decoder for time-domain speech separation","authors":"Kohei TAKAHASHI, Toshihiko SHIRAISHI","doi":"10.1299/mej.23-00124","DOIUrl":"https://doi.org/10.1299/mej.23-00124","url":null,"abstract":"The previous research of speech separation has significantly improved separation performance based on the time-domain method: encoder, separator, and decoder. Most research has focused on revising the architecture of the separator. In contrast, a single 1-D convolution layer and 1-D transposed convolution layer have been used as encoder and decoder, respectively. This study proposes deep encoder and decoder architectures, consisting of stacked 1-D convolution layers, 1-D transposed convolution layers, or residual blocks, for the time-domain speech separation. The intentions of revising them are to improve separation performance and overcome the tradeoff between separation performance and computational cost due to their stride by enhancing their mapping ability. We applied them to Conv-TasNet, the typical model in the time-domain speech separation. Our results indicate that the better separation performance is archived as the number of their layers increases and that changing the number of their layers from 1 to 12 results in more than 1 dB improvement of SI-SDR on WSJ0-2mix. Additionally, it is suggested that the encoder and decoder should be deeper, corresponding to their stride since their task may be more difficult as the stride becomes larger. This study represents the importance of improving these architectures as well as separators.","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135699318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Model experiments on the influence of tunnel hood cross-sectional area on the reduction of the pressure gradient of the compression wavefront in high-speed railways","authors":"T. Fukuda, S. Saito","doi":"10.1299/mej.22-00155","DOIUrl":"https://doi.org/10.1299/mej.22-00155","url":null,"abstract":"","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66399388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reduce the sway of the crane payload using on-off damping radial spring-damper","authors":"Kien T. Nguyen, L. Tran","doi":"10.1299/mej.22-00321","DOIUrl":"https://doi.org/10.1299/mej.22-00321","url":null,"abstract":"","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66400358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Daichi Todaka, Koki Sato, Hiroshi Nagatomo, S. Ando, Q. Shi
{"title":"Diffuseness quantification in an acoustic test chamber using the isotropy derived by the angular spectrum of sound waves based on the spherical harmonics expansion","authors":"Daichi Todaka, Koki Sato, Hiroshi Nagatomo, S. Ando, Q. Shi","doi":"10.1299/mej.22-00371","DOIUrl":"https://doi.org/10.1299/mej.22-00371","url":null,"abstract":"","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66401139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tomochika Takahashi, Yuta Kinai, Toshiko Osada, Satoshi Kobayashi
{"title":"Effect of water atomized powder size on the flowability and sintered properties in metal binder jet 3D printing","authors":"Tomochika Takahashi, Yuta Kinai, Toshiko Osada, Satoshi Kobayashi","doi":"10.1299/mej.22-00476","DOIUrl":"https://doi.org/10.1299/mej.22-00476","url":null,"abstract":"","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"2 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66401391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Takero Hirose, Akira Nakazato, K. Kurita, S. Aoki, S. Koshimizu, Tatsuo Yoshida
{"title":"Reduction of welding residual stress using ultrasonic vibration load (Effects of material properties on reduction rate)","authors":"Takero Hirose, Akira Nakazato, K. Kurita, S. Aoki, S. Koshimizu, Tatsuo Yoshida","doi":"10.1299/mej.23-00056","DOIUrl":"https://doi.org/10.1299/mej.23-00056","url":null,"abstract":"","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66402465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Masafumi Tazuke, Tsuyoshi Miyakoshi, A. Hosoi, K. Michishio, N. Oshima, H. Kawada
{"title":"Very high-cycle fatigue properties of 90° unidirectional CFRP laminates and evaluation of fatigue limits by free volume measurement using positron microscopy","authors":"Masafumi Tazuke, Tsuyoshi Miyakoshi, A. Hosoi, K. Michishio, N. Oshima, H. Kawada","doi":"10.1299/mej.23-00089","DOIUrl":"https://doi.org/10.1299/mej.23-00089","url":null,"abstract":"","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66403519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
T. Kurahashi, K. Takeuchi, Towa Koike, M. Kishida, Y. Murakami, F. Ikeda
{"title":"Numerical studies for defect topology identification based on the adjoint variable and the finite element methods","authors":"T. Kurahashi, K. Takeuchi, Towa Koike, M. Kishida, Y. Murakami, F. Ikeda","doi":"10.1299/mej.23-00090","DOIUrl":"https://doi.org/10.1299/mej.23-00090","url":null,"abstract":"","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"1 1","pages":""},"PeriodicalIF":0.5,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66403659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In human motion capture systems, reflective markers attached to the body have been widely used to track motion using optical cameras. However, when the speed of motion increases, because the brightness and angle of view of the camera are limited, and the markers often fall off, particularly of detailed body parts such as fingers in full-body movements, other parts of the body (palms) have been investigated. This study attempted to acquire finger movements during a high-speed throwing task without attaching markers using automatic image recognition technology based on deep learning (DeepLabCut) and verified its accuracy compared to conventional methods. As a result, the absolute distance between the 3D coordinates obtained from the two motion capture systems was an average of 15.5 to 29.4 mm depending on tracked points, and the correlation coefficients between them ranged from 0.932 to 0.999. Therefore, the shapes of the time-series profiles of the 3D coordinates obtained from the two motion- capture systems were similar. These results suggest that motion measurement using markerless motion capture is possible in environments where conventional motion capture systems are difficult to use.
{"title":"Markerless motion capture of hands and fingers in high-speed throwing task and its accuracy verification","authors":"Ayane KUSAFUKA, Naoki TSUKAMOTO, Kohei MIYATA, Kazutoshi KUDO","doi":"10.1299/mej.23-00220","DOIUrl":"https://doi.org/10.1299/mej.23-00220","url":null,"abstract":"In human motion capture systems, reflective markers attached to the body have been widely used to track motion using optical cameras. However, when the speed of motion increases, because the brightness and angle of view of the camera are limited, and the markers often fall off, particularly of detailed body parts such as fingers in full-body movements, other parts of the body (palms) have been investigated. This study attempted to acquire finger movements during a high-speed throwing task without attaching markers using automatic image recognition technology based on deep learning (DeepLabCut) and verified its accuracy compared to conventional methods. As a result, the absolute distance between the 3D coordinates obtained from the two motion capture systems was an average of 15.5 to 29.4 mm depending on tracked points, and the correlation coefficients between them ranged from 0.932 to 0.999. Therefore, the shapes of the time-series profiles of the 3D coordinates obtained from the two motion- capture systems were similar. These results suggest that motion measurement using markerless motion capture is possible in environments where conventional motion capture systems are difficult to use.","PeriodicalId":45233,"journal":{"name":"Mechanical Engineering Journal","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135705468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}