Machine learning driven methodology for enhanced nylon microplastic detection and characterization | Scientific Reports – Nature.com

Contamination level and representativeness of subsampled areas

Analysis of the procedural blank sample indicated that contamination from the experiment environment was low. The detailed results are summarized in Table S1.

The positive control sample was used to examine the representativeness of the nine subsampled regions, determined by the ratio of the estimated mass of particles on the filter to the initial 0.05mg of nylon microspheres. It was found that the method slightly overestimated the mass of nylon microspheres, with an obtained ratio of 1.270.06. (Detailed information is displayed in Fig. S1). In an endeavour to optimize the number of regions for our spectral imaging model, we systematically analyzed the ratio between estimated and actual values across varying numbers of regions.

Generally, for identification of MP using the O-PTIR technique, there are three commonly used methods, i.e., DFIR imaging, point spectra measurements, and HSI. However, each of these methods brings some challenges: for example, DFIR imaging is fast yet provides unreliable results while HSI and point spectra measurements allow for accurate results, but they are time-consuming for data collection. With the QCL system integrated within the O-PTIR microscope, the microscope can generate a single frequency IR image of a 480m640m (spatial resolution: 2m) area of a filter in approximately 3min and 20s. When an appropriate wavenumber and a threshold value are selected, the generated image shows the majority of MP particles while ruling out most non-MP particles. With this method however, careful selection of a suitable wavenumber and a threshold value for MP particles are necessary; multiple threshold values might be needed in case of interference from the complex non-MP particles. In our study, the discrimination between MPs and non-MP particles based on single-wavenumber images proved to be unfeasible, as illustrated in Fig. S2.

The second method commonly used for MP identification is point spectra measurements. After particles are observed in the mIRage microscope, point spectra could be collected for each particle and compared against parent plastic to achieve chemical identification of the particles. This method presented two challenges: (1) When using visible light for particle location under the microscope, non-MP particles were inevitably included as spectral acquisition targets, thus adding to the analysis time. (2) For an individual particle, the O-PTIR spectra could vary significantly across different spots of the particle (see example in Fig.1). This necessitates the collection of spectra from multiple areas of the particle to enhance the reliability of identification results. Consequently, the analysis time will be multiplied. For example, it takes 25s to obtain a spectrum (a total of 5 scans acquired for each single spectrum), so if there are 100 particles from the regions of interest on the filter and three spectra are required for each particle, the total analysis time needed is at least 2h. This estimation only accounts for the raw data acquisition, excluding additional durations associated with manual adjustments such as repositioning the objective or refocusing. In light of this, such an approach becomes exceedingly time-intensive, especially when a vast number of particles are in play.

Two spots of the particle encircled in a red dashed line (A) selected for point spectra collection and (B) the corresponding O-PTIR spectra of the two spots. a.u. is arbitrary units. The scale bar is 20m.

HSI was the third method employed for MP identification. HSI generates an image where each pixel contains a full spectrum. Hence, it is a reliable method for MP identification. However, this reliability comes at the cost of drastically longer data collection time, which makes HSI impractical for routine MP analysis. For example, capturing a hyperspectral image for a 480m640m area (spatial resolution of 2m and spectral resolution of 2cm1, from a spectral range of 7691801cm1) requires almost two weeks.

In response to the challenges mentioned above, we have developed a reliable MP detection framework with an improved speed that is suitable for detecting a large quantity of nylon MPs. It can collect spectral data from nine areas (the size of each area is480m640m) of a filter (at a spatial resolution of 2m) within just approximately 2h. Powered by machine learning, the reliability of this framework is not compromised in response to reduced data collection time.

In order to effectively utilize DFIR imaging for high-throughput analysis of MPs, it is crucial to carefully select specific wavenumbers that provide the greatest discriminatory power between MP and non-MP particles. Making incorrect choices in wavenumber selection can directly impact the accuracy of identification. Acquiring too many wavenumbers increases measurement time, resulting in decreased throughput. For instance, adding just one more wavenumber can lead to an approximate 30-min increase in the time required for our proposed MP detection framework to collect data from a single filter. To identify the important wavenumbers and determine the optimal number of such wavenumbers, a database collected from bulk nylon plastic was assembled, containing 1038 spectra of MP and 1052 spectra of non-MP.

We found several types of non-MP particles in our dataset. Figure2 displays the spectra of two non-MP classes (type I non-MP and type II non-MP), along with the mean spectrum of MP, enabling a comparison. Upon initial inspection, type I non-MP exhibits a prominent sharp peak in the 17001800cm1 spectral range, while type II non-MP displays a broad peak in the 10001200cm1 spectral range. In contrast, the apparent characteristic peaks of MPs are two consecutive sharp peaks in the 15001650cm1 range.

Mean spectra for nylon MP class and two non-MP types from the database constructed, following standard normal variate (SNV) to minimize the multiplicative effects.

Two thirds of the spectra from each class were randomly selected as the training dataset for model development, and the remaining samples formed the test dataset. Based on the obtained results, the model utilizing the full wavenumber spectrum yields a correction accuracy rate of 85.31% (see Table 2). The confusion matrix of the SVM-Full wavenumber model (Fig.3A) implies that there are 8 point spectra of MPs wrongly classified as non-MPs and 97 of non-MPs mistakenly assigned as MP.

Confusion matrix showing classification accuracy for the test set of SVM-Full model using full spectral variables (A) and SVM-Four model (B).

Subsequently, the coefficient based feature importance for the full wavenumber model (Fig.4) was plotted to visualize the contribution of individual spectral variables. According to Fig.4, we could choose the important wavenumbers to our dataset based on the feature importance. The higher feature importance signifies stronger discriminative capability. Based on the analysis of the coefficients of the SVM-Full wavenumber model, wavenumbers 1711cm1, 1635cm1, 1541cm1, and 1077cm1 (indicated in Fig.4) showed the feature importance, hence, were selected as important wavenumbers for distinguishing between MPs and non-MPs. As seen from Table 2, the model optimized with these four wavenumbers demonstrates an enhanced correction rate of 91.33%. Meanwhile, the SVM-Four wavenumbers model (Fig.3B) resulted in 34 point spectra of MPs wrongly classified as non-MPs and 28 of non-MPs mistakenly assigned as MP, which shows it is a balanced model for classification tasks. The SVM-Four wavenumbers model appears to outperform the SVM-Full wavenumber model in terms of specificity, CCR, and MCC, suggesting that it is a better model for this classification task. However, the SVM-Full wavenumber model has a higher sensitivity, making it better at identifying true positive cases.

The coefficients (or weights) of the SVM model, which indicate the importance of each feature (wavelength), are then plotted. Four wavenumbers which has relatively higher feature importance than other are marked above the curves (i.e., 1711cm1, 1635cm1, 1541cm1, and 1077cm1).

After the selection of the four important wavenumbers, DFIR images were obtained at the important wavenumbers from the nine subsampled regions of the filter. Particle identification could be performed through visual inspection of these DFIR images. For instance, Fig.5A shows an optical image of a small region of a filter with a particle in the centre, and Fig.5B shows chemical images of that region based on the intensity of 1711cm1, 1635cm1, 1541cm1, and 1077cm1 bands. The absorbance intensity of each chemical image was normalized to the same range. The particle in this region exhibits high signal intensity at 1635cm1 and 1541cm1, while showing weak signal intensity at 1711cm1 and 1077cm1, indicating that it is a MP particle. On the other hand, non-MP particles would show weak signal intensity at 1635cm1 and 1541cm1, while showing strong signal intensity at 1711cm1 and/or 1077cm1 (See Figs. 6A,B for an example of non-MP particles).

An optical image of an area of a prepared filter, with a MP particle in the center of the image (A), single frequency images of that area using 1711cm1, 1635cm1, 1541cm1 and 1077cm1 band intensity, with the absorbance intensity of each chemical image normalized to the same range (B), support vector machine (SVM) prediction results of the particles in this area (C), and normalized O-PTIR spectra of the particle and the bulk plastic (D). The +1 in (C) indicates where the spectrum of the particle in (D) was collected. The scale bar is 20m.

An optical image of an area of a prepared filter, with a non-MP particle in the center of the image (A), single frequency images of that area using 1711cm1, 1635cm1, 1541cm1 and 1077cm1 band intensity, with the absorbance intensity of each chemical image normalized to the same range (B), support vector machine (SVM) prediction results of the particles in this area (C), and normalized O-PTIR spectra of the particle and the bulk plastic. The +1 in (C) indicates where the spectrum of the particle in (D) was collected. The scale bar is 20m.

However, for accurate particle identification, visual inspection is not advisable due to low accuracy. Meanwhile, application of SVM-Full model requires a huge amount of time in the collection of point spectra from all particles. Therefore, an SVM-Four wavenumbers model was trained from the four important wavenumbers to predict each particle accurately. Spectral data at the four important wavenumbers were extracted from the same database used for the SVM-Full wavenumber model. The trained SVM model on the selected four wavenumbers demonstrated good performance, evidenced by a high CCR, MCC, sensitivity and specificity (Table 2).

After applying the SVM classifier to the particle in Fig.5A, each pixel of the particle was labelled as either MP (red) or non-MP (blue), providing an intuitive and accurate identification result. Figure5C displays the SVM prediction results for one example area. As can be seen, most pixels in the particle have been labelled as MP, with a small portion labelled as non-MP. The result for a particle was determined by the majority vote of the labels of all pixels within the particle. Thereby this particle was identified as a MP particle. This was further confirmed by the full spectrum of this particle (Fig.5D). Also, by applying the SVM classifier to the particle in Fig.6A, the particle was predicted to be a non-MP particle (Fig.6C). Figure6D presents a spectrum of this particle, which validates the predicted outcome.

Our developed SVM model offers several distinct advantages over the traditional correlation-based method for MP identification. Firstly, the SVM model only requires four wavenumbers as input, significantly reducing the complexity of data collection compared to the correlation-based approach, which involves obtaining spectra from each particle and calculating correlation coefficients. This efficiency translates into a substantial time-saving advantage. Therefore, the developed method is particularly useful when dealing with a large number of particles on the filter. Secondly, the correlation-based method often relies on establishing a threshold for identification, introducing a subjective element into the process. In contrast, the SVM model automates the assignment of particles to MP or non-MP categories, contributing to a more consistent and reliable MP identification process. Last but not least, once essential wavenumbers are identified and a simplified model is developed, the SVM approach can be extended to identify a range of polymers. This versatility is a significant advantage, enabling the model to adapt to various MP compositions beyond the scope of the original correlation-based method.

Using the novel identification procedure developed, it was possible to investigate the effectiveness of several sample pre-processing steps in a more representative and less biased and efficient way. To this end, high-temperature filtration and alcohol pretreatment were chosen as methods for reducing non-MP. The performance of these two treatments was evaluated separately, including the analysis of the spectra and DFIR images at four selected wavenumbers. The evaluation included an assessment of their impact on the spectra of MP and their effectiveness in removing non-MP. To assess the effectiveness of particle removal, the MP particle/all particle ratio (MP/All) detected by four wavenumbers SVM model was used. A treatment was considered effective if it significantly increased this ratio.

By boiling the nylon bulk, MP particles were released. The released particles were subsequently enriched on the filters through high-temperature filtration and room-temperature filtration, respectively. The mean spectrum of MP from high-temperature filtration, the mean spectrum of MP from room-temperature filtration, and the mean spectrum of nylon bulk were plotted together for comparison (Fig. S3). Results showed that when the mean spectrum of nylon bulk was compared to the mean spectra of MP (regardless of the filtration temperature), no consistent peak shift was found. When the mean spectrum of MP from high-temperature filtration and the mean spectrum of MP from room-temperature filtration were compared, no consistent peak shift was found either. These findings demonstrate that exposure to high temperatures reaching water boiling point will not impact the spectral profiles of MPs when compared to the original bulk plastic.

After the thermal degradation of nylon bulk, the particles released were captured on filters through high-temperature filtration and room-temperature filtration, respectively. Using our developed SVM classifier, particles in the nine subsampled regions of the filter were counted and subsequently the ratio MP/All was calculated. The MP/All ratio from the room-temperature filtration was 0.0900.012, and from the high-temperature filtration was 0.080.012, respectively. The normal t-test results indicated that the effectiveness of high-temperature filtration in removing non-MP was not evident.

Gerhard et al.18 reported that slip agents (such as fatty acid and fatty acid esters) of plastic products are released concomitantly with the release of MP particles, and these slip agents might be dissolved in hot water and washed away during the filtration process. In light of this, our results suggest that the nylon bulk used in our study might have just a small amount fatty acid or their esters. Indeed, Hansen et al.19 reported that as additives in plastics, the amount of slip agents could be as low as 0.1%, and the removal of a small amount of additives from MP samples might not statistically significant. Furthermore, based on observations of the prepared filters, we did not see a thin residue on the room-temperature filter, which was observed by Gerhard et al.18 who confirmed that most part of the thin residue in their experiment was identified as additives. This supports that the amounts of hot water-rinseable additives in our samples were low, however this would generally be sample specific.

After the degradation of nylon bulk in boiling water, the particles released were retained on filters. An alcohol treatment was subsequently applied to the filters to reduce non-MP particles. The mean spectra of MP before and after an alcohol treatment and the mean spectrum of nylon bulk were plotted together and compared (Fig. S4). Results revealed that when the mean spectrum of nylon bulk was compared to the mean spectra of MP (regardless of the alcohol treatment), no consistent peak shift was observed. When the mean spectra of MP before and after the alcohol treatment were compared, no consistent peak shift was observed either.

To further explore the effects of alcohol treatment on released particles, this paragraph focuses on spectral changes of individual particles. The spectral data of individual particles was baseline corrected, smoothed, and normalized to between 0 and 1 prior to comparison. Figure7 shows spectra as well as optical images of 4 MP particles before and after an alcohol rinse. For all four particles presented, peak shifts for signature bands of MP in the range of 769cm1 to 1801cm1 were not observed. Particle 1 has a peak at 1741cm1 before the alcohol treatment; this is a peak that has been assigned to the formation of carbonyl groups during polyamide 66 photo-20 and thermal-oxidation21, which implicates a pathway of oxidation in hot water for the particles during high temperature treatment (filtration at 70C). However, the reduction in signal intensity of this peak after the alcohol treatment might indicate that the alcohol treatment could remove some of the oxidized substances. The spectrum of particle 2 has two new peaks at 1007cm1 and 1029cm1, respectively, after exposure to alcohol, which was possibly due to alcohol residue, as these two new peaks correspond to C=O stretching bonds of alcohol22. No introduction or disappearance of the peak was observed in the spectra of particle 3 and particle 4. By observing the optical images of these MP particles, it can be concluded that alcohol treatment did not have an effect on their morphology.

Optical images of nylon MP particles 1, 2, 3, 4, with the particles circled and marked with numbers. The scale bar is 10m.

Figure8 shows spectra as well as optical images of 4 non-MP particles before and after an alcohol rinse. Particle 1 and particle 2 appear to be yellowish to brownish. These types of non-MPs are easy to be discriminated against from MPs based on visual observation of optical images, as most of the MP particles in our experiments are whitish, similar to the color of their bulk plastic samples. However, judgement based on color is not always correct. Subsequent spectral analyses confirmed that particle 1 and particle 2 are not MP. After the alcohol treatment, most parts of these two particles were washed away, leaving black remnants on the filter. Though the elimination was not complete, it proved that alcohol could remove non-MP particles. Particle 3 is whitish with a glossy surface, and it is a chlorinated polyethylene particle. After the alcohol treatment, particles with a spectrum similar as chlorinated polyethylene (We do not have any appliances containing polyethylene) remained where it had been, and the spectrum was not changed substantially. The glossiness of the particle was reduced; however, this indicates that alcohol treatment could not remove this type of contaminant. Particle 4 is a white particle, and it is covered by a brown, lumpy object on the upper left. The noise spectrum cannot be identified by the database with high certainty. After the alcohol treatment, it appeared dull grey, and its spectrum looked like that of nylon showing five signature peaks (1633cm1, 1533cm1, 1464cm1, 1416cm1, 1370cm1). This implies that the alcohol might be able to remove some contaminants, such as additives, which cover the surface of the MP particle. Li et al.23 have reported the same finding that alcohol could wash away some additives attached to the surface of MP particles. The above experiments prove that an alcohol treatment could remove some particle contaminants and wash away some impurities covering MP particles.

Optical images of non-MP particles 1, 2, 3, 4, with the particles circled and marked with numbers. The scale bar is 10m.

To further explore the significance of alcohol treatment, the developed SVM classifier was used to count the particles in the nine subsampled regions of the filter, based on which the MP/All was calculated. The MP/All ratio before the alcohol treatment was 0.1290.129; and after the alcohol treatment was 0.2860.207, respectively. The paired t-test of the data indicates that an alcohol treatment of the same areas of the filter significantly increases the MP/All (p<0.05). In summary, alcohol treatment was significantly effective in reducing non-MP contaminants.

The proposed MP detection framework was specifically adapted for application to detect MPs released from nylon teabags. However, it's important to note that not the entire framework was employed in this context. Rather, a selective application was implemented, excluding the components based on DFIR imaging and the SVM-Four wavenumber model. After steeping teabags in hot water, MPs were released and collected on a filter through filtration at room temperature. This filter was rinsed with alcohol and air-dried in the fume hood prior to O-PTIR data collection. The contaminants from the teabag are not the same as those originating from reference nylon bulk plastics. For example, teabags might have some contaminants from tea residuals, as noted by Xu et al.13. Particles released from nylon teabags were identified through point spectra measurements due to the relatively low particle count (i.e., <5 particles) observed in the subsampled regions of the filter (see Conventional MP identification).

Characterization of MP particles released from teabags was carried out using the MATLAB image processing toolbox function regionprops, which calculates properties of each particle including area, length (length of the major axis of the fitted ellipse), width (length of the minor axis of the fitted ellipse), and circularity. In Fig.9, we present four optical images toshow nylon MP particles, which have been released from three nylon teabags; they are circled and marked with numbers. To provide a comprehensive analytical context, the spectra of three key references are plotted alongside: a nylon reference sphere, a sample of nylon in bulk form, and the material of the nylon teabag itself. This juxtaposition allows for a direct comparison between the spectra of the isolated particles and these standard nylon references, this contributes to a more detailed understanding of the appearance as well as the spectral properties of the particles.

Optical images of nylon MP particles 1, 2, 3, 4 released from nylon teabag, with the particles circled and marked with numbers.

The average quantity of MP in the nine subsampled regions of the filter was 8.71.2. Extrapolating to the whole filter, we would estimate 31943.7 MP particles released from steeping three teabags, or approximately 106.314.6 MP particles were released from one teabag. The particle counts/quantities of MPs released from teabags previously reported are listed in Table 1. Our reported count is comparable to that reported by Ouyang et al.9, who found 393 MPs using FTIR-based particle-based analysis, although their brewing time was much longer than ours (1h vs 5min). Regarding Hernandez et al.7, their brewing temperature and time are very similar to ours. Nevertheless, as they did not conduct particle-based analysis, their results were overestimated8. The detection limits of O-PTIR spectroscopy and Raman spectroscopy are similar, with O-PTIR spectroscopy being around 500nm and Raman spectroscopy being around 1m. Based on this, we were surprised to find that the number of MPs we detected was one to two orders of magnitude lower than the 5,800 20,400 per teabag (brewed at 95C for 5min) reported by Busse et al.8 using Raman spectroscopy. Busse et al.8 conducted particle-based analysis, indicating that their results should be considered reliable. However, it is important to notethat their use of Raman spectroscopy may have led tomisidentification of non-MP particles as MPs in an unexpected way. To illustrate, Busse et al.8 identified and counted polyethylene (PE) particles in the teabag leachate. However, these PE particles could also be behenamide (CH3(CH2)20CONH2), which is a typical slip additive widely used in PE plastic. Behenamide exhibits a high level of spectral similarity with PE in Raman spectroscopy, up to 90%, mainly due to the strong Raman signal associated with its saturated alkyl chains (i.e., (CH)) and relatively weak Raman signals from carbonyl and amine groups23. The observed disparities between our results and those of Busse et al.8 could also potentially be attributed to the use of different types of teabags.The counts/quantities reported by other studies listed are expressed in the mass of MPs released per teabag11,12, or the number of MP particles per kg of teabags10. Therefore, direct comparisons with these studies are not possible in our paper.Subsequently, the length, width, area, and circularity of each particle were measured and calculated using the MATLAB function regionprops. Figure10A shows the surface area of the MP particles. Except for the two MP particles with the smallest (100m2) and largest (680m2) surface area, the majority of the remaining particles have surface areas ranging from 150 to 550m2. Figure10B shows the distribution of the length of MP particles. As can be seen, the maximum length is 40m and the minimum length is 16m, while most MP particles have a length ranging from 18 to 28m. Figure10C displays the width of the MP particles. As seen from the graph, the smallest width is 9m, while the largest width is 30m. The majority of the MPs have a width range between 12 and 24m. Figure10D shows the circularity of the MP particles. Among all the MP particles, only 4 have a low circularity (0.10.4), while most of the MP particles have circularity ranging from 0.65 to 0.95. Circularity is a measure of how closely a shape resembles a perfect circle. Circularity values near 1 represent perfect circles, while values close to 0 indicate shapes that deviate significantly from circularity. Based on the literature, particles that are more circular in shape are found to be less toxic, while those that deviate from a circular shape, manifesting more stretched or fiber-like, are associated with a higher level of toxicity24.

Length (A), width (B), area (C) and circularity (D) of MP particles released from steeping a single teabag.

View original post here:
Machine learning driven methodology for enhanced nylon microplastic detection and characterization | Scientific Reports - Nature.com

Related Posts

Comments are closed.