<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">vestifm</journal-id><journal-title-group><journal-title xml:lang="ru">Известия Национальной академии наук Беларуси. Серия физико-математических наук</journal-title><trans-title-group xml:lang="en"><trans-title>Proceedings of the National Academy of Sciences of Belarus. Physics and Mathematics Series</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1561-2430</issn><issn pub-type="epub">2524-2415</issn><publisher><publisher-name>The Republican Unitary Enterprise Publishing House "Belaruskaya Navuka"</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.29235/1561-2430-2018-54-1-77-83</article-id><article-id custom-type="elpub" pub-id-type="custom">vestifm-301</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ФИЗИКА</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>PHYSICS</subject></subj-group></article-categories><title-group><article-title>МЕТОДЫ ПОСТРОЕНИЯ ОБУЧАЮЩЕГО НАБОРА ДЛЯ КАЛИБРОВКИ С ПОМОЩЬЮ ФЛУОРЕСЦЕНТНОЙ СПЕКТРОСКОПИИ НЕБОЛЬШИХ ВЫБОРОК ОБРАЗЦОВ</article-title><trans-title-group xml:lang="en"><trans-title>TRAINING SUBSET SELECTION METHODS FOR CALIBRATION WITH FLUORESCENCE SPECTROSCOPY IN SMALL DATA SETS OF SAMPLES</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Ходасевич</surname><given-names>М. А.</given-names></name><name name-style="western" xml:lang="en"><surname>Khodasevich</surname><given-names>M. A.</given-names></name></name-alternatives><bio xml:lang="ru"><p>кандидат физико-математических наук, заместитель заведующего лабораторией</p></bio><bio xml:lang="en"><p>Ph. D. (Physics and Mathematics), Deputy Head of the Laboratory</p></bio><email xlink:type="simple">m.khodasevich@ifanbel.bas-net.by</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Саскевич</surname><given-names>Н. А.</given-names></name><name name-style="western" xml:lang="en"><surname>Saskevich</surname><given-names>N. A.</given-names></name></name-alternatives><bio xml:lang="ru"><p>кандидат физико- математических наук, старший научный сотрудник</p></bio><bio xml:lang="en"><p>Ph. D. (Physics and Mathematics), Senior Researcher</p></bio><email xlink:type="simple">n.saskevich@ifanbel.bas-net.by</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Институт физики им. Б. И. Степанова Национальной академии наук Беларуси, Минск</institution></aff><aff xml:lang="en"><institution>B. I. Stepanov Institute of Physics of the National Academy of Sciences of Belarus, Minsk</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2018</year></pub-date><pub-date pub-type="epub"><day>05</day><month>04</month><year>2018</year></pub-date><volume>54</volume><issue>1</issue><fpage>77</fpage><lpage>83</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Ходасевич М.А., Саскевич Н.А., 2018</copyright-statement><copyright-year>2018</copyright-year><copyright-holder xml:lang="ru">Ходасевич М.А., Саскевич Н.А.</copyright-holder><copyright-holder xml:lang="en">Khodasevich M.A., Saskevich N.A.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://vestifm.belnauka.by/jour/article/view/301">https://vestifm.belnauka.by/jour/article/view/301</self-uri><abstract><p>При проведении калибровки с помощью многопараметрического спектрального анализа ограниченность количества образцов и невозможность априорного контроля величины искомого параметра обусловливают важность решения задачи построения обучающего набора с целью уменьшения ошибки калибровки. На примере калибровки температуры с помощью проекции на латентные структуры в диапазоне от 66 до 150 °C по спектрам флуоресценции Yb3+:CaF2 , зарегистрированным в диапазоне 880–1120 нм с разрешением около 0,2 нм и шагом по температуре 2 °C, продемонстрированы возможные варианты построения обучающего набора из небольших по количеству образцов выборок с применением равномерного распределения, алгоритма Кеннарда – Стоуна и методов кластерного анализа в пространстве главных компонент, показано их влияние на точность калибровки. Применение метода главных компонент позволяет проводить отбор спектров без привлечения априорных знаний о температуре, которой соответствуют спектры флуоресценции. Показано, что среднеквадратичная ошибка предсказанной величины температуры при равномерном распределении образцов обучающего набора по пространству первой главной компоненты составляет 3,98 °C, при использовании алгоритма Кеннарда – Стоуна – 1,07 °C. Минимальная среднеквадратичная ошибка 0,98 °C может быть достигнута с помощью иерархического кластерного анализа пространства главных компонент исследуемых спектров.</p><p> </p></abstract><trans-abstract xml:lang="en"><p>A limited number of samples and an impossible a priori control of a desired parameter value stipulate how it is important to solve the problem of selecting a training set for calibration by the multivariate spectral analysis in order to reduce a calibration error. Possible variants of a training subset selection from small data sets are shown for temperature calibration with fluorescence spectra of Yb3+:CaF2 recorded in the range of 880–1120 nm with a resolution of about 0.2 nm for the temperature range from 66 to 150 °C and at a step of 2 °C. The methods applied are the uniform distribution, the Kennard and Stone algorithm and the cluster analysis in principal component space. The effect of the method choice on the calibration accuracy has been evaluated. The application of the principal component analysis gives the possibility to select spectra without a priori knowledge of a temperature, to which the fluorescence spectra correspond. The root-mean-square error of the predicted temperature value is shown to be 3.98 ° C for the uniform distribution of the training subset samples over the space of the first principal component and 1.07 ° C for the Kennard and Stone algorithm. The minimum root-mean-square prediction error of 0.98 °C is shown to be achieved with the training subset selection by the hierarchical cluster analysis in the space of the principal components of the spectra studied.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>флуоресцентная спектроскопия</kwd><kwd>калибровка</kwd><kwd>кластерный анализ</kwd><kwd>проекция на латентные структуры</kwd></kwd-group><kwd-group xml:lang="en"><kwd>fluorescence spectroscopy</kwd><kwd>calibration</kwd><kwd>cluster analysis</kwd><kwd>projection to latent structures</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Kalivas, J. H. Calibration Methodologies / J. H. Kalivas // Comprehensive Chemometrics: Chemical and Biochemical Data Analysis / editors-in-chief: R. Tauler, B. Walczak, S. D. Brown. – Elsevier, 2009. – P. 1–32.</mixed-citation><mixed-citation xml:lang="en">Kalivas, J. H. Calibration Methodologies. Tauler R., Walczak B., Brown S. D. (eds.). Comprehensive Chemometrics: Chemical and Biochemical Data Analysis. Elsevier, 2009, pp. 1–32. Doi: 10.1016/b978-044452701-1.00072-7</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Rapid identiﬁcation of soil cadmium pollution risk at regional scale based on visible and near-infrared spectroscopy / Tao Chen [et al.] // Environ. Pollut. – 2015. – Vol. 206. – P. 217–226.</mixed-citation><mixed-citation xml:lang="en">Tao Chen, Qingrui Chang, Clevers J.G.P.W., Kooistra L. Rapid identiﬁcation of soil cadmium pollution risk at regional scale based on visible and near-infrared spectroscopy. Environmental Pollution, 2015, vol. 206, pp. 217–226. Doi: 10.1016/j. envpol.2015.07.009</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Noninvasive and fast measurement of blood glucose in vivo by near infrared (NIR) spectroscopy / Xue Jintao [et al.] // Spectrochim. Acta Part A: Molecular and Biomolecular Spectroscopy. – 2017. – Vol. 179. – P. 250–254.</mixed-citation><mixed-citation xml:lang="en">Xue Jintao, Ye Liming, Liu Yufei, Li Chunyan, Chen Han. Noninvasive and fast measurement of blood glucose in vivo by near infrared (NIR) spectroscopy. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2017, vol. 179, pp. 250–254. Doi: 10.1016/j.saa.2017.02.032</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Joint analyses model for total cholesterol and triglyceride in human serum with near-infrared spectroscopy / Lijun Yao [et al.] // Spectrochim. Acta Part A: Molecular and Biomolecular Spectroscopy. – 2016. – Vol. 159. – P. 53–59.</mixed-citation><mixed-citation xml:lang="en">Lijun Yao, Ning Lyu, Jiemei Chen, Tao Pan, Jing Yu. Joint analyses model for total cholesterol and triglyceride in human serum with near-infrared spectroscopy. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, 2016, vol. 159, pp. 53–59. Doi: 10.1016/j.saa.2016.01.022</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Da Silva, V. H. Portable near-infrared instruments: Application for quality control of polymorphs in pharmaceutical raw materials and calibration transfer / V. H. da Silva, J. J. da Silva, C. F. Pereira // J. Pharm. Biomed. Anal. – 2017. – Vol. 134. – P. 287–294.</mixed-citation><mixed-citation xml:lang="en">Da Silva V. H., da Silva J. J., Pereira C. F. Portable near-infrared instruments: Application for quality control of poly-morphs in pharmaceutical raw materials and calibration transfer. Journal of Pharmaceutical and Biomedical Analysis, 2017, vol. 134, pp. 287–294. Doi: 10.1016/j.jpba.2016.11.036</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Identiﬁcation of anisodamine tablets by Raman and near-infrared spectroscopy with chemometrics / Lian Li [et al.] // Spectrochim. Acta Part A: Molecular and Biomolecular Spectroscopy. – 2014. – Vol. 127. – P. 91–97.</mixed-citation><mixed-citation xml:lang="en">Lian Li, Hengchang Zang, Jun Li, Dejun Chen, Tao Li, Fengshan Wang. Identiﬁcation of anisodamine tablets by Raman and near-infrared spectroscopy with chemometrics. Spectrochimica Acta Part A: Molecular and Biomolecular Spec-tro scopy, 2014, vol. 127, pp. 91–97. Doi: 10.1016/j.saa.2014.02.022</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Sulub, Y. Determination of polymer blends composed of polycarbonate and rubber entities using near-infrared (NIR) spectroscopy and multivariate calibration / Y. Sulub, J. DeRudder // Polym. Test. – 2013. – Vol. 32, № 4. – P. 802–809.</mixed-citation><mixed-citation xml:lang="en">Sulub Y., DeRudder J. Determination of polymer blends composed of polycarbonate and rubber entities using near- infrared (NIR) spectroscopy and multivariate calibration. Polymer Testing, 2013, vol. 32, no. 4, pp. 802–809. Doi: 10.1016/j. polymertesting.2013.03.008</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Application of benchtop and portable near-infrared spectrometers for predicting the optimum harvest time of Verbena ofﬁcinalis / C. K. Pezzei [et al.] // Talanta. – 2017. – Vol. 169. – P. 70–76.</mixed-citation><mixed-citation xml:lang="en">Pezzei C. K., Schonbichler S. A., Kirchler C. G., Schmelzer J., Hussain S., Huck-Pezzei V. A., Popp M., Krolitzek J., Bonn G. K. Application of benchtop and portable near-infrared spectrometers for predicting the optimum harvest time of Verbena ofﬁcinalis. Talanta, 2017, vol. 169, pp. 70–76. Doi: 10.1016/j.talanta.2017.03.067</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Comparing predictive ability of laser-induced breakdown spectroscopy to visible near-infrared spectroscopy for soil property determination / M. Knadel [et al.] // Biosystems Engineering. – 2017. – Vol. 156. – P. 157–172.</mixed-citation><mixed-citation xml:lang="en">Knadel M., Gislum R., Hermansen C., Peng Y., Moldrup P., de Jonge L. W., Greve M. H. Comparing predictive ability of laser-induced breakdown spectroscopy to visible near-infrared spectroscopy for soil property determination. Biosystems Engineering, 2017, vol. 156, pp. 157–172. Doi: 10.1016/j.biosystemseng.2017.01.007</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Eating quality evaluation of Khao Dawk Mali 105 rice using near-infrared spectroscopy / P. Siriphollakul [et al.] // LWT – Food Science and Technology. – 2017. – Vol. 79. – P. 70–77.</mixed-citation><mixed-citation xml:lang="en">Siriphollakul P., Nakano K., Kanlayanarat S., Ohashi S., Sakai R., Rittiron R., Maniwara P. Eating quality evaluation of Khao Dawk Mali 105 rice using near-infrared spectroscopy. LWT – Food Science and Technology, 2017, vol. 79, pp. 70–77. Doi: 10.1016/j.lwt.2017.01.014</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Monitoring of alcohol strength and titratable acidity of apple wine during fermentation using near-infrared spectroscopy / B. Peng [et al.] // LWT – Food Science and Technology. – 2016. – Vol. 66. – P. 86–92.</mixed-citation><mixed-citation xml:lang="en">Peng B., Ge N., Cui L., Zhao H. Monitoring of alcohol strength and titratable acidity of apple wine during fermentation using near-infrared spectroscopy. LWT – Food Science and Technology, 2016, vol. 66, pp. 86–92. Doi: 10.1016/j. lwt.2015.10.018</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Определение химических характеристик и производителя дивинов по их широкополосным спектрам пропускания / М. А. Ходасевич [и др.] // Оптика и спектроскопия. – 2016. – Т. 120, № 6. – С. 1046–1051.</mixed-citation><mixed-citation xml:lang="en">Khodasevich M. A., Sinitsyn G. V., Skorbanova E. A., Rogovaya M. V., Kambur E. I., and Aseev V. A. Determination of the Chemical Parameters and Manufacturer of Divins from Their Broadband Transmission Spectra. Optics and Spectroscopy, 2016, vol. 120, pp. 978–982. Doi: 10.1134/s0030400x16050155</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Применение многопараметрического анализа широкополосных спектров пропускания для идентификации вин с географическим наименованием происхождения / М. А. Ходасевич [и др.] // Приборы и методы измерений. – 2016. – Т. 7, № 1. – С. 104–113.</mixed-citation><mixed-citation xml:lang="en">Khodasevich M. A., Skorbanova E. A., Obade L. I., Degtyar N. F., Kambur E. I., Rogovaya M. V. Application of multivariate analysis of transmission spectra to identify wines with protected geographical indication (IGP). Pribory i metody izmerenii = Devices and Methods of Measurements, 2016, vol. 7, no. 1, pp. 104–113 (in Russian). Doi: 10.21122/2220-9506-2016-7-1-104-113</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Alves, J. C. L. Quantiﬁcation of conventional and advanced biofuels contents in diesel fuel blends using near-infrared spectroscopy and multivariate calibration / J. C. L. Alves, R. J. Poppi // Fuel. – 2016. – Vol. 165. – P. 379–388.</mixed-citation><mixed-citation xml:lang="en">Alves J. C. L., Poppi R. J. Quantiﬁcation of conventional and advanced biofuels contents in diesel fuel blends using near-infrared spectroscopy and multivariate calibration. Fuel, 2016, vol. 165, pp. 379–388. Doi: 10.1016/j.fuel.2015.10.079</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Clustering and training set selection methods for improving the accuracy of quantitative laser induced breakdown spectroscopy / R. B. Anderson [et al.] // Spectrochim. Acta Part B. – 2012. – Vol. 70. – P. 24–32.</mixed-citation><mixed-citation xml:lang="en">Anderson R. B., Bell J. F., Wiens R. C., Morris R. V., Clegg S. M. Clustering and training set selection methods for improving the accuracy of quantitative laser induced breakdown spectroscopy. Spectrochimica Acta Part B, 2012, vol. 70, pp. 24–32. Doi: 10.1016/j.sab.2012.04.004</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Abdi, H. Partial least squares regression and projection on latent structure regression / H. Abdi // Wiley Interdisciplinary Reviews: Computational Statistics. – 2010. – Vol. 2, № 1. – P. 97–106.</mixed-citation><mixed-citation xml:lang="en">Abdi H. Partial least squares regression and projection on latent structure regression. Wiley Interdisciplinary Reviews: Computational Statistics, 2010, vol. 2, pp. 97–106. Doi: 10.1002/wics.51</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Determining the Stark structure of Yb3+ energy levels in Y3Al5O12 and CaF2 using principal component analysis of tem perature dependences of ﬂuorescence spectra / M. Khodasevich [et al.] // J. Lmin. – 2017. – Vol. 187. – P. 295–297.</mixed-citation><mixed-citation xml:lang="en">Khodasevich M., Varaksa Y., Sinitsyn G., Aseev V., Demesh M., Yasukevich A. Determining the Stark structure of Yb3+ energy levels in Y3Al5O12 and CaF2 using principal component analysis of temperature dependences of ﬂuorescence spectra. Journal of Luminescence, 2017, vol. 187, pp. 295–297. Doi: 10.1016/j.jlumin.2017.03.014</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Esbensen, K. H. Principal Component Analysis: Concept, Geometrical Interpretation, Mathematical Background, Algorithms, History, Practice / K. H. Esbensen, P. Geladi // Comprehensive Chemometrics. – 2009. – Vol. 2. – P. 211–226.</mixed-citation><mixed-citation xml:lang="en">Esbensen K. H., Geladi P. Principal Component Analysis: Concept, Geometrical Interpretation, Mathematical Background, Algorithms, History, Practice. Comprehensive Chemometrics, 2009, vol. 2, pp. 211–226. Doi: 10.1016/b978-044452701-1.00043-0</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Применение регрессии на латентные структуры для определения температуры активированной ионами эрбия свинцово-фторидной наностеклокерамики по спектрам апконверсионной флуоресценции / В. А. Асеев [и др.] // Оптика и спектроскопия. – 2015. – Т. 118, № 2. – С. 47–49.</mixed-citation><mixed-citation xml:lang="en">Aseev V. A., Varaksa Yu. A., Kolobkova E. V., Sinitsyn G. V., Khodasevich M. A. Application of Projection on Latent Structures for Determining Temperature of Erbium-Doped Lead Fluoride Nano-Glass-Ceramics from Upconversion Fluorescence Spectra. Optics and Spectroscopy, 2015, vol. 118, pp. 727–728. Doi: 10.1134/s0030400x15050033</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">Kennard, R. W. Computer aided design of experiments / R. W. Kennard, L. A. Stone // Technometrics. – 1969. – Vol. 11, № 1. – P. 137–148.</mixed-citation><mixed-citation xml:lang="en">Kennard R. W., Stone L. A. Computer aided design of experiments. Technometrics, 1969, vol. 11, no. 1, pp. 137–148. Doi: 10.2307/1266770</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Daszykowski, M. Representative subset selection / M. Daszykowski, B. Walczak, D. L. Massart // Anal. Chim. Acta. – 2002. – Vol. 468. – P. 91–103.</mixed-citation><mixed-citation xml:lang="en">Daszykowski M., Walczak B., Massart D. L. Representative subset selection. Analytica Chimica Acta, 2002, vol. 468, pp. 91–103. Doi: 10.1016/s0003-2670(02)00651-7</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">Мандель, И. Д. Кластерный анализ / И. Д. Мандель. – М.: Финансы и статистика, 1988. – 176 с.</mixed-citation><mixed-citation xml:lang="en">Mandel I. D. Cluster analysis. Moscow, Finansy i statistika Publ., 1988, 176 p. (in Russian).</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
