Identification of genetically plastic forms among Belarusian ancient flax (Linum usitatissimum convar. elongatum Vav. et Ell.) varieties using the Linum Insertion Sequence LIS-1

Main Article Content

Maria Parfenchyk
https://orcid.org/0000-0002-9667-2623
Valentina Lemesh
Elena Lagunovskaya
Valentina Sakovich
Andrei Buloichik
Elena Guzenko
Lyubov Khotyleva

Abstract

The Linum Insertion Sequence 1 (LIS-1) occurs in the genetically plastic flax genotypes in response to the lack or excess of mineral and water nutrition, but also naturally, and can be transmitted to the progeny. We have analyzed 21 ancient Belarusian varieties of flax Linum usitatissimum convar. elongatum Vav. et Ell. The LIS-1 presence or absence was checked for individual plants in at minimum two generations with primer-specific polymerase chain reaction (PCR) and agarose gel electrophoresis. The studied flax varieties formed four groups: non-responsive varieties (LIS-1 was not found, group NR); responsive, which formed and completely lost the insertion (group R0); responsive, which formed and retained LIS-1 (group R1); and responsive unstable (group R2). A statistically significant difference was found in ‘plant height’ (p < 0.05), ‘technical length of the stem’ (p < 0.05) between R0 and NR, and R2 and NR LIS-1 groups. The machine learning algorithm random forest classifier was used to predict the presence, absence or heterozygosity of LIS-1 in flax plants based on their growth and reproductive characteristics. As a result, the accuracy of the prediction was 98% on test data. In terms of sources for the selection of fibre flax varieties adaptive to environmental challenges, the most promising group consists of responsive varieties that have formed LIS-1 insertion (R0, R1 and R2 groups).

 

 

Article Details

How to Cite
Parfenchyk, M., Lemesh, V., Lagunovskaya, E., Sakovich, V., Buloichik, A., Guzenko, E. and Khotyleva, L. (2024) “Identification of genetically plastic forms among Belarusian ancient flax (Linum usitatissimum convar. elongatum Vav. et Ell.) varieties using the Linum Insertion Sequence LIS-1”, Genetic Resources, 5(9), pp. 45–60. doi: 10.46265/genresj.DBNO8764.
Section
Original Articles
References

Bajorath, J (2022). “Revisiting active learning in drug discovery through open science”. Artificial Intelligence in the Life Sciences 2, pp. 100051–100051. DOI: https://doi.org/10.2144/fsoa-2022-0010. DOI: https://doi.org/10.1016/j.ailsci.2022.100051

Bates, D et al. (2015). “Fitting linear mixed-effects models Usinglme4”. Journal of Statistical Software(1), pp. 67–67. DOI: https://doi.org/10.18637/jss.v067.i01. DOI: https://doi.org/10.18637/jss.v067.i01

Bickel, C, Lukacs, and C Cullis (2012). “The loci controlling plasticity in flax”. Research and Reports in Biology 3, pp. 1–11. DOI: https://doi.org/10.2147/RRB.S27198. DOI: https://doi.org/10.2147/RRB.S27198

Breiman, L (2001). “Random Forests”. Machine Learning 45, pp. 5–32. DOI: https://doi.org/10.1023/A:1010933404324. DOI: https://doi.org/10.1023/A:1010933404324

Chen, Y, R Lowenfeld, and C A Cullis (2009). “An environmentally induced adaptive (?) insertion event in flax”. Journal of Genetics and Molecular Biology 1, pp. 38–047.

Chen, Y, R G Schneeberger, and C A Cullis (2005). “A site-specific insertion sequence in flax genotrophs induced by environment”. The New Phytologist 167, pp. 171–80. DOI: https://doi.org/10.1111/j.1469-8137.2005.01398.x. DOI: https://doi.org/10.1111/j.1469-8137.2005.01398.x

Cullis, C (1976). “Environmentally induced changes in ribosomal RNA cistron number in flax”. Heredity 36, pp. 73–79. DOI: https://doi.org/10.1038/hdy.1976.8. DOI: https://doi.org/10.1038/hdy.1976.8

Cullis, C A (1981). “DNA sequence organization in the flax genome”. Biochim Biophys Acta 652(1), pp. 1–15. DOI: https://doi.org/10.1016/0005-2787(81)90203-3. DOI: https://doi.org/10.1016/0005-2787(81)90203-3

Cullis, C A (1986). “Phenotypic consequences of environmentally induced changes in plant DNA”. Trends in Genetics 2(86), pp. 90285–90289. DOI: https://doi.org/10.1016/0168-9525(86)90285-4. DOI: https://doi.org/10.1016/0168-9525(86)90285-4

Cullis, C A (2019). “Origin and Induction of the Flax Genotrophs”. Genetics and Genomics of Linum, pp. 227–234. DOI: https://doi.org/10.1007/978-3-030-23964-0_14. DOI: https://doi.org/10.1007/978-3-030-23964-0_14

Diederichsen, A (2019). “A Taxonomic View on Genetic Resources in the Genus Linum L. for Flax Breeding”. Genetics and Genomics of Linum, pp. 227–234. DOI: https://doi.org/10.1007/978-3-030-23964-0_1. DOI: https://doi.org/10.1007/978-3-030-23964-0_1

Durrant, A and O I Joarder (1978). “Regulation of hairless septa in flax genotrophs”. Genetica 48, pp. 171–183. DOI: https://doi.org/10.1007/BF00155567. DOI: https://doi.org/10.1007/BF00155567

Durrant, A and T Jones (1971). “Reversion of induced changes in amount of nuclear DNA in Linum”. Heredity 27, pp. 431–439. DOI: https://doi.org/10.1038/hdy.1971.106. DOI: https://doi.org/10.1038/hdy.1971.106

Durrant, A and D Nicholas (1970). “An unstable gene in flax”. Heredity 25, pp. 513–527. DOI: https://doi.org/10.1038/hdy.1970.60. DOI: https://doi.org/10.1038/hdy.1970.60

Ehrensing, D (2008). Oilseed Crops: Flax (EM 8952-E). Oregon State University Extension Service.

Evans, G, A Durrant, and H Rees (1966). “Associated Nuclear Changes in the Induction of Flax Genotrophs”. Nature 212, pp. 697–699. DOI: https://doi.org/10.1038/212697a0. DOI: https://doi.org/10.1038/212697a0

FAO (2012). Synthetic account of the Second Global Plan of Action for Plant Genetic Resources for Food and Agriculture. URL: https://www.fao.org/3/i2650e/i2650e.pdf.

Goldsbrough, P B, T H Ellis, and C A Cullis (1981). “Organisation of the 5S RNA genes in flax”. Nucleic Acids Res 9(22), pp. 5895–904. DOI: https://doi.org/10.1093/nar/9.22.5895. DOI: https://doi.org/10.1093/nar/9.22.5895

Harris, C R, K J Millman, and S J Van Der Walt (2020). “Array programming with NumPy”. Nature 585, pp. 357–362. DOI: https://doi.org/10.1038/s41586-020-2649-2. DOI: https://doi.org/10.1038/s41586-020-2649-2

Hu, Z and E P Xing (2021). “Toward a ’Standard Model’ of Machine Learning”. Harvard Data Science Review. DOI: https://doi.org/10.1162/99608f92.1d34757b. DOI: https://doi.org/10.1162/99608f92.1d34757b

Hunter, J D (2007). “Matplotlib: A 2D Graphics Environment”. Computing in Science & Engineering 9(3), pp. 90–95. DOI: https://doi.org/10.1109/MCSE.2007.55. DOI: https://doi.org/10.1109/MCSE.2007.55

I Privalov, F, S I Grib, and I S Matys (2021). “National seed bank of genetic economically useful plant resources is a scientific object of a National property of the Republic of Belarus”. Crop Farming and Plant Growing 2, pp. 10–14.

I Vavilov, N (1926). “Studies on the origin of cultivated plants”. Bull Appl Botany 16(2), pp. 3–248.

Kassambara, A (2023). rstatix: Pipe-Friendly Framework for Basic Statistical Tests. R package version 0.7.2. URL: https://rpkgs.datanovia.com/rstatix/.

Kluyver, T (2016). “Jupyter Notebooks - a publishing format for reproducible computational workflows”. In: Positioning and Power in Academic Publishing: Players, Agents and Agendas. Ed. by F Loizides and B Schmidt, pp. 87–90. DOI: https://doi.org/10.3233/978-1-61499-649-1-87.

Maggioni, L et al. (2001). Flax Genetic Resources in Europe Ad hoc meeting. Rome, Italy: International Plant Genetic Resources Institute, pp. 72–73. URL: https://www.ecpgr.cgiar.org/fileadmin/bioversity/publications/pdfs/Ad_Hoc_Fibre_Crops_WG_ad_hoc_meeting_Flax_genetic_resources_in_Europe_Czech_Rep_2001.pdf.

Nicotra, A B et al. (2010). “Plant phenotypic plasticity in a changing climate”. Trends in plant science 15(12), pp. 684–692. DOI: https://doi.org/10.1016/j.tplants.2010.09.008. DOI: https://doi.org/10.1016/j.tplants.2010.09.008

Nôžková, J et al. (2016). Descriptor list for flax (Linum usitatissimum L.) DOI: https://doi.org/10.15414/2016.9788055214849. DOI: https://doi.org/10.15414/2016.9788055214849

Oliveros, J C (2007-2015). Venny. An interactive tool for comparing lists with Venn’s diagrams. URL: https://bioinfogp.cnb.csic.es/tools/venny/index.html.

Rachinskaya, O A et al. (2011). “Genetic polymorphism of flax Linum usitatissimum based on the use of molecular cytogenetic markers”. Russ J Genet 47, pp. 56–65. DOI: https://link.springer.com/article/10.1134/S1022795411010108. DOI: https://doi.org/10.1134/S1022795411010108

Raghunathan, S and U D Priyakumar (2022). “Molecular representations for machine learning applications in chemistry”. International Journal of Quantum Chemistry 122(7), pp. 26870–26870. DOI: https://doi.org/10.1002/qua.26870. DOI: https://doi.org/10.1002/qua.26870

Sa, R et al. (2021). “Chromosome-Level Genome Assembly and Annotation of the Fiber Flax (Linum usitatissimum) Genome”. Front Genet 12, pp. 735690–735690. DOI: https://doi.org/10.3389/fgene.2021.735690. DOI: https://doi.org/10.3389/fgene.2021.735690

Sambrook, J and D W Russell (2006). “Purification of nucleic acids by extraction with phenol:chloroform”. CSH protocols. DOI: https://doi.org/10.1101/pdb.prot4455. DOI: https://doi.org/10.1101/pdb.prot4045

Seabold, S and J Perktold (2010). “Statsmodels: Econometric and statistical modeling with Python”. In: Proceedings of the 9th Python in Science Conference. DOI: https://doi.org/10.25080/Majora-92bf1922-011

The pandas development team (2020). pandas-dev/pandas: Pandas 1.0.0 (v1.0.0). Zenodo. URL: https://doi.org/10.5281/zenodo.3630805.

Virtanen, P et al. (2020). “SciPy 1.0: fundamental algorithms for scientific computing in Python”. Nature methods 17(3), pp. 261–272. DOI: https://doi.org/10.1038/s41592-019-0686-2. DOI: https://doi.org/10.1038/s41592-019-0686-2

Volkamer, A et al. (2023). “Machine Learning for Small Molecule Drug Discovery in Academia and Industry”. Artificial Intelligence in the Life Sciences. DOI: https://doi.org/10.1016/j.ailsci.2022.100056. DOI: https://doi.org/10.1016/j.ailsci.2022.100056

Waskom, M L (2021). “seaborn: statistical data visualization”. Journal of Open Source Software 6(60), pp. 3021–3021. DOI: https://doi.org/10.21105/joss.03021. DOI: https://doi.org/10.21105/joss.03021

Wickham, H (2016). Ggplot2: Elegant graphics for data analysis. 2nd. Springer International Publishing. URL: https://ggplot2.tidyverse.org. DOI: https://doi.org/10.1007/978-3-319-24277-4

Wickham, H et al. (2022). dplyr: A Grammar of Data Manipulation. URL: https://dplyr.tidyverse.org.

Yang, Z et al. (2022). “Classification of JAK1 Inhibitors and SAR Research by Machine Learning Methods”. Artificial Intelligence in the Life Sciences 2, pp. 100039–100039. DOI: https://doi.org/10.1016/j.ailsci.2022.100039. DOI: https://doi.org/10.1016/j.ailsci.2022.100039