¡¶¡¶öÏÓã¹ÙÍø½øÈëby2272ÃÛÑ¿¡·Ãâ·Ñ¸ßÇåÔÚÏßԢĿ - È«¼¯...¡·¾çÇé¼ò½é£º×îµÍÆøÎ£ºÇൺ¡¢Î«·»ºÍ³ÄϵØÇø25¡æ×óÓÒÆäËûµØÇø23¡æ×óÓÒ¸´¸ç¸çÄã˵ʲô¡¶öÏÓã¹ÙÍø½øÈëby2272ÃÛÑ¿¡·Ãâ·Ñ¸ßÇåÔÚÏßԢĿ - È«¼¯...¡¸Éî¶Èѧϰ¡¹´ÓרÀûÎı¾ÖÐÌáÈ¡»¯Ñ§·´Ó¦¡ª¡ªChEMUÊý¾Ý¼¯Ô´´2021-08-31 10:28¡¤GoDesign¡ª¡ªÇ°ÑÔ¡ª¡ª½ñÖÚÈ˹¤ÖÇÄÜÊÖÒÕÉú³¤·ÉËÙ¸÷Ðи÷Òµ¶¼ÔÚÓ¦ÓÃÆäÖеĻúеѧϰ¡¢Éî¶ÈѧϰËã·¨¶Ô¸ÐÐËȤµÄÄ¿µÄ¾ÙÐÐÕ¹ÍûÔÚÓлúºÏ³É¡¢Ò©ÎïºÏ³ÉÁìÓò»úеѧϰÓëÉî¶ÈѧϰËã·¨±»ÓÃÀ´Õ¹ÍûÒ»¸öÓлú·´Ó¦µÄ²úÆ·»ò·´Ó¦ÎïÉõÖÁÓÃÀ´Õ¹ÍûÒ»¸öÒ©Îï·Ö×ÓµÄÄæºÏ³Éõè¾¶×ÅʵÕâЩËã·¨µÄʵÖʶ¼ÊÇ»ùÓÚͳ¼ÆÑ§¡¢¸ÅÂÊѧµÄÊýѧģ×Ó¶øÊýѧģ×ÓÀë²»¿ªÊý¾ÝÒò´ËÏëÒªÈÃÉè¼Æ³öÀ´µÄÄ£×Ó¸üºÃµØÕ¹ÍûÓлú·´Ó¦ÎÊÌâ¾ÍÐèÒª´ó×Ú¡¢ÇÒ¸ßÖÊÁ¿µÄÓлú·´Ó¦Êý¾ÝÄÇôÏëÒª»ñµÃ¸»×ãµÄ·´Ó¦Êý¾ÝÒ»·½Ãæ¿ÉÒÔ´Ó²»ÍêÈ«¿ªÔ´µÄReaxysÏÂÔØµ«»ñÈ¡Êý¾Ý»áÊܵ½µÄÖÖÖÖÏÞÖÆ£»£»£»¶øÍ¬Ê±Ò²ÓÐÒ»²¿·Ö¿ªÔ´µÄÊý¾Ý¼¯ÀýÈçUSPTO 1976-2016[1]ËüÃǵÄÎÊÌâÊÇÊý¾ÝûÓиüÐÂÓëά»¤Êý¾ÝÖÊÁ¿ÀǼ®²»ÆëΪÁ˴×Ô¶¯ÌáÈ¡»¯Ñ§·´Ó¦Îı¾ÖеÄÓлú·´Ó¦µÄÄ£×ÓÎÒÃÇ¿ÉÒÔ½èÖú×ÔÈ»ÓïÑÔ´¦Öóͷ£Ïà¹ØÊÖÒÕ¾ÙÐÐÎı¾ÐÅÏ¢ÍÚ¾ò2012ÄêLowe, D. M.µÈÈË¿ª·¢µÄLeadMineÈí¼þ£¨NextMove Software¹«Ë¾£©ÒÔ¼°2012Äê֮ǰµÄһЩÏà¹ØÊÂÇé¶¼Êǽ¨ÉèÔÚ´ó×ÚÈ˹¤ÍøÂçµÄ´Ê¿âÓëÖÆ¶©µÄÓï¹æÔòÔò»ù´¡ÉÏÏÈʶ±ð³öÎı¾ÖеĻ¯Ñ§ÊµÌåÃû£¨chemical entity mentioned£©ÔÙ¶Ô»ùÓÚʵÌåËù¹éÊôµÄ¶¯´Ê¾ÙÐзÖÀà»ñµÃÓлú·´Ó¦µÄ·´Ó¦Îï¡¢²úÆ·ÓëÊÔ¼Á¡¢ÈܼÁµÈ[2-4]Ö®ºóÔÚÎı¾ÖÐ×Ô¶¯ÌáÈ¡»¯Ñ§·´Ó¦µÄÁìÓòÖÐѧÊõ½ç½ÒÏþЧ¹ûµÄÖ÷ÒªÊÇIBMÓëChEMUÁ½¸öÍŶӯäÖÐIBMÄ¿µÄÊÇ×Ô¶¯»¯ÓлúʵÑéÊÒÒò´Ë´Ó×Ô¶¯ÌáÈ¡Îı¾ÖеÄÓлú·´Ó¦¼°·´Ó¦²Ù×÷Á÷³Ìµ½Õ¹Íû·´Ó¦ÔÙµ½ÄæºÏ³Éõè¾¶ÆÊÎö¶¼ÓÐËùÏ£ÍûËûÃÇÌáÈ¡·´Ó¦µÄ˼Ð÷ÊÇʹÓÃTransformerÄ£×Ó½«·´Ó¦Îı¾·Òë³ÉÌØ¶¨¶¯´ÊΪ·ÖÀàµÄ½á¹¹»¯Óï¾äÈçͼ1Ëùʾ֮ºóÔÙ½øÒ»²½Ê¶±ð»¯Ñ§ÊµÌåÃûÓë·ÖÀà»ñµÃ·´Ó¦ÏêϸÄÚÈÝ¿ÉÒÔä¯ÀÀIBM RXN for ChemistryµÄÍøÕ¾[5-6]ͼ1 IBM RXNÖн«»¯Ñ§·´Ó¦Îı¾×ªÒë³É½á¹¹»¯Îı¾Ê¾Àý[6]¡ª¡ªChEMUÊý¾Ý¼¯¡ª¡ª´ËºóÕßChEMUÊÇCheminformatics Elsevier Melbourne University labËûÃÇÔÚ2020Äê4Ô·ÝÐû²¼ÁË1500ÌõÈ˹¤±ê×¢ºÃµÄרÀûÖеÄÓлú·´Ó¦Îı¾µÄÊý¾Ý¼¯²¢ÓÐÈýÊ®¶àÖ»²½¶Ó¼ÓÈ뾺Èü[7]±ê×¢µÄÎı¾°üÀ¨ÁË·´Ó¦²úÆ·¡¢ÆðʼÎï¡¢ÊÔ¼Á´ß»¯¼Á¡¢ÈܼÁ¡¢Î¶ȡ¢²úÂʵÈ10ÖÖʵÌåÃûÈç±í1ËùʾÒÔ¼°·´Ó¦²Ù×÷¶¯´Ê£¨EVENT_TRIGGER£©¶¯´ÊÓ뻯ºÏÎïÖ®¼äµÄ¹ØÏµ²ÎÊý£¨Arg1£©ÒÔ¼°¶¯´ÊÓë·´Ó¦Ìõ¼þ£¨Î¶ȡ¢²úÂʵȣ©Ö®¼äµÄ¹ØÏµ²ÎÊý£¨ArgM£©±ê×¢Îı¾Ê¾ÀýÈçͼ2Ëùʾͨ¹ý±ê×¢Îı¾Öз´Ó¦µÄ¸÷¸ö²¿·ÖÓëÌõ¼þÎÒÃDz»µ«¿ÉÒÔ»ñµÃÓлú·´Ó¦Ê½»¹¿ÉÒÔ»ñµÃÏà¹Ø·´Ó¦Ìõ¼þÓë²úÁ¿¡¢²úÂʵÈЧ¹û±í1 ChEMUÊý¾Ý¼¯10ÖÖʵÌåÃû¼°½ç˵[7]ͼ2 ChEMUÊý¾Ý¼¯Îı¾±êעʾÀý[7]ChEMUÊý¾Ý¼¯Ö÷Òª·ÖΪÈý¸öʹÃüÒ»¸öÊÇÖ»Íê³É10¸öʵÌåÃûµÄʶ±ðÁíÒ»¸öÊÇÖ»Íê³É·´Ó¦²Ù×÷¶¯´ÊÓëʵÌåÃûÖ®¼ä¹ØÏµ²ÎÊý£¨Arg1ÓëArgM£©µÄÕ¹ÍûÉÐÓÐÒ»¸öÊǰüÀ¨ÁËǰÁ½ÕßʹÃüµÄend to endʹÃü¹ØÓÚÍøÂçÓлú·´Ó¦Ê½µÄÊý¾ÝµÚÒ»ÏîʹÃü¼´¿ÉÍê³É¡ª¡ªBiLSTM+CNN+CRFÄ£×ÓÌåÏÖ¡ª¡ªÔÚÈýÏîʹÃüÖÐÒ»¼Òר×öÉúÎïÒ½Ò©ÁìÓò×ÔÈ»ÓïÑÔ´¦Öóͷ£µÄ¹«Ë¾MelaxTechnologies Inc.¾ù»ñµÃµÚÒ»¶øÔÚʵÌåÃûʶ±ðʹÃüÖÐÌåÏÖµÚ¶þºÃµÄÔ½ÄÏÍŶÓVinAIÒÔF1 score 95.21%ÂÔµÍÓÚµÚÒ»µÄ95.70%[8]¶øËûÃÇÊÇÉÙÊýÌåÏְμâÇÒ¹ûÕæ×Ô¼ºµÄÄ£×ÓµÄÍŶÓËûÃǵÄÄ£×Ӽܹ¹Èçͼ3Ëùʾͼ3 VinAIÍŶӵÄÃüÃûÌåʶ±ðÄ£×Ó(BiLSTM-CNN-CRF)¼Ü¹¹[8]ÔÚÄ£×ÓµÄÊäÈ벿·ÖËûÃÇʹÓã¨a£©Word2Vec skip-gramÄ£×ÓԤѵÁ·µÄ´ÊǶÈ루b£©»ùÓÚһάCNNµÄ×Ö·û¼¶´ÊǶÈ루c£©ELMoÄ£×ÓԤѵÁ·µÄÓï¾³»¯µ¥´ÊǶÈëÈýÖÖ²î±ð´ÊǶÈëÅþÁ¬¶ø³ÉµÄÏòÁ¿×÷ΪÊäÈë¾Ò»²ãË«ÏòÊÇ·ÇÆÚÓ°ÏóÍøÂ磨BiLSTM£©²¶»ñÐòÁÐÐÅÏ¢ÔÙ¾Ìõ¼þËæ»ú³¡£¡£¨CRF£©²¶»ñ±ê×¢Ö®¼äµÄÂþÑܼÍÂÉÊä³ö±ê×¢¶ø±êעģʽÊdz£¼ûµÄBIOģʽ¼´±ê×¢Ò»¸ö´ÊµÄ´ÊÍ·£¨BBegin£©Óë´ÊÖУ¨IInside£©ÒÔ¼°ÆäËû´Ê£¨OOther£©À´È·¶¨ÊµÌåÃûµÄ½çÏßÀýÈçͼ3ÖÐB-REAGENT_CATALYSTÓëI-REAGENT_CATALYSTµÄ±ê×¢¶ÔÓ¦sulfuricacidÊÇREAGENT_CATALYSTΪÁËÑéÖ¤ÈýÖÖ´ÊǶÈë¶ÔÄ£×ÓµÄÌåÏÖËûÃÇ»®·ÖïÔÌÒ»ÖÖ´ÊǶÈë»ñµÃµÄЧ¹ûÈç±í2ËùʾÏà±ÈûÓÐԤѵÁ·µÄ×Ö·û¼¶CNN´ÊǶÈëÁ½ÖÖԤѵÁ·µÄ´ÊǶÈë¶ÔÄ£×ÓÌåÏÖµÄÓ°Ïì¸ü´ó±í2 ïÔÌÆäÖÐÒ»ÖÖ´ÊǶÈëʱµÄÄ£×ÓÌåÏÖ[8]¡ª¡ª×ܽáÓëÕ¹Íû¡ª¡ªÔÚ»¯Ñ§ÃüÃûÌåʶ±ðʹÃüÖÐBiLSTM+CRFÅäºÏԤѵÁ·µÄ´ÊǶÈëÒ»Ñùƽ³£¿É×÷Ϊbaseline¼¶±ðµÄÒªÁì¹ØÓÚ×Ô¶¯ÌáÈ¡µÄ·´Ó¦µÄËùÓÐÄ£×Ó׼ȷÂÊÔÙ¸ßÒ²ÎÞ·¨µÖ´ï100%Òò´Ë»¹ÐèÉú³¤Ð£¶Ô·´Ó¦µÄËã·¨£¨½«½ÏÈÝÒ×»ìÏýµÄ·´Ó¦ÎïÓëÈܼÁ¡¢´ß»¯¼Á¾ÙÐÐУ¶Ô£©ºóÆÚÈôÊǽ¨ÉèÓлú·´Ó¦Êý¾Ý¿âÕÕ¾ÉÐèÒª½øÒ»²½È˹¤Ð£¶Ô£¨Ð£¶ÔËã·¨¿ÉÒÔ¼õÇáÈ˹¤Ð£¶Ô¼ç¸ºÈÔ¾ßÓÐÒâÒ壩¶øÔÚ´ËÖ®ºóChEMUʵÑéÊÒ×¼±¸ÓÚ2021ÄêÔöÌíÁ½ÏîʹÃüÒ»¸öÊÇÕÒµ½ÓëרÀû»¯Ñ§·´Ó¦ÎÄÄÚÇéËÆµÄ»¯Ñ§·´Ó¦Óë·´Ó¦Ìõ¼þÁíÒ»¸öÊÇʶ±ðרÀû»¯Ñ§Îı¾ÖеÄÖÖÖÖ±í´ïʽ֮¼äµÄÖ¸´ú£¨Ö¸´úÏû½âÕÒµ½Ö¸´ú´ÊµÄ¹éÊô£©[9]ǰÕßΪÓлúʵÑéÕß¼ìË÷ÏàËÆ·´Ó¦Óëɸѡ·´Ó¦Ìõ¼þÌṩ±ãµ±ºóÕßÊÇ´ó¹æÄ£×Ô¶¯»¯ÌáȡרÀûÎı¾ÖÐÓлú·´Ó¦ÖбØÐèÂõ¹ýµÄÒ»µÀ¿²Òò´ËÖµµÃ¶Ô»¯Ñ§Îı¾ÍÚ¾ò¸ÐÐËȤµÄÑо¿ÕßÒ»Á¬¸ú½øÓë¼ÓÈë²Î¿¼ÎÄÏ×£º[1]Lowe, D. M. Chemical reactions from US patents https://figshare.com/articles/Chemical_reactions_from_US_patents_1976-Sep2016_/5104873[2]Lowe, D. M. Extraction of chemical structures and reactions from the literature. Diss. University of Cambridge, 2012. DOI: 10.17863/CAM.16293[3]Ai, C. S., Paul E. Blower Jr, and Robert H. Ledwith. "Extraction of chemical reaction information from primary journal text." J. Chem. Inf. Comput. Sci. 30.2 (1990):163-169. DOI: 10.1021/ci00066a012[4]Jessop, D. M., Sam E. A., and Peter M. R. "Mining chemical information from open patents." J.cheminform. 3.1(2011):1-17. DOI: 10.1186/1758-2946-3-40[5] Vaucher,A.C., Zipoli, F., Geluykens, J., et al. Automated extraction of chemical synthesis actions from experimental procedures. Nat.Commun. 11, 3601(2020). DOI: 10.1038/s41467-020-17266-6[6]IBM RXN for chemistry https://rxn.res.ibm.com[7]He, J., et al. "Overview of chemu 2020: Named entity recognition and event extraction of chemical reactions from patents." International Conference of the Cross-Language Evaluation Forum for European Languages. Springer, Cham, 2020. DOI:10.1007/978-3-030-58219-7_18[8]Dao, M. H., and Dat Q. N."VinAI at ChEMU 2020: An accurate system for named entity recognition in chemical reactions from patents." CLEF, 2020.[9]ChEMU http://chemu.eng.unimelb.edu.au
¡¶¡¶öÏÓã¹ÙÍø½øÈëby2272ÃÛÑ¿¡·Ãâ·Ñ¸ßÇåÔÚÏßԢĿ - È«¼¯...¡·ÊÓÆµËµÃ÷£ºÓóÍ·ÖØÐ³ÆðÁËÇá»úǹÏòÍâÉä»÷ÓÐÁËÇá»úǹµÄ»ðÁ¦Ô¶´¦µÄÈËȺ±»Ñ¹ÖÆÅ¿ÔÚµØÉÏת¶¯²»µÃÔÆÊåÔòÓò½Ç¹ÏòÔ¹âϵĵØÃæÈκοÉÒÉÖ®´¦µãÉäÉèÖ÷½ÃæÐ¿îºìÆìH5µÄ³µ»úϵͳÄÚ´æ´Ó6GÉý¼¶ÖÁ8GÆìÔϰæÒÔÉϵÄÉèÖÃÔöÌíÁËǰÅŸôÒô²£Á§1.5TÆìÔϰæÔöÌíÁË360¶ÈÈ«¾°Ó°Ïñ¡¢Ç°ÅÅ×ùÒμÓÈÈ2.0TÆì³©°æÔöÌíÁË·¢¹âÆì±ê¡¢³µµÀ¾ÓÖмá³Ö±ðµÄгµ»¹¶Ôѡװ°ü¾ÙÐÐÁËÓÅ»¯ÔöÌíÁËǰÅÅ×ùÒÎ͸·ç¡¢×Ô¶¯²´³µµÈÉèÖÃÕæÏàÃ÷È·´óÒ¯µÄÐıðÌáÓÐÓôÃÆÁËËûÔõôҲÏë²»µ½×Ô¼º³öÓÚÐÅÈÎÇëÀ´µÄ±£Ä·¾¹»áÊÇÕâ¸öÑù×ÓµÄ
2025-08-01 08:20:54