Anderson's Angle
Research Suggests LLMs Are Willing to Assist in Malicious 'Vibe Coding'

Over the past few years, large language models (LLMs) have drawn scrutiny for their potential misuse in offensive cybersecurity, particularly in the generation of software exploits.
The recent trend towards 'vibe coding' (the casual use of language models to quickly develop code for a user, instead of explicitly teaching the user to code) has revived a concept that reached its zenith in the 2000s: the 'script kiddie', a relatively unskilled malicious actor with just enough knowledge to replicate or develop a damaging attack. The natural implication is that when the bar to entry is lowered, threats tend to multiply.
All commercial LLMs have some kind of guardrail against being used for such purposes, although these protective measures are under constant attack. Most FOSS models (across multiple domains, from LLMs to generative image/video models) are likewise released with some similar protection, usually for compliance purposes in the West.
However, official model releases are then routinely fine-tuned by user communities seeking more complete functionality, or else LoRAs are used to bypass restrictions and potentially obtain 'undesired' results.
Though the vast majority of online LLMs will refuse to assist the user with malicious processes, 'unfettered' initiatives such as WhiteRabbitNeo are available to help security researchers operate on a level playing field with their opponents.
The general user experience at present is most commonly represented by the ChatGPT series, whose filter mechanisms frequently draw criticism from the LLM's native community.
It looks like you're trying to attack a system!
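In light of this perceived tendency towards restriction and censorship, it may come as a surprise that ChatGPT proved the most cooperative of all the LLMs tested in a recent study designed to push language models into creating malicious code exploits.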
The new paper, from researchers at UNSW Sydney and the Commonwealth Scientific and Industrial Research Organisation (CSIRO), is titled Good News for Script Kiddies? Evaluating Large Language Models for Automated Exploit Generation, and offers the first systematic evaluation of how effectively these models can be prompted to produce working exploits. Examples from the research have been made available by the authors.
The study compares how the models performed on both original and modified versions of known vulnerability labs (structured programming exercises designed to demonstrate specific software security flaws), which helps to reveal whether a model relied on memorized examples or struggled because of built-in safety restrictions.

From the supporting site, the Ollama LLM helps the researchers to develop a string vulnerability attack. Source: https://anonymous.4open.science/r/AEG_LLM-EAE8/chatgpt_format_string_original.txt
None of the models was able to create an effective exploit, but several came very close. More importantly, several of them wanted to do better at the task, which indicates a potential failure of existing guardrail approaches.
The paper states:
'Our experiments show that GPT-4 and GPT-4o exhibit a high degree of cooperation in exploit generation, comparable to some uncensored open-source models. Among the evaluated models, Llama3 was the most resistant to such requests.
'Despite their willingness to assist, the actual threat posed by these models remains limited, as none successfully generated exploits for the five custom labs with refactored code. However, GPT-4o, the strongest performer in our study, typically made only […] to […] errors in […] attempts.
'This suggests significant potential for leveraging LLMs to develop advanced, generalizable [Automated Exploit Generation (AEG)] techniques.'
Many Second Chances
The truism 'You don't get a second chance to make a good first impression' is not generally applicable to LLMs, because a language model's typically limited context window means that a negative context (in a social sense, i.e., antagonism) is not persistent.
Consider: if you went to a library and asked for a book about practical bomb-making, you would probably be refused, at the very least. But (assuming this inquiry did not entirely tank the conversation from the outset) your requests for related works, such as books about chemical reactions or circuit design, would, in the librarian's mind, be clearly related to the initial inquiry, and would be treated in that light.
Likely as not, the librarian would also remember, in any future meeting, that you asked for a bomb-making book that one time, making this new context of yourself 'irreparable'.
Not so with an LLM, which can struggle to retain tokenized information even from the current conversation, never mind from long-term memory directives (if there are any in the architecture, as with the ChatGPT-4o product).
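The lack of persistence described here can be pictured with a toy sliding window. The sketch below is an illustration only: the function name `visible_context`, the use of whitespace-separated words as a stand-in for tokens, and the sample conversation are all assumptions, and real systems manage context in more elaborate ways.

```python
from collections import deque


def visible_context(turns, max_tokens):
    """Return the most recent turns that fit in a fixed token budget.

    Token counts are approximated by whitespace-separated words, which is
    an assumption made for illustration; real tokenizers differ.
    """
    kept = deque()
    used = 0
    # Walk backwards from the newest turn, keeping turns while they fit.
    for turn in reversed(turns):
        cost = len(turn.split())
        if used + cost > max_tokens:
            break
        kept.appendleft(turn)
        used += cost
    return list(kept)


# Hypothetical conversation: an early refusal followed by later turns.
conversation = [
    "user: first request, which the model declines",
    "assistant: sorry, I cannot help with that",
    "user: a long unrelated question about chemistry homework",
    "assistant: a long and detailed answer about reaction rates",
]
window = visible_context(conversation, max_tokens=20)
```

With a budget of 20 'tokens', only the two most recent turns remain in `window`, so the earlier refusal is no longer part of what the model would see.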
Thus even casual conversations with ChatGPT reveal, by accident, that it sometimes strains at a gnat but swallows a camel, not least when a constituent theme, study or process relating to an otherwise 'banned' activity is allowed to develop over the course of the discourse.
This holds true of all current language models, though guardrail quality may vary in extent and approach among them (i.e., the difference between modifying the weights of the trained model and using in/out filtering of text during a chat session, which leaves the model structurally intact but potentially easier to attack).
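The second of those approaches, filtering text on the way in and out of a session, can be sketched as a wrapper that leaves the underlying model untouched. This is a minimal sketch under stated assumptions: `with_io_filter`, `echo_model` and `keyword_check` are hypothetical names invented for illustration, and a keyword test is far cruder than any production filter.

```python
from typing import Callable

REFUSAL = "Request declined by filter."


def with_io_filter(model: Callable[[str], str],
                   is_disallowed: Callable[[str], bool]) -> Callable[[str], str]:
    """Wrap a text model with checks on both the prompt and the reply."""
    def guarded(prompt: str) -> str:
        if is_disallowed(prompt):      # input-side filter
            return REFUSAL
        reply = model(prompt)          # the model itself is unchanged
        if is_disallowed(reply):       # output-side filter
            return REFUSAL
        return reply
    return guarded


# Hypothetical stand-ins: an echo "model" and a keyword-based check.
def echo_model(prompt: str) -> str:
    return f"echo: {prompt}"


def keyword_check(text: str) -> bool:
    return "forbidden" in text.lower()


chat = with_io_filter(echo_model, keyword_check)
```

Because the filter sits outside the model, removing or sidestepping the wrapper restores the unfiltered behavior, which is one way to read the article's point that this design may be easier to attack than changes made to the weights.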
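Testing the Method
To test how far LLMs could be pushed towards generating working exploits, the authors set up a controlled environment using five labs from SEED Labs, each built around known vulnerabilities, including a buffer overflow, return-to-libc, a Dirty COW attack, and race conditions.
In addition to using the original labs, the researchers created modified versions by renaming variables and functions to generic identifiers. This was intended to prevent the models from drawing on memorized training examples.
Each lab was run twice per model: once in its original form, and once in its obfuscated version.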
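The renaming step described above can be sketched as follows. The article does not describe the authors' tooling, so this is an assumption-laden illustration: `rename_identifiers` and the `id_N` naming scheme are hypothetical, the sample line of C is invented, and the naive whole-word matching would also rewrite matches inside string literals and comments.

```python
import re


def rename_identifiers(source: str, names: list[str]) -> tuple[str, dict[str, str]]:
    """Replace each listed identifier with a generic one (id_0, id_1, ...).

    Whole-word matching only; this naive sketch also rewrites matches
    inside string literals and comments, which a real tool would avoid.
    """
    mapping = {name: f"id_{i}" for i, name in enumerate(names)}
    if not mapping:
        return source, mapping
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, mapping)) + r")\b")
    renamed = pattern.sub(lambda m: mapping[m.group(1)], source)
    return renamed, mapping


# Invented example line; not taken from any SEED lab.
original = "int copy_input(char *user_buf) { return check(user_buf); }"
renamed, mapping = rename_identifiers(original, ["copy_input", "user_buf", "check"])
# renamed == "int id_0(char *id_1) { return id_2(id_1); }"
```

The program's behavior is unchanged, but descriptive names that a model might have seen in training data no longer appear in the text.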
The researchers then introduced a further LLM into the loop: an attacker model designed to prompt and re-prompt the target model, in order to refine and improve its output over multiple rounds. The LLM used for this role was GPT-4o, which operated through a script that mediated the dialogue between attacker and target, allowing the refinement cycle to continue up to […] times, or until no further improvement was judged possible:

Workflow for the LLM-based attacker, in this case GPT-4o.
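The control flow of such a mediated loop, stripped of any model specifics, might look like the sketch below. Everything here is hypothetical: `refine`, `toy_target` and `toy_critic` are illustrative names, the two roles are plain Python callables rather than real model calls, and the cap on rounds is left as a parameter rather than taken from the paper.

```python
from typing import Callable, Optional


def refine(task: str,
           target: Callable[[str], str],
           critic: Callable[[str, str], Optional[str]],
           max_rounds: int) -> tuple[str, int]:
    """Iteratively re-prompt `target` using feedback from `critic`.

    `critic` returns a follow-up prompt, or None when it judges that no
    further improvement is possible. Returns the last output and the
    number of refinement rounds used.
    """
    output = target(task)
    rounds = 0
    while rounds < max_rounds:
        follow_up = critic(task, output)
        if follow_up is None:
            break
        output = target(follow_up)
        rounds += 1
    return output, rounds


# Toy stand-ins so the loop can be run without any model.
def toy_target(prompt: str) -> str:
    return prompt + "!"


def toy_critic(task: str, output: str) -> Optional[str]:
    # Ask for another pass until the output carries three marks.
    return None if output.endswith("!!!") else output


final, used = refine("draft", toy_target, toy_critic, max_rounds=10)
```

The loop stops on whichever condition comes first, the round limit or the critic's judgment, which matches the two stopping conditions the article describes.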
The target models for the project were GPT-4o, GPT-4o-mini, Llama3 (8B), Dolphin-Mistral (7B), and Dolphin-Phi (2.7B), representing both proprietary and open-source systems, with a mix of aligned and unaligned models (i.e., models with built-in safety mechanisms designed to block harmful prompts, and models modified through fine-tuning or configuration to bypass those mechanisms).
The locally installable models were run via the Ollama framework, with the others accessed via their only available method, the API.
The resulting outputs were scored on the number of errors that prevented the exploit from functioning as intended.
Results
The researchers tested how cooperative each model was during the exploit generation process, measured by recording the percentage of responses in which the model attempted to assist with the task (even if the output was flawed).
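As a worked illustration of that metric, the percentage can be computed from per-response labels. The labels below are invented for the example and are not data from the paper; `cooperation_rate` is a hypothetical helper name.

```python
def cooperation_rate(attempted: list[bool]) -> float:
    """Percentage of responses in which the model tried to help."""
    if not attempted:
        raise ValueError("no responses recorded")
    return 100.0 * sum(attempted) / len(attempted)


# Hypothetical labels for one model: True = attempted, False = refused.
labels = [True, True, False, True]
rate = cooperation_rate(labels)  # 75.0
```

Note that a flawed but genuine attempt still counts as cooperation under this definition, so the metric measures willingness, not success.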

Results from the main test, showing average cooperation.
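GPT-4o and GPT-4o-mini showed the highest levels of cooperation, with average response rates of 97% and 96% respectively across the five vulnerability categories: buffer overflow, return-to-libc, format string, race condition, and Dirty COW.
Dolphin-Mistral and Dolphin-Phi followed closely, with average cooperation rates of 93% and 95%. Llama3 showed the least willingness to cooperate, with an overall cooperation rate of just 27%.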

On the left, the number of mistakes made by the LLMs on the original SEED Lab programs; on the right, the number of mistakes made on the refactored versions.
Examining the actual performance of these models, the researchers found a gap between willingness and effectiveness: GPT-4o produced the most accurate results, with a total of three errors across the five obfuscated labs. GPT-4o-mini followed with […] errors. Dolphin-Mistral performed reasonably well on the original labs, but struggled significantly once the code was refactored, suggesting that it may have seen similar content during training. Dolphin-Phi made […] errors, and Llama3 the most, with […].
The failures typically involved technical mistakes that rendered the exploits non-functional, such as incorrect buffer sizes, missing loop logic, or payloads that were syntactically valid but ineffective. No model succeeded in producing a working exploit for any of the obfuscated versions.
The authors observed that most models produced code resembling working exploits, but failed through a weak grasp of how the underlying attacks actually work. This pattern was evident across all vulnerability categories, and suggests that the models were imitating familiar code structures rather than reasoning through the logic involved (in the buffer overflow cases, for example, many failed to construct a functioning NOP sled/slide).
In the return-to-libc attempts, payloads often included incorrect padding or misplaced function addresses, resulting in outputs that appeared valid but were unusable.
While the authors describe this interpretation as speculative, the consistency of the errors suggests a broader issue, in which the models fail to connect the steps of an exploit with their intended effect.
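Conclusion
The paper concedes some doubt as to whether the language models tested saw the original SEED labs during their initial training, which is why the variants were constructed. Nonetheless, the researchers say that they would like to work with real-world exploits in later iterations of this study, since truly novel and recent material is less likely to be subject to shortcuts or other confounding effects.
The authors also acknowledge that later and more advanced 'thinking' models such as GPT-o1 and DeepSeek-r1, which were not available at the time the study was conducted, may improve on the results obtained, and that this is a further pointer for future work.
The paper concludes, in effect, that most of the models tested would have produced working exploits had they been capable of doing so. Their failure to generate fully functional output does not appear to result from alignment safeguards, but rather points to a genuine architectural limitation, one that may already have been reduced in more recent models, or soon will be.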
First published Monday, May 5, 2025