ã¯ããã«
çæAIã䜿ãéããããªãã¯åªç§ãªã¢ã·ã¹ã¿ã³ãã§ãããããã®å°éå®¶ãšããŠåçããŠããšåœ¹å²ïŒãã«ãœãïŒãæå®ãããã¯ããã¯ã¯åºãç¥ãããŠããŸãããããããã®ããã«ãœãã®æå®ãã¯ãäºå®ãåããããªå®¢èгçãªã¿ã¹ã¯ã«ãããŠãæ¬åœã«AIã®æçžŸåäžã«åœ¹ç«ã£ãŠããã®ã§ããããïŒ
æ¬èšäºã§ã¯ãè¿å¹Žèª¿æ»ãããäžèšè«æã®å
容ã解説ããŠããŸãã
When âA Helpful Assistantâ Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models.ïŒhttps://arxiv.org/abs/2311.10054ïŒ
èª¿æ»æŠèŠ
ç ç©¶ããŒã ã¯ã以äžã®4ã€ã®èŠçŽ ãçµã¿åãããŠå®¢èгçãªæ€èšŒãè¡ã£ãŠããŸãã
- 2,410åã®å®¢èгçãªåé¡ããŒã¿: æ³åŸãå»åŠãã³ã³ãã¥ãŒã¿ãµã€ãšã³ã¹ãæ°åŠãªã©8ã€ã®äž»èŠåéã«ãŸãããç¥èãã³ãããŒã¯ãã¹ãïŒMMLUïŒããã客芳çãªæ£è§£ãååšãã2,410åã®éžæåŒåé¡ãå³éžããŠäœ¿çš
- 162çš®é¡ã®ãã«ãœã: AIã«äžãã圹å²ãšããŠããå»åž«ããããœãããŠã§ã¢ãšã³ãžãã¢ããªã©ã®è·æ¥ã ãã§ãªãããç¶èŠªãã劻ããå人ããšãã£ã人éé¢ä¿ãããAIã¢ã·ã¹ã¿ã³ãããŸã§ãå€å²ã«ããã162çš®é¡ã®ãã«ãœããçšæ
- 4ã€ã®LLMïŒå€§èŠæš¡èšèªã¢ãã«ïŒ: Llama-3ãMistralãQwen2.5ãFLAN-T5ãšããã4ã€ã®ãªãŒãã³ãœãŒã¹ã¢ãã«ãã¡ããªãŒã䜿çš
- ããã³ããã®åœ¢åŒ: ãããªãã¯ããã§ãããšAIèªèº«ã«åœ¹å²ãäžãã圢åŒãšããããªãã¯ãããšè©±ããŠããŸãããšäŒè©±çžæãæå®ãã圢åŒãªã©ãçšæãããã«ãœããå šãæå®ããªããã³ã³ãããŒã«èšå®ããšæ¯èŒ
æ€èšŒçµæ
ãã®åæçµæã«ã€ããŠãäžèšã®ããã«èšãããŠããŸãã
âThrough our analysis, we find that, in general, prompting with personas has no or small negative effects on model performance compared with the control setting where no persona is added.â
ïŒç§ãã¡ã®åæãéããŠãäžè¬çã«ããã«ãœããçšããããã³ããã¯ããã«ãœãã远å ããªãã³ã³ãããŒã«èšå®ãšæ¯èŒããŠãã¢ãã«ã®ããã©ãŒãã³ã¹ã«åœ±é¿ãäžããªããããããã¯ãããã«æªåœ±é¿ãåãŒãããšãåãããŸãããïŒ
ã€ãŸããããã«ãœããæå®ããããšã§å®¢èгçã¿ã¹ã¯ïŒæ£è§£ãååšããåé¡ïŒã®æ£ççãäžããããšãã蚌æ ã¯èŠã€ãããªãã£ããšããã®ã§ãã
ãšã¯ããã162çš®é¡ã®ãã«ãœãéã§æçžŸãå®å šã«å šãåãã ã£ãããã§ã¯ãªãã£ãããã§ããã«ãœãã®ã屿§ãã现ããåæããçµæãæçžŸã«åœ±é¿ãäžããããã€ãã®åŸåãååšããããšãåãããŸããã
å ·äœçã«ã¯ãæ§å¥ãç¹å®ãããªããæ§å¥äžç«ãªåœ¹å²ãåªããŠãããããšãããä»äºã»åŠæ ¡ã«é¢é£ãã圹å²ããããã«è¯ãåŸåããªã©ãããã«åœãããŸãã
ä»ã«ãããæ³åŸã®è³ªåã«ã¯åŒè·å£«ãæå®ããããšãã£ãããã«ã質åå 容ãšãã«ãœãã®å°éåéãäžèŽãããããšããã®äžã€ã§ãã
èæãšããŠã¯ãåœç¶ã®ããã«å¹æãããããšæ³åã§ããŸãããå®éè«æã§ããå°éåéãäžèŽãã圹å²ã¯æŠããŠè¯ãçµæãããããããšãããŠããŸãããç¶ããŠä»¥äžã®ããã«éãåºããŠããŸãã
âHowever, the effect size of domain alignment is relatively smallâ
ïŒããããå°éåéã®äžèŽã«ãã广ã®å€§ããã¯æ¯èŒçå°ãªãïŒ
ã€ãŸãããå°éå®¶ããæå®ããããšã«ããæçžŸã®åºäžã广ã¯ãç§ãã¡ãæåŸ ããã»ã©å€§ããªãã®ã§ã¯ãªããšããã®ã§ãã
ã§ã¯ã©ããªãã«ãœãæå®ãæé©ãªã®ãïŒ
è«æã®çµè«éšåã§ã¯ã次ã®ããã«æèšãããŠããŸãã
âidentifying the best role remains challenging, with most selection strategies performing similarly to random selection. Such a result suggests that the effect of personas on model performance can be largely unpredictable.â
ïŒæé©ãªåœ¹å²ãç¹å®ããããšã¯äŸç¶ãšããŠå°é£ã§ãããã»ãšãã©ã®éžææŠç¥ã¯ã©ã³ãã ãªéžæãšåæ§ã®ããã©ãŒãã³ã¹ãã瀺ããŸããããã®çµæã¯ããã«ãœããã¢ãã«ã®ããã©ãŒãã³ã¹ã«äžãã圱é¿ã¯å€§éšåãäºæž¬äžå¯èœã§ããããšã瀺åããŠããŸããïŒ
ã€ãŸããç¹å®ã®è³ªåã«å¯ŸããŠããªããæ£è§£ãå°ãåºãããã«ãœããã¯ç¢ºãã«ååšãããã®ã®ãäºåã«ãããäºæž¬ããã«ãœããšããŠæå®ããã®ã¯é£ããããšããã®ã§ãã
æŽ»çšæ¹æ³
ã§ã¯ãããèžãŸããæ®æ®µã®AI掻çšãã©ãããã°è¯ãã®ãã
ïŒâ»ããããã¯çè
ã®èããå«ã¿ãŸãïŒ
æ¬èª¿æ»ã¯ãããŸã§ã客芳çãªã¿ã¹ã¯ãã察象ãšããŠããŸãã
ããã倧åæãšããäžã§ãAIå©çšè
ãšããŠã¯äžèšã®ããã«èããã®ããã¿ãŒã§ã¯ãªãã§ããããã
1. ãã«ãœãæå®ãªããèæ ®
ïŒç¹ã«äºå®ãåããããªå®¢èгçãªã¿ã¹ã¯ã«ãããŠãïŒç¡é§ãªãã«ãœãèšå®ãçããã·ã³ãã«ã«è³ªåã ããæããããæ¹ãçµæçã«ã¯ç¡é£ãªã®ãããããŸããã
åçŽã«ããã³ãããçããªãã®ãè¯ãã§ããã
2. åºåãã©ãŒãããã®èª¿æŽ
åºåãããæç« ã®ããŒã³ã調æŽããããç¹å®ã®å¯Ÿè©±ã¹ã¿ã€ã«ãå®çŸ©ãããããç®çã«ãããŠå©çšããéã¯åŒ·åãªãµããŒããåããããŸãã
ããã«ãœãã¯ãåºåãããçµæãèªåãåãåããã圢ãžå€æããèšå®å€ã ããšèããã°ãã©ããªãã«ãœããæå®ãããã§æ©ã¿ã¥ãããªãã®ãã¡ãªããã ãšæããŸãã
ãŸãšã
AIã®é²æ©ãåãŸããäœããã¹ããªéžæè¢ãæ©ãããšãå€ãã§ãããããªããšãªããã§å©çšããããšãç¡ããããåžžã«æ ¹æ ãä»çµã¿ãçè§£ããŠæŽ»çšããŠãããããšæããŸãã