Pixelor:一个竞争性素描AI代理。你觉得你会素描吗?

A. Bhunia, Ayan Das, U. Muhammad, Yongxin Yang, Timothy M. Hospedales, T. Xiang, Yulia Gryaditskaya, Yi-Zhe Song
{"title":"Pixelor:一个竞争性素描AI代理。你觉得你会素描吗?","authors":"A. Bhunia, Ayan Das, U. Muhammad, Yongxin Yang, Timothy M. Hospedales, T. Xiang, Yulia Gryaditskaya, Yi-Zhe Song","doi":"10.1145/3414685.3417840","DOIUrl":null,"url":null,"abstract":"We present the first competitive drawing agent Pixelor that exhibits human-level \nperformance at a Pictionary-like sketching game, where the participant whose sketch is recognized first is a winner. Our AI agent can autonomously \nsketch a given visual concept, and achieve a recognizable rendition as quickly \nor faster than a human competitor. The key to victory for the agent’s goal \nis to learn the optimal stroke sequencing strategies that generate the most \nrecognizable and distinguishable strokes first. Training Pixelor is done in two \nsteps. First, we infer the stroke order that maximizes early recognizability of \nhuman training sketches. Second, this order is used to supervise the training \nof a sequence-to-sequence stroke generator. Our key technical contributions \nare a tractable search of the exponential space of orderings using neural \nsorting; and an improved Seq2Seq Wasserstein (S2S-WAE) generator that \nuses an optimal-transport loss to accommodate the multi-modal nature of the \noptimal stroke distribution. Our analysis shows that Pixelor is better than the \nhuman players of the Quick, Draw! game, under both AI and human judging \nof early recognition. To analyze the impact of human competitors’ strategies, \nwe conducted a further human study with participants being given unlimited \nthinking time and training in early recognizability by feedback from an AI \njudge. The study shows that humans do gradually improve their strategies \nwith training, but overall Pixelor still matches human performance. The code \nand the dataset are available at http://sketchx.ai/pixelor.","PeriodicalId":7121,"journal":{"name":"ACM Trans. Graph.","volume":"5 1","pages":"166:1-166:15"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"23","resultStr":"{\"title\":\"Pixelor: a competitive sketching AI agent. so you think you can sketch?\",\"authors\":\"A. Bhunia, Ayan Das, U. Muhammad, Yongxin Yang, Timothy M. Hospedales, T. Xiang, Yulia Gryaditskaya, Yi-Zhe Song\",\"doi\":\"10.1145/3414685.3417840\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present the first competitive drawing agent Pixelor that exhibits human-level \\nperformance at a Pictionary-like sketching game, where the participant whose sketch is recognized first is a winner. Our AI agent can autonomously \\nsketch a given visual concept, and achieve a recognizable rendition as quickly \\nor faster than a human competitor. The key to victory for the agent’s goal \\nis to learn the optimal stroke sequencing strategies that generate the most \\nrecognizable and distinguishable strokes first. Training Pixelor is done in two \\nsteps. First, we infer the stroke order that maximizes early recognizability of \\nhuman training sketches. Second, this order is used to supervise the training \\nof a sequence-to-sequence stroke generator. Our key technical contributions \\nare a tractable search of the exponential space of orderings using neural \\nsorting; and an improved Seq2Seq Wasserstein (S2S-WAE) generator that \\nuses an optimal-transport loss to accommodate the multi-modal nature of the \\noptimal stroke distribution. Our analysis shows that Pixelor is better than the \\nhuman players of the Quick, Draw! game, under both AI and human judging \\nof early recognition. To analyze the impact of human competitors’ strategies, \\nwe conducted a further human study with participants being given unlimited \\nthinking time and training in early recognizability by feedback from an AI \\njudge. The study shows that humans do gradually improve their strategies \\nwith training, but overall Pixelor still matches human performance. The code \\nand the dataset are available at http://sketchx.ai/pixelor.\",\"PeriodicalId\":7121,\"journal\":{\"name\":\"ACM Trans. Graph.\",\"volume\":\"5 1\",\"pages\":\"166:1-166:15\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"23\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Trans. Graph.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3414685.3417840\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Trans. Graph.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3414685.3417840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 23

摘要

我们提出了第一个竞争性绘画代理Pixelor,它在类似pictionar的素描游戏中展示了人类水平的表现,其中参与者的素描首先被识别为获胜者。我们的人工智能代理可以自主绘制给定的视觉概念,并以与人类竞争对手一样快或更快的速度实现可识别的呈现。智能体获胜的关键是学习最优的笔划顺序策略,首先生成最可识别和可区分的笔划。Pixelor的训练分为两个步骤。首先,我们推断笔画顺序,最大限度地提高人类训练草图的早期可识别性。其次,该顺序用于监督序列到序列笔划生成器的训练。我们的主要技术贡献是使用神经排序对排序的指数空间进行易于处理的搜索;以及改进的Seq2Seq Wasserstein (S2S-WAE)发生器,该发生器使用最优传输损失来适应最优冲程分布的多模态性质。我们的分析表明,Pixelor比人类玩家在Quick, Draw!游戏,在人工智能和人类判断下的早期识别。为了分析人类竞争对手策略的影响,我们进行了进一步的人类研究,参与者被给予无限的思考时间,并通过人工智能裁判的反馈进行早期识别训练。研究表明,通过训练,人类确实会逐渐提高自己的策略,但总体而言,Pixelor的表现仍然与人类相当。代码和数据集可从http://sketchx.ai/pixelor获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Pixelor: a competitive sketching AI agent. so you think you can sketch?
We present the first competitive drawing agent Pixelor that exhibits human-level performance at a Pictionary-like sketching game, where the participant whose sketch is recognized first is a winner. Our AI agent can autonomously sketch a given visual concept, and achieve a recognizable rendition as quickly or faster than a human competitor. The key to victory for the agent’s goal is to learn the optimal stroke sequencing strategies that generate the most recognizable and distinguishable strokes first. Training Pixelor is done in two steps. First, we infer the stroke order that maximizes early recognizability of human training sketches. Second, this order is used to supervise the training of a sequence-to-sequence stroke generator. Our key technical contributions are a tractable search of the exponential space of orderings using neural sorting; and an improved Seq2Seq Wasserstein (S2S-WAE) generator that uses an optimal-transport loss to accommodate the multi-modal nature of the optimal stroke distribution. Our analysis shows that Pixelor is better than the human players of the Quick, Draw! game, under both AI and human judging of early recognition. To analyze the impact of human competitors’ strategies, we conducted a further human study with participants being given unlimited thinking time and training in early recognizability by feedback from an AI judge. The study shows that humans do gradually improve their strategies with training, but overall Pixelor still matches human performance. The code and the dataset are available at http://sketchx.ai/pixelor.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
LuisaRender: A High-Performance Rendering Framework with Layered and Unified Interfaces on Stream Architectures BoolSurf: Boolean Operations on Surfaces SkinMixer: Blending 3D Animated Models PopStage: The Generation of Stage Cross-Editing Video Based on Spatio-Temporal Matching QuadStream: A Quad-Based Scene Streaming Architecture for Novel Viewpoint Reconstruction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1