takminの書きっぱなし備忘録 @はてなブログ

発表者	論文タイトル	発表資料
tomoaki_teshima	Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature	https://www.slideshare.net/tomoaki0705/neural-global-shutter-learn-to-restore-video-from-a-rolling-shutter-camera-with-global-reset-feature
yo_itz	IntentVizor: Towards Generic Query Guided Interactive Video Summarization	https://speakerdeck.com/yo_itz/cvpr2022lun-wen-du-mihui-suraido-intentvisor
smygw	FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment	https://www.slideshare.net/ssuserda79d5/finediving-a-finegrained-dataset-for-procedureaware-action-quality-assessment-cvpr2022
hasegawa_k35	E-CIR: Event-Enhanced Continuous Intensity Recovery
dhirooka	GroupViT: Semantic Segmentation Emerges from Text Supervision	https://speakerdeck.com/daigo0927/groupvit-cvpr2022du-mihui-suraido
Godel	Event-Aided Direct Sparse Odometry	https://speakerdeck.com/godel/lun-wen-du-mihui-event-aided-direct-sparse-odometry
tereka	Mobile-Former: Bridging MobileNet and Transformer	https://speakerdeck.com/tereka114/mobile-former-bridging-mobilenet-and-transformer
YamatoOkamoto	A Self-Supervised Descriptor for Image Copy Detection	https://speakerdeck.com/roadroller/di-11hui-quan-ri-ben-konpiyutabiziyonmian-qiang-hui-cvpr2022-a-self-supervised-descriptor-for-image-copy-detection

発表者

論文タイトル

発表資料

tomoaki_teshima

Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature

https://www.slideshare.net/tomoaki0705/neural-global-shutter-learn-to-restore-video-from-a-rolling-shutter-camera-with-global-reset-feature

yo_itz

IntentVizor: Towards Generic Query Guided Interactive Video Summarization

https://speakerdeck.com/yo_itz/cvpr2022lun-wen-du-mihui-suraido-intentvisor

smygw

FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment

https://www.slideshare.net/ssuserda79d5/finediving-a-finegrained-dataset-for-procedureaware-action-quality-assessment-cvpr2022

hasegawa_k35

E-CIR: Event-Enhanced Continuous Intensity Recovery

dhirooka

GroupViT: Semantic Segmentation Emerges from Text Supervision

https://speakerdeck.com/daigo0927/groupvit-cvpr2022du-mihui-suraido

Godel

Event-Aided Direct Sparse Odometry

https://speakerdeck.com/godel/lun-wen-du-mihui-event-aided-direct-sparse-odometry

tereka

Mobile-Former: Bridging MobileNet and Transformer

https://speakerdeck.com/tereka114/mobile-former-bridging-mobilenet-and-transformer

YamatoOkamoto

A Self-Supervised Descriptor for Image Copy Detection

https://speakerdeck.com/roadroller/di-11hui-quan-ri-ben-konpiyutabiziyonmian-qiang-hui-cvpr2022-a-self-supervised-descriptor-for-image-copy-detection

関東、名古屋、関西のコンピュータビジョン勉強会合同で開催している全日本コンピュータビジョン勉強会の11回目です。

今回は恒例となりました、コンピュータビジョンのトップカンファレンス「CVPR2022」の論文読み会をオンラインで行いました。 CVPR2022読み会は毎回多くの発表希望者がいるため、今回も前後編の２回に分けて行われます。

今回は前編ということで、リンク等をまとめます。

登録サイト

kantocv.connpass.com

Togetter

togetter.com

Youtube

youtu.be

発表資料

全ての発表資料は、勉強会で使用した質疑応答用のSlack上にはアップされているのですが、ここにリンクした資料は発表者本人がtwitterで公開したもののみ記載しています。

発表者	論文タイトル	発表資料
takmin	Learning to Solve Hard Minimal Problems	https://speakerdeck.com/takmin/learning-to-solve-hard-minimal-problems
shade-tree	It Is Okay To Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection	https://speakerdeck.com/forest1988/it-is-okay-to-not-be-okay-overcoming-emotional-bias-in-affective-image-captioning-by-contrastive-data-collection
carnavi	TableFormer: Table Structure Understanding with Transformers	https://www.slideshare.net/RyoKawanami/tableformercarnavipdf
inoichan	Time3D: End-to-End Joint Monocular 3D Object Detection and Tracking for Autonomous Driving	https://speakerdeck.com/inoichan/cvpr2022du-mihui-time3d-end-to-end-joint-monocular-3d-object-detection-and-tracking-for-autonomous-driving
kzykmyzw	EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation	https://www.slideshare.net/KazuyukiMiyazawa/20220807japancvkzykmyzwepropnppdf
kano_sawa	DoubleField: Bridging the Neural Surface and Radiance Fields for High-Fidelity Human Reconstruction and Rendering	https://speakerdeck.com/kanosawa/doublefield-cvmian-qiang-hui-cvpr2022
alfredplpl	Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding - 論文デモ：「エターナルフォースブリザード」を作るためにImagenをご家庭で育てる方法 -	https://www.docswell.com/s/alfredplpl/59YQLK-2022-08-07-145902
n-watanabe	Learning Neural Light Fields with Ray-Space Embedding	https://temporal-nw-slide-japancv-cvpr2022-reading-20220807.netlify.app/
toni	Balanced Multimodal Learning via On-the-fly Gradient Modulation	https://www.slideshare.net/AntonioTejerodePablo/cvpr2022-paper-reading-balanced-multimodal-learning-all-japan-computer-vision-study-group-20220807
losnuevetoros	パーツ探し～ PubTables-1M: Towards comprehensive table extraction from unstructured documents と XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding と V-Doc: Visual questions answers with Documents と Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation は読んだし、TableFormer: Table Structure Understanding with Transformers と Neural Collaborative Graph Machines for Table Structure Recognition と Revisiting Document Image Dewarping by Grid Regularization と Fourier Document Restoration for Robust Document Dewarping and Recognition は気になったが読まなかった。	https://speakerdeck.com/yushiku/patutan-si

私の発表資料はこちらです。

speakerdeck.com

後編は8/21に開催予定です。

第１１回全日本コンピュータビジョン勉強会(後編) - connpass

発表者	論文タイトル	発表資料
tony_tiler	VAEs for multimodal disentanglement	https://www.slideshare.net/AntonioTejerodePablo/vaes-for-multimodal-disentanglement
maguro27	StyleNeRF: A Style-based 3D Aware Generator for High-resolution Image Synthesis	https://speakerdeck.com/maguro27/di-9hui-quan-ri-ben-konpiyutabiziyonmian-qiang-hui-stylenerf-a-style-based-3d-aware-generator-for-high-resolution-image-synthesis-fa-biao-zi-liao
shade-tree	GEODIFF: A GEOMETRIC DIFFUSION MODEL FOR MOLECULAR CONFORMATION GENERATION	https://speakerdeck.com/forest1988/geodiff-a-geometric-diffusion-model-for-molecular-conformation-generation
takmin	A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion	https://speakerdeck.com/takmin/a-conditional-point-diffusion-refinement-paradigm-for-3d-point-cloud-completion

発表者

論文タイトル

発表資料

tony_tiler

VAEs for multimodal disentanglement

https://www.slideshare.net/AntonioTejerodePablo/vaes-for-multimodal-disentanglement

maguro27

StyleNeRF: A Style-based 3D Aware Generator for High-resolution Image Synthesis

https://speakerdeck.com/maguro27/di-9hui-quan-ri-ben-konpiyutabiziyonmian-qiang-hui-stylenerf-a-style-based-3d-aware-generator-for-high-resolution-image-synthesis-fa-biao-zi-liao

shade-tree

GEODIFF: A GEOMETRIC DIFFUSION MODEL FOR MOLECULAR CONFORMATION GENERATION

https://speakerdeck.com/forest1988/geodiff-a-geometric-diffusion-model-for-molecular-conformation-generation

takmin

A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion

https://speakerdeck.com/takmin/a-conditional-point-diffusion-refinement-paradigm-for-3d-point-cloud-completion

発表者	論文タイトル	発表資料
shade-tree	Panoptic Narrative Grounding	https://speakerdeck.com/forest1988/panoptic-narrative-grounding
Godel	An Asynchronous Kalman Filter for Hybrid Event Camera	https://speakerdeck.com/godel/lun-wen-du-mihui-an-asynchronous-kalman-filter-for-hybrid-event-cameras
gash717	WarpedGANSpace: Finding Non-Linear RBF Paths in GAN Latent Space
yu4u	Swin Transformer: Hierarchical Vision Transformer using Shifted Windows	https://www.slideshare.net/ren4yu/swin-transformer-iccv21-best-paper
dhirooka	Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields	https://speakerdeck.com/daigo0927/mip-nerf-iccv2021lun-du-hui-suraido
maguro27	ICCV2021で発表されたEmbodied AI（EAI）の論文の全まとめ、ICCV2021で参加したEAIのコンペ参加報告
peisuke	GNeRF: GAN-based Neural Radiance Field without Posed Camera	https://speakerdeck.com/peisuke/gnerf-gan-based-neural-radiance-field-without-posed-camera
Yamato.OKAMOTO	DocFormer: End-to-End Transformer for Document Understanding	https://speakerdeck.com/roadroller/di-9hui-quan-ri-ben-konpiyutabiziyonmian-qiang-hui-fa-biao-zi-liao
s_aiueo32	Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution
losnuevetoros	ただただVision&Languateの論文を読んでみた	https://speakerdeck.com/yushiku/iccv-2021-tadatadavision-and-languagefalselun-wen-wodu-ndemita

発表者

論文タイトル

発表資料

shade-tree

Panoptic Narrative Grounding

https://speakerdeck.com/forest1988/panoptic-narrative-grounding

Godel

An Asynchronous Kalman Filter for Hybrid Event Camera

https://speakerdeck.com/godel/lun-wen-du-mihui-an-asynchronous-kalman-filter-for-hybrid-event-cameras

gash717

WarpedGANSpace: Finding Non-Linear RBF Paths in GAN Latent Space

yu4u