资源目录:
├── 基于LangChain和知识图谱的大模型医疗问答机器人项目 │ ├── 源代码 │ ├── 大模型实战P25LangChain之给Agent加Memory.mp4 │ ├── 大模型实战P11LangChain之Prompt和LLMChain.mp4 │ ├── 大模型实战P45问答机器人项目面试考点总结.mp4 │ ├── 大模型实战P36从用户问题中抽取命名实体词槽.mp4 │ ├── 大模型实战P37CQL词槽填充和相关问题筛选.mp4 │ ├── 大模型实战P1LangChain与知识图谱问答机器人项目.mp4 │ ├── 大模型实战P13LangChain之FewShotPrompt.mp4 │ ├── 大模型实战P41用户消息的补全和归纳总结.mp4 │ ├── 大模型实战P48快速接入百川和Claude大模型.mp4 │ ├── Neo4j实战P7-1Windows和Mac本地安装Neo4j数据库.mp4 │ ├── 大模型实战P44LangChain框架版本升级.mp4 │ ├── 大模型实战P24LangChain之多Agent协作.mp4 │ ├── 大模型实战P12LangChain之多参数与LCEL.mp4 │ ├── 大模型实战P32定义环境变量和模型获取函数.mp4 │ ├── 大模型实战P47一种解决Agent响应慢的方法.mp4 │ ├── 大模型实战P19LangChain之FAISS文档召回.mp4 │ ├── 大模型实战P28LangChain之GraphCypherQAChain.mp4 │ ├── 大模型实战P31项目LangChainAgent架构简介.mp4 │ ├── 大模型实战P40用Agent串联业务处理函数.mp4 │ ├── 大模型实战P43LangSmith监控大模型应用程序.mp4 │ ├── 大模型实战P20LangChain之文档加载和分割.mp4 │ ├── 大模型实战P30Gradio之ChatInterface对话界面.mp4 │ ├── 大模型实战P15LangChain之ConversationChain.mp4 │ ├── 大模型实战P27LangChain之输出提示词重写.mp4 │ ├── 大模型实战P8OpenAI接口实现TextEmbeddings.mp4 │ ├── 大模型实战P18LangChain之问答QAChain.mp4 │ ├── 大模型实战P9根据OpenAI句向量召回相似文本.mp4 │ ├── 大模型实战P26LangChain之命名实体识别.mp4 │ ├── 大模型实战P6OpenAI接口调用Token计算.mp4 │ ├── 大模型实战P46共性问题修复和统一答疑.mp4 │ ├── 大模型实战P39Google搜索回答非在库问题.mp4 │ ├── 大模型实战P10LangChain简介与初体验.mp4 │ ├── 大模型实战P16LangChain之Memory.mp4 │ ├── 大模型实战P17LangChain之LLMRequestsChain.mp4 │ ├── 大模型实战P2基础课和项目课的内容概述.mp4 │ ├── 大模型实战P21LangChain之文档检索问答.mp4 │ ├── 大模型实战P7OpenAI接口实现多轮对话.mp4 │ ├── Neo4j实战P7-2Windows和Mac本地安装Neo4j数据库.mp4 │ ├── 医疗问答P7CSV文件导入到Neo4j数据库.mp4 │ ├── 大模型实战P22LangChain之向量保存和加载.mp4 │ ├── 大模型实战P5OpenAI对话接口代码优化.mp4 │ ├── 大模型实战P3大语言模型通识和课前准备.mp4 │ ├── 大模型实战P42Gradio对话窗口修改和测试.mp4 │ ├── 大模型实战P29Gradio简介与初体验.mp4 │ ├── 大模型实战P14LangChain之SequentialChain.mp4 │ ├── 大模型实战P38查询Neo4j回答医疗相关问题.mp4 │ ├── 大模型实战P35Chroma召回数据回答公司相关问题.mp4 │ ├── 大模型实战P34通用大模型回答日常交际问题.mp4 │ ├── 大模型实战P33公司相关文档向量化和存储.mp4 │ ├── 大模型实战P4OpenAI对话接口简单使用方法.mp4 │ ├── 大模型实战P23LangChain之Agent和自定义Tool.mp4 ├── 大模型面试笔记书籍 │ ├── 大模型论文 │ │ ├── CVPR 2024 (最佳+oral+highlight)(持续更新) │ │ │ ├── 1 CVPR'24 获奖论文 │ │ │ │ ├── 4 最佳学生论文次优奖 │ │ │ │ │ ├── Objects as volumes: A stochastic geometry view of opaque solids.pdf │ │ │ │ │ ├── Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf │ │ │ │ ├── 2 最佳学生论文奖 │ │ │ │ │ ├── BIOCLIP:A Vision Foundation Model for the Tree of Life.pdf │ │ │ │ │ ├── Mip-Splatting:Alias-free 3D Gaussian Splatting.pdf │ │ │ │ ├── 3 最佳论文次优奖 │ │ │ │ │ ├── pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf │ │ │ │ ├── 1 最佳论文奖 │ │ │ │ │ ├── Rich Human Feedback for Text-to-Image Generation.pdf │ │ │ │ │ ├── Generative Image Dynamics.pdf │ │ │ ├── 3 CVPR'24 oral论文(更新完毕) │ │ │ │ ├── 18 多模态学习 │ │ │ │ │ ├── Describing Differences in Image Sets with Natural Language.pdf │ │ │ │ │ ├── NoiseCLR:A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models.pdf │ │ │ │ │ ├── MetaCloak.pdf │ │ │ │ │ ├── InternVL:Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.pdf │ │ │ │ ├── 1 低层次视觉 │ │ │ │ │ ├── Specularity Factorization for Low-Light Enhancement.pdf │ │ │ │ │ ├── FMA-Net:Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf │ │ │ │ │ ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf │ │ │ │ │ ├── FlowIE:Efficient Image Enhancement via Rectified Flow.pdf │ │ │ │ │ ├── Towards Robust Event-guided Low-Light Image Enhancement.pdf │ │ │ │ ├── 11三维视觉 │ │ │ │ │ ├── A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion.pdf │ │ │ │ ├── 16 低层次视觉与遥感 │ │ │ │ │ ├── DART:Implicit Doppler Tomography for Radar Novel View Synthesis.pdf │ │ │ │ │ ├── LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network.pdf │ │ │ │ ├── 14 多视角三维技术和传感器 2 │ │ │ │ │ ├── Learning to Produce Semi-dense Correspondences for Visual Localization.pdf │ │ │ │ ├── 15 低样本学习、自监督学习和半监督学习 │ │ │ │ │ ├── CroSel.pdf │ │ │ │ │ ├── LTGC:Long-tail Recognition via Leveraging LLMs-driven Generated Content.pdf │ │ │ │ │ ├── Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps.pdf │ │ │ │ ├── 6 多视角三维技术和传感器 │ │ │ │ │ ├── Seeing the World through Your Eyes.pdf │ │ │ │ │ ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf │ │ │ │ │ ├── Steerers:A Framework for Rotation Equivariant Keypoint Descriptors.pdf │ │ │ │ │ ├── Point Transformer V3:Simpler Faster Stronger.pdf │ │ │ │ │ ├── Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences.pdf │ │ │ │ ├── 5 深度学习架构与技术 │ │ │ │ │ ├── Neural Lineage.pdf │ │ │ │ │ ├── Learning Structure-from-Motion with Graph Attention Networks.pdf │ │ │ │ │ ├── Neural Redshift:Random Networks are not Random Functions.pdf │ │ │ │ │ ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf │ │ │ │ │ ├── Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks.pdf │ │ │ │ ├── 7 单视角三维技术 │ │ │ │ │ ├── WALT3D:Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion.pdf │ │ │ │ │ ├── EscherNet:A Generative Model for Scalable View Synthesis.pdf │ │ │ │ │ ├── Rethinking Inductive Biases for Surface Normal Estimation.pdf │ │ │ │ ├── 10 自主导航和自我中心视觉 │ │ │ │ │ ├── SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection.pdf │ │ │ │ │ ├── EgoGen:An Egocentric Synthetic Data Generator.pdf │ │ │ │ │ ├── UnO:Unsupervised Occupancy Fields for Perception and Forecasting.pdf │ │ │ │ ├── 3 人类行为和特征 │ │ │ │ │ ├── Stratified Avatar Generation from Sparse Observations.pdf │ │ │ │ │ ├── Semantic Human Mesh Reconstruction with Textures.pdf │ │ │ │ │ ├── URHand:Universal Relightable Hands.pdf │ │ │ │ │ ├── MultiPly:Reconstruction of Multiple People from Monocular Video in the Wild.pdf │ │ │ │ │ ├── Relightable Gaussian Codec Avatars.pdf │ │ │ │ ├── 2 视觉与图形 │ │ │ │ │ ├── Eclipse:Disambiguating Illumination and Materials using Unintended Shadows.pdf │ │ │ │ │ ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf │ │ │ │ │ ├── DiffusionLight:Light Probes for Free by Painting a Chrome Ball.pdf │ │ │ │ ├── 9 医学与物理视觉 │ │ │ │ │ ├── Transcriptomics-guided Slide Representation Learning in Computational Pathology.pdf │ │ │ │ ├── 17 图像与视频合成 2 │ │ │ │ │ ├── MonoHair:High-Fidelity Hair Modeling from a Monocular Video.pdf │ │ │ │ │ ├── Alchemist:Parametric Control of Material Properties with Diffusion Models.pdf │ │ │ │ │ ├── Visual Anagrams:Generating Multi-View Optical Illusions with Diffusion Models.pdf │ │ │ │ ├── 8 视觉、语言与推理 │ │ │ │ │ ├── Visual Program Distillation:Distilling Tools and Programmatic Reasoning into Vision-Language Models.pdf │ │ │ │ │ ├── LISA:Reasoning Segmentation via Large Language Model.pdf │ │ │ │ │ ├── Eyes Wide Shut Exploring the Visual Shortcomings of Multimodal LLMs.pdf │ │ │ │ ├── 12 动作和运动分析 │ │ │ │ │ ├── An N-Point Linear Solver for Line and Motion Estimation with Event Cameras.pdf │ │ │ │ │ ├── FineParser:A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment.pdf │ │ │ │ │ ├── Modeling Multimodal Social Interactions:New Challenges and Baselines with Densely Aligned Representations.pdf │ │ │ │ │ ├── RoHM:Robust Human Motion Reconstruction via Diffusio.pdf │ │ │ │ ├── 4 图像与视频合成 │ │ │ │ │ ├── Ranni:Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf │ │ │ │ │ ├── Attention Calibration for Disentangled Text-to-Image Personalization.pdf │ │ │ │ │ ├── FreeU:Free Lunch in Diffusion U-Net.pdf │ │ │ │ │ ├── Instruct-Imagen: Image Generation with Multi-modal Instruction.pdf │ │ │ │ │ ├── Style Aligned Image Generation via Shared Attention.pdf │ │ │ │ ├── 13 数据集和评估 │ │ │ │ │ ├── 360+x:A Panoptic Multi-modal Scene Understanding Dataset.pdf │ │ │ │ │ ├── Deep Generative Model based Rate-Distortion for Image Downscaling Assessment.pdf │ │ │ │ │ ├── Ego-Exo4D:Understanding Skilled Human Activity from First- and Third-Person Perspectives.pdf │ │ │ ├── 4 CVPR'24 highlight论文(更新中) │ │ │ │ ├── ODIN A Single Model for 2D and 3D Segmentation.pdf │ │ │ │ ├── Enforcing Geometric and Physical Priors.pdf │ │ │ │ ├── Scaling Up Dynamic Human-Scene Interaction Modeling.pdf │ │ │ │ ├── CADTalk An Algorithm and Benchmark for Semantic Commenting of CAD Programs.pdf │ │ │ │ ├── LucidDreamer Towards High-Fidelity Text-to-3D Generation via Interval Score Matching.pdf │ │ │ │ ├── pix2gestalt Amodal Segmentation by Synthesizing Wholes.pdf │ │ │ │ ├── Semantic-aware SAM for Point-Prompted Instance Segmentation.pdf │ │ │ │ ├── Self-Supervised Dual Contouring.pdf │ │ │ │ ├── Multi-view Aggregation Network for Dichotomous Image Segmentation.pdf │ │ │ │ ├── From Correspondences to Pose Non-minimal Certifiably Optimal Relative Pose without Disambiguation.pdf │ │ │ │ ├── 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation.pdf │ │ │ │ ├── Suppress and Rebalance Towards Generalized Multi-Modal Face Anti-Spoofing.pdf │ │ │ │ ├── GraCo Granularity-Controllable Interactive Segmentation.pdf │ │ │ │ ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf │ │ │ │ ├── RAVE Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models.pdf │ │ │ │ ├── DiffusionLight Light Probes for Free by Painting a Chrome Ball.pdf │ │ │ │ ├── Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.pdf │ │ │ │ ├── Towards Robust Event-guided Low-Light Image Enhancement A Large-Scale Real-World Event-Image Dataset and Novel Approach.pdf │ │ │ │ ├── Eclipse Disambiguating Illumination and Materials using Unintended Shadows.pdf │ │ │ │ ├── Boosting Neural Representations for Videos with a Conditional Decoder.pdf │ │ │ │ ├── Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation.pdf │ │ │ │ ├── LocLLM Exploiting Generalizable Human Keypoint Localization via Large Language Model.pdf │ │ │ │ ├── HandDiff 3D Hand Pose Estimation with Diffusion on Image-Point Cloud.pdf │ │ │ │ ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf │ │ │ │ ├── ViT-CoMer Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.pdf │ │ │ │ ├── NRDF Neural Riemannian Distance Fields for Learning Articulated Pose Priors.pdf │ │ │ │ ├── Unbiased Estimator for Distorted Conics in Camera Calibration.pdf │ │ │ │ ├── Restoration by Generation with Constrained Priors.pdf │ │ │ │ ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf │ │ │ │ ├── Time-, Memory- and Parameter-Efficient Visual Adaptation.pdf │ │ │ │ ├── FreeU Free Lunch in Diffusion U-Net.pdf │ │ │ │ ├── EAGLE Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation.pdf │ │ │ │ ├── Human Motion Prediction Under Unexpected Perturbation.pdf │ │ │ │ ├── XCube Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies.pdf │ │ │ │ ├── Relightable and Animatable Neural Avatar from Sparse-View Video.pdf │ │ │ │ ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf │ │ │ │ ├── Breathing Life Into Sketches Using Text-to-Video Priors.pdf │ │ │ │ ├── Efficient Deformable ConvNets Rethinking Dynamic and Sparse Operator for Vision Applications.pdf │ │ │ │ ├── Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.pdf │ │ │ │ ├── HOLD Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Vide.pdf │ │ │ │ ├── DreamPropeller Supercharge Text-to-3D Generation with Parallel Sampling.pdf │ │ │ │ ├── Ranni Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf │ │ │ │ ├── Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes.pdf │ │ │ │ ├── Specularity Factorization for Low-Light Enhancement.pdf │ │ │ │ ├── Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis.pdf │ │ │ │ ├── HashPoint Accelerated Point Searching and Sampling for Neural Rendering.pdf │ │ │ │ ├── 3D Human Pose Perception from Egocentric Stereo Videos.pdf │ │ │ │ ├── Tri-Modal Motion Retrieval by Learning a Joint Embedding Space.pdf │ │ │ │ ├── Learning Structure-from-Motion with Graph Attention Networks.pdf │ │ │ │ ├── Real-Time Simulated Avatar from Head-Mounted Sensors.pdf │ │ │ │ ├── Frequency-Adaptive Dilated Convolution for Semantic Segmentation.pdf │ │ │ │ ├── Move as You Say, Interact as You Can Language-guided Human Motion Generation with Scene Affordance.pdf │ │ │ │ ├── FinePOSE Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models.pdf │ │ │ │ ├── 4D-DRESS A 4D Dataset of Real-world Human Clothing with Semantic Annotations.pdf │ │ │ │ ├── PhysGaussian Physics-Integrated 3D Gaussians for Generative Dynamics.pdf │ │ │ │ ├── GAvatar Animatable 3D Gaussian Avatars with Implicit Mesh Learning.pdf │ │ │ │ ├── Fantastic Animals and Where to Find Them Segment Any Marine Animal with Dual SAM.pdf │ │ │ │ ├── General Object Foundation Model for Images and Videos at Scale.pdf │ │ │ │ ├── FMA-Net Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf │ │ │ │ ├── Objects as volumes A stochastic geometry view of opaque solids.pdf │ │ │ │ ├── Point Transformer V3 Simpler, Faster, Stronger.pdf │ │ │ │ ├── CFPL-FAS Class Free Prompt Learning for Generalizable Face Anti-spoofing.pdf │ │ │ │ ├── Seeing the World through Your Eyes.pdf │ │ │ │ ├── Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning.pdf │ │ │ │ ├── Steerers A framework for rotation equivariant keypoint descriptors.pdf │ │ │ │ ├── In-Context Matting.pdf │ │ │ │ ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf │ │ │ │ ├── Matching 2D Images in 3D Metric Relative Pose from Metric Correspondences.pdf │ │ │ │ ├── Point2CAD Reverse Engineering CAD Models from 3D Point Clouds.pdf │ │ │ │ ├── Putting the Object Back into Video Object Segmentation.pdf │ │ │ │ ├── MMM Generative Masked Motion Model.pdf │ │ │ │ ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf │ │ │ │ ├── CAT-Seg Cost Aggregation for Open-Vocabulary Semantic Segmentation.pdf │ │ │ │ ├── Neural Redshift Random Networks are not Random Functions.pdf │ │ │ │ ├── Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations.pdf │ │ │ │ ├── No Time to Train Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation.pdf │ │ │ │ ├── LeGO Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example.pdf │ │ │ │ ├── Attention-Propagation Network for Egocentric Heatmap to 3D.pdf │ │ │ │ ├── CAD-SIGNet CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention.pdf │ │ │ ├── 2 CVPR'24 最佳论文提名(更新完毕) │ │ │ │ ├── 2 开源代码 │ │ │ │ │ ├── Marigold-main.zip │ │ │ │ │ ├── egtr-main.zip │ │ │ │ │ ├── pixelsplat-main.zip │ │ │ │ │ ├── mip-splatting-main.zip │ │ │ │ │ ├── lambda_vit-main mlp.zip │ │ │ │ │ ├── Registration-CorrMLP-master.zip │ │ │ │ │ ├── PlatoNeRF-main.zip │ │ │ │ │ ├── NVlabs-edm2-main.zip │ │ │ │ │ ├── MemSAM-main.zip │ │ │ │ │ ├── PaSCo-main.zip │ │ │ │ │ ├── MMMU-main.zip │ │ │ │ │ ├── bioclip-main.zip │ │ │ │ │ ├── MapUncertaintyPrediction-main.zip │ │ │ │ │ ├── NeRF-HuGS-master.zip │ │ │ │ │ ├── spider-match-main.zip │ │ │ │ ├── 1 提名论文 │ │ │ │ │ ├── 19 EGTR:Extracting Graph from Transformer for Scene Graph Generation.pdf │ │ │ │ │ ├── 12 Grounding and Enhancing Grid-based Models for Neural Fields.pdf │ │ │ │ │ ├── 2 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation.pdf │ │ │ │ │ ├── 4 MMMU A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.pdf │ │ │ │ │ ├── 14 Mip-Splatting:Alias-free 3D Gaussian Splatting.pdf │ │ │ │ │ ├── 11 BIOCLIP:A Vision Foundation Model for the Tree of Life.pdf │ │ │ │ │ ├── 15 pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf │ │ │ │ │ ├── 13 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes.pdf │ │ │ │ │ ├── 1 Objects as volumes: A stochastic geometry view of opaque solids.pdf │ │ │ │ │ ├── 18 Analyzing and Improving the Training Dynamics of Diffusion Models.pdf │ │ │ │ │ ├── 8 PlatoNeRF 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar.pdf │ │ │ │ │ ├── 16 MLPCanBeAGoodTransformer Learner.pdf │ │ │ │ │ ├── 5 Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration.pdf │ │ │ │ │ ├── 9 Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation.pdf │ │ │ │ │ ├── 6 Producing and Leveraging Online Map Uncertainty in Trajectory Prediction.pdf │ │ │ │ │ ├── 3 Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf │ │ │ │ │ ├── 10 Rich Human Feedback for Text-to-Image Generation.pdf │ │ │ │ │ ├── 17 Generative Image Dynamics.pdf │ │ │ │ │ ├── 7 PaSCo:Urban 3D Panoptic Scene Completion with Uncertainty Awareness.pdf │ │ ├── 50篇大型语言模型提示工程必读 │ │ │ ├── Prompting in Autoregressive Large Language.pdf │ │ │ ├── Exploring Visual Prompts for Adapting Large-Scale Models.pdf │ │ │ ├── Large Language Models Understand and Can Be Enhanced by Emotional Stimuli.pdf │ │ │ ├── LPML LLM-PROMPTING MARKUP LANGUAGE FOR.pdf │ │ │ ├── Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.pdf │ │ │ ├── Joint Prompt Optimization of Stacked LLMs.pdf │ │ │ ├── Contrastive Chain-of-Thought Prompting.pdf │ │ │ ├── TAKE A STEP BACK- EVOKING REASONING VIA ABSTRACTION IN LARGE LANGUAGE MODELS.pdf │ │ │ ├── Reprompting Automated Chain-of-Thought Prompt.pdf │ │ │ ├── Program of Thoughts Prompting- Disentangling Computation from Reasoning for Numerical Reasoning Tasks.pdf │ │ │ ├── LARGE LANGUAGE MODELS AS TOOL MAKERS.pdf │ │ │ ├── A Systematic Survey of Prompt Engineering in Large Language Models- Techniques and Applications.pdf │ │ │ ├── Rephrase and Respond- Let Large Language Models Ask Better Questions for Themselves.pdf │ │ │ ├── CHAIN-OF-NOTE- ENHANCING ROBUSTNESS IN RETRIEVAL-AUGMENTED LANGUAGE MODELS.pdf │ │ │ ├── PROMPTBREEDER.pdf │ │ │ ├── Prompt Engineering Through the Lens of Optimal.pdf │ │ │ ├── Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf │ │ │ ├── SELF-CONSISTENCY IMPROVES CHAIN OF THOUGHT REASONING IN LANGUAGE MODELS.pdf │ │ │ ├── Prompting Is Programming A Query Language for.pdf │ │ │ ├── Chain of Code- Reasoning with a Language Model-Augmented Code Emulator.pdf │ │ │ ├── ART- Automatic multi-step reasoning and tool-use for large language models.pdf │ │ │ ├── Visual ChatGPT- Talking, Drawing and Editing with Visual Foundation Models.pdf │ │ │ ├── Structured Chain-of-Thought Prompting for Code Generation.pdf │ │ │ ├── Unleashing the potential of prompt engineering in Large Language Models- a comprehensive review.pdf │ │ │ ├── Active Prompting with Chain-of-Thought for Large Language Models.pdf │ │ │ ├── CHAIN-OF-SYMBOL PROMPTING FOR SPATIAL RELATIONSHIPS IN LARGE LANGUAGE MODELS.pdf │ │ │ ├── Language Models are Few-Shot Learners.pdf │ │ │ ├── Thread of Thought Unraveling Chaotic Contexts.pdf │ │ │ ├── Pre-train, Prompt, and Predict- A Systematic Survey of Prompting Methods in Natural Language Processing.pdf │ │ │ ├── Chain of Code Reasoning with.pdf │ │ │ ├── REAC T- SYNERGIZING REASONING AND ACTING IN LANGUAGE MODELS.pdf │ │ │ ├── CHAIN-OF-VERIFICATION REDUCES HALLUCINATION IN LARGE LANGUAGE MODELS.pdf │ │ │ ├── Large Language Model Guided Tree-of-Thought.pdf │ │ │ ├── CHAIN-OF-KNOWLEDGE- GROUNDING LARGE LANGUAGE MODELS VIA DYNAMIC KNOWLEDGE ADAPTING OVER HETEROGENEOUS SOURCES.pdf │ │ │ ├── System 2 Attention (is something you might need too).pdf │ │ │ ├── UPAR A KANTIAN-INSPIRED PROMPTING FRAME.pdf │ │ │ ├── A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.pdf │ │ │ ├── CHAIN-OF-TABLE- EVOLVING TABLES IN THE REASONING CHAIN FOR TABLE UNDERSTANDING.pdf │ │ │ ├── OlaGPT Empowering LLMs With Human-like Problem-Solving.pdf │ │ │ ├── A Systematic Survey of Prompt Engineering in Large Language Models Techniques and Applications.pdf │ │ │ ├── Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models.pdf │ │ │ ├── Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic.pdf │ │ │ ├── Boosting Logical Reasoning in Large Language Models through a New.pdf │ │ │ ├── SHOW YOUR WORK- SCRATCHPADS FOR INTERMEDIATE COMPUTATION WITH LANGUAGE MODELS.pdf │ │ │ ├── IMPLICIT CHAIN OF THOUGHT REASONING.pdf │ │ │ ├── Tree of Thoughts- Deliberate Problem Solving with Large Language Models.pdf │ │ │ ├── A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models.pdf │ │ │ ├── LARGE LANGUAGE MODELS ARE HUMAN-LEVEL PROMPT ENGINEERS.pdf │ │ │ ├── AUTOMATIC CHAIN OF THOUGHT PROMPTING IN LARGE LANGUAGE MODELS.pdf │ │ │ ├── LARGE LANGUAGE MODELS AS OPTIMIZERS.pdf │ │ ├── ICLR 2024(更新中) │ │ │ ├── The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation.pdf │ │ │ ├── Memory Efficient Optimizers with 4-bit States.pdf │ │ │ ├── Language Is Not All You Need:Aligning Perception with Language Models.pdf │ │ │ ├── Is Your Code Generated by ChatGPT Really Correct Rigorous Evaluation of Large Language Models for Code Generation.pdf │ │ │ ├── Fine-Tuning Language Models with Just Forward Passes.pdf │ │ │ ├── Hierarchical Integration Diffusion Model for Realistic Image Deblurring.pdf │ │ │ ├── Textually Pretrained Speech Language Models.pdf │ │ │ ├── VisionLLM:Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks.pdf │ │ │ ├── Cappy:Outperforming and Boosting Large Multi-Task LMs with a Small Scorer.pdf │ │ │ ├── One-2-3-45:Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization.pdf │ │ │ ├── Direct Preference Optimization:Your Language Model is Secretly a Reward Model.pdf │ │ │ ├── SimMTM:A Simple Pre-Training Framework for Masked Time-Series Modeling.pdf │ │ │ ├── ProPILE:Probing Privacy Leakage in Large Language Models.pdf │ │ │ ├── SnapFusion:Text-to-Image Diffusion Model on Mobile Devices within Two Seconds.pdf │ │ │ ├── Efficient Diffusion Policies for Offline Reinforcement Learning.pdf │ │ │ ├── Focused Transformer:Contrastive Training for Context Scaling.pdf │ │ │ ├── LayoutPrompter:Awaken the Design Ability of Large Language Models.pdf │ │ │ ├── Segment Everything Everywhere All at Once.pdf │ │ │ ├── RAPHAEL:Text-to-Image Generation via Large Mixture of Diffusion Paths.pdf │ │ │ ├── Towards Revealing the Mystery behind Chain of Thought:a Theoretical Perspective.pdf │ │ │ ├── Elastic Decision Transformer.pdf │ │ │ ├── Training Transformers with 4-bit Integers.pdf │ │ │ ├── In-Context Impersonation Reveals Large Language Models' Strengths and Biases.pdf │ │ │ ├── DaTaSeg:Taming a Universal Multi-Dataset Multi-Task Segmentation Model.pdf │ │ │ ├── How to Turn Your Knowledge Graph Embeddings into Generative Models.pdf │ │ │ ├── EvoPrompting:Language Models for Code-Level Neural Architecture Search.pdf │ │ │ ├── Learning to Tokenize for Generative Retrieval.pdf │ │ │ ├── VanillaNet:the Power of Minimalism in Deep Learning.pdf │ │ │ ├── Unlimiformer:Long-Range Transformers with Unlimited Length Input.pdf │ │ │ ├── RRHF:Rank Responses to Align Language Models with Human Feedback without tears.pdf │ │ │ ├── Language Models Meet World Models:Embodied Experiences Enhance Language Models.pdf │ │ │ ├── Does Graph Distillation See Like Vision Dataset Counterpart.pdf │ │ │ ├── Stable and low-precision training for large-scale vision-language models.pdf │ │ │ ├── Towards Label Position Bias in Graph Neural Networks.pdf │ │ │ ├── Guiding Large Language Models via Directional Stimulus Prompting.pdf │ │ │ ├── Bridging Discrete and Backpropagation:Straight-Through and Beyond.pdf │ │ │ ├── Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization.pdf │ │ │ ├── Foundation Model is Efficient Multimodal Multitask Model Selector.pdf │ │ │ ├── Scaling Data-Constrained Language Models.pdf │ │ │ ├── Differentiable Blocks World:Qualitative 3D Decomposition by Rendering Primitives.pdf │ │ │ ├── MVDiffusion:Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion.pdf │ │ │ ├── Chameleon:Plug-and-Play Compositional Reasoning with Large Language Models.pdf │ │ │ ├── Vision-Flan:Scaling Human-Labeled Tasks in Visual Instruction Tuning.pdf │ │ │ ├── MarioGPT:Open-Ended Text2Level Generation through Large Language Models.pdf │ │ │ ├── Recommender Systems with Generative Retrieval.pdf │ │ │ ├── AlpacaFarm:A Simulation Framework for Methods that Learn from Human Feedback.pdf │ │ │ ├── Grammar Prompting for Domain-Specific Language Generation with Large Language Models.pdf │ │ │ ├── QLoRA:Efficient Finetuning of Quantized LLMs.pdf │ │ │ ├── Can Language Models Solve Graph Problems in Natural Language.pdf │ │ │ ├── DPM-Solver-v3:Improved Diffusion ODE Solver with Empirical Model Statistics.pdf │ │ │ ├── 3D-LLM:Injecting the 3D World into Large Language Models.pdf │ │ │ ├── ToolkenGPT:Augmenting Frozen Language Models with Massive Tools via Tool Embeddings.pdf │ │ │ ├── HuggingGPT:Solving AI Tasks with ChatGPT and its Friends in HuggingFace.pdf │ │ │ ├── Sample-efficient Multi-objective Molecular Optimization with GFlowNets.pdf │ │ │ ├── Tailoring Self-Attention for Graph via Rooted Subtrees.pdf │ │ │ ├── SheetCopilot:Bringing Software Productivity to the Next Level through Large Language Models.pdf │ │ │ ├── MotionGPT:Human Motion as a Foreign Language.pdf │ │ │ ├── Fine-Grained Human Feedback Gives Better Rewards for Language Model Training.pdf │ │ │ ├── Learning Large Graph Property Prediction via Graph Segment Training.pdf │ │ │ ├── White-Box Transformers via Sparse Rate Reduction.pdf │ │ │ ├── Meta In-Context Learning:Harnessing Large Language Models for Electrical Data Classification.pdf │ │ │ ├── Deductive Verification of Chain-of-Thought Reasoning.pdf │ │ │ ├── Fairness-guided Few-shot Prompting for Large Language Models.pdf │ │ │ ├── No Train No Gain:Revisiting Efficient Training Algorithms For Transformer-based Language Models.pdf │ │ │ ├── ImageReward:Learning and Evaluating Human Preferences for Text-to-Image Generation.pdf │ │ │ ├── Are aligned neural networks adversarially aligned.pdf │ │ │ ├── Convolutions Die Hard:Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP.pdf │ │ │ ├── Large Language Models of Code Fail at Completing Code with Potential Bugs.pdf │ │ │ ├── A Decomposable Causal View of Compositional Zero-Shot Learning.pdf │ │ │ ├── HyenaDNA:Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution.pdf │ │ │ ├── Tree of Thoughts:Deliberate Problem Solving with Large Language Models.pdf │ │ │ ├── LIMA:Less Is More for Alignment.pdf │ │ │ ├── Improving CLIP Training with Language Rewrites.pdf │ │ │ ├── Language models are weak learners.pdf │ │ │ ├── Reverse Engineering Self-Supervised Learning.pdf │ │ │ ├── ProlificDreamer:High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation.pdf │ │ │ ├── Large Language Models as Commonsense Knowledge for Large-Scale Task Planning.pdf │ │ │ ├── AR-Diffusion:Auto-Regressive Diffusion Model for Text Generation.pdf │ │ │ ├── Reflexion:language agents with verbal reinforcement learning.pdf │ │ │ ├── Symbolic Discovery of Optimization Algorithms.pdf │ │ │ ├── Language Models Don't Always Say What They Think:Unfaithful Explanations in Chain-of-Thought Prompting.pdf │ │ │ ├── InstructBLIP:Towards General-purpose Vision-Language Models with Instruction Tuning.pdf │ │ │ ├── Cheap and Quick:Efficient Vision-Language Instruction Tuning for Large Language Models.pdf │ │ │ ├── Inference-Time Intervention:Eliciting Truthful Answers from a Language Model.pdf │ │ │ ├── DoReMi:Optimizing Data Mixtures Speeds Up Language Model Pretraining.pdf │ │ │ ├── Toolformer:Language Models Can Teach Themselves to Use Tools.pdf │ │ │ ├── Transformers learn through gradual rank increase.pdf │ │ │ ├── Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.pdf │ │ │ ├── GPT4Tools:Teaching Large Language Model to Use Tools via Self-instruction.pdf │ │ │ ├── STEVE-1:A Generative Model for Text-to-Behavior in Minecraft.pdf │ │ │ ├── Self-Refine:Iterative Refinement with Self-Feedback.pdf │ │ │ ├── Are Emergent Abilities of Large Language Models a Mirage.pdf │ │ │ ├── Augmenting Language Models with Long-Term Memory.pdf │ │ │ ├── UniControl:A Unified Diffusion Model for Controllable Visual Generation In the Wild.pdf │ │ │ ├── DiffComplete:Diffusion-based Generative 3D Shape Completion.pdf │ │ │ ├── Any-to-Any Generation via Composable Diffusion.pdf │ │ │ ├── SANeRF-HQ:Segment Anything for NeRF in High Quality.pdf │ │ │ ├── Voicebox:Text-Guided Multilingual Universal Speech Generation at Scale.pdf │ │ │ ├── MEGABYTE:Predicting Million-byte Sequences with Multiscale Transformers.pdf │ │ │ ├── VisorGPT:Learning Visual Prior via Generative Pre-Training.pdf │ │ │ ├── Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition.pdf │ │ │ ├── Simple and Controllable Music Generation.pdf │ │ │ ├── Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models.pdf │ │ │ ├── Flocks of Stochastic Parrots:Differentially Private Prompt Learning for Large Language Models.pdf │ │ │ ├── SwiftSage:A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.pdf │ │ │ ├── EmbodiedGPT:Vision-Language Pre-Training via Embodied Chain of Thought.pdf │ │ ├── 20篇llm必读 │ │ │ ├── AWQ Activation-aware Weight Quantization.pdf │ │ │ ├── The Internal State of an LLM Knows When It’s Lying.pdf │ │ │ ├── OpenAGI When LLM Meets Domain Experts.pdf │ │ │ ├── X-LLM.pdf │ │ │ ├── Wider and Deeper LLM Networks.pdf │ │ │ ├── Judging LLM-as-a-Judge.pdf │ │ │ ├── Jailbroken How Does LLM Safety Training Fail.pdf │ │ │ ├── Can LLM Already Serve as A Database Interface.pdf │ │ │ ├── LLM-grounded Diffusion Enhancing Prompt Understanding of.pdf │ │ │ ├── Why Johnny Can’t Prompt.pdf │ │ │ ├── NExT-GPT Any-to-Any Multimodal LLM.pdf │ │ │ ├── Large Language Models are Few-shot Testers.pdf │ │ │ ├── AutoGen Enabling Next-Gen LLM.pdf │ │ │ ├── Song_LLM-Planner_Few-Shot_Grounded_Planning_for_Embodied_Agents_with_Large_Language_ICCV_2023_paper.pdf │ │ │ ├── CHATEVAL TOWARDS BETTER LLM-BASED EVALUATORS THROUGH MULTI-AGENT DEBATE.pdf │ │ │ ├── Large language models (LLM) and ChatGPT what will the impact.pdf │ │ │ ├── LLM-Pruner On the Structural Pruning.pdf │ │ │ ├── The RefinedWeb Dataset for Falcon LLM.pdf │ │ │ ├── LLM-BL E N D E R Ensembling Large Language Models.pdf │ │ │ ├── LLM-Adapters An Adapter Family for Parameter-Efficient Fine-Tuning of.pdf │ │ ├── ICLR 2024 │ │ │ ├── 【时间检验奖】Auto-Encoding Variational Bayes.pdf │ │ ├── AAAI 2024 111篇 │ │ │ ├── Parallel Ranking of Ads and Creative Services for Real-time.pdf │ │ │ ├── AT4CTR Auxiliary Match Tasks for Enhancing Click-Through Rate Prediction.pdf │ │ │ ├── Upper Bounding Barlow Twins:A Novel Filter for Multi-relational.pdf │ │ │ ├── Non-Excludable Bilateral Trade Between Groups.pdf │ │ │ ├── Identification of Causal Structure in the Presence of Missing Data with Additive.pdf │ │ │ ├── Few-shot Part Segmentation Reveals Compositional Logic for Industrial.pdf │ │ │ ├── Learning Human-like Representations to Enable Learning Human Values.pdf │ │ │ ├── OVD-Explorer:Optimism Should Not Be the Sole Pursuit of Exploration.pdf │ │ │ ├── Federated Learning with Extremely Noisy Clients via Negative Distillation.pdf │ │ │ ├── EarthVQA:Towards Queryable Earth via Relational Reasoning-Based Remote.pdf │ │ │ ├── MDGNN:Multi-Relational Dynamic Graph Neural Network for Comprehensive and Dynamic Stock Investment Prediction.pdf │ │ │ ├── Towards Fairness in Online Service with k Servers and its Application.pdf │ │ │ ├── Unified framework for diffusion generative models in SO(3).pdf │ │ │ ├── Text2Analysis:A Benchmark of Table Question Answering with Advanced.pdf │ │ │ ├── Spectral-based Graph Neutral Networks for Complementary Item.pdf │ │ │ ├── ECHO-GL Earnings Calls-Driven Heterogeneous Graph Learning for Stock.pdf │ │ │ ├── Point Cloud Part Editing:Segmentation, Generation, Assembly, and.pdf │ │ │ ├── IS-DARTS:Stabilizing DARTS through Precise Measurement.pdf │ │ │ ├── Robust Active Measuring under Model Uncertainty.pdf │ │ │ ├── MASTER:Market-Guided Stock Transformer for Stock Price Forecasting.pdf │ │ │ ├── Provably Convergent Federated Trilevel Learning.pdf │ │ │ ├── Exploring Gradient Explosion in Generative Adversarial Imitation.pdf │ │ │ ├── AE-NeRF:Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis.pdf │ │ │ ├── Learning Fair Policies for Multi-stage Problem Solving from.pdf │ │ │ ├── AI-Based Energy Transportation Safety:Pipeline Radial Threat.pdf │ │ │ ├── EFFECT SIZE ESTIMATION FOR DURATION RECOMMENDATION.pdf │ │ │ ├── When Model Meets New Normals:Test-Time Adaptation for Unsupervised.pdf │ │ │ ├── Fluctuation-based Adaptive Structured Pruning for Large Language.pdf │ │ │ ├── ContraNovo:A Contrastive Learning Approach to Enhance De Novo Peptide.pdf │ │ │ ├── CR-SAM: Curvature Regularized Sharpness-aware Minimization.pdf │ │ │ ├── HuTuMotion:Human-Tuned Motion of Latent Motion Diffusions with.pdf │ │ │ ├── Enhancing Job Recommendation through.pdf │ │ │ ├── H-ensemble: An Information Theoretic Approach to Reliable Few-Shot.pdf │ │ │ ├── Temporally and Distributionally Robust Optimization for Cold-start.pdf │ │ │ ├── Structure-CLIP:Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations.pdf │ │ │ ├── Probabilistic Offline Policy Ranking with Approximate Bayesian.pdf │ │ │ ├── Foreseeing Reconstruction Quality of Gradient Inversion.pdf │ │ │ ├── Successive POI Recommendation via Brain-inspired Spatiotemporal Aware Representation.pdf │ │ │ ├── No More Shortcuts:Realizing the Potential of Temporal Self-Supervision.pdf │ │ │ ├── PPEA-Depth:Progressive Parameter-efficient Adaptation for.pdf │ │ │ ├── FedDiv:Collaborative Noise Filtering for Federated Learning with Noisy Labels.pdf │ │ │ ├── Cached Transformers:Improving Transformers with Differentiable Memory.pdf │ │ │ ├── Market-GAN Adding Control to Financial Market Data Generation with.pdf │ │ │ ├── CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark.pdf │ │ │ ├── Uncertainty Quantification for Data-Driven Change-Point Learning via.pdf │ │ │ ├── Regulating Intermediate 3D Features for Vision-Centric Autonomous.pdf │ │ │ ├── Imitation of Life:A Search Engine for Biologically Inspired Design.pdf │ │ │ ├── Blind-Touch:Homomorphic Encryption-Based Distributed Neural Network.pdf │ │ │ ├── Domain Invariant Learning for Gaussian Processes and Bayesian.pdf │ │ │ ├── Effectiveness of Constant Stepsize in Markovian LSA and Statistical.pdf │ │ │ ├── On Partial Optimal Transport:Revising the Infeasibility of Sinkhorn.pdf │ │ │ ├── Peer Learning Learning Complex Policies in Groups from Scratch via Action.pdf │ │ │ ├── Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants.pdf │ │ │ ├── MmAP:Multi-modal Alignment Prompt for Cross-domain Multi-task Learning.pdf │ │ │ ├── DataElixir:Purifying Poisoned Dataset to Mitigate Backdoor Attacks.pdf │ │ │ ├── Estimation of individual causal effects in network setup for multiple.pdf │ │ │ ├── VITA:Carefully Chosen and Weighted Less Is Better in Medication.pdf │ │ │ ├── SeGA:Preference-Aware Self-Contrasting Learning with Prompts for.pdf │ │ │ ├── Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers.pdf │ │ │ ├── Fine-Grained Knowledge Selection and Restoration for Non-exemplar.pdf │ │ │ ├── Augmented Negative Sampling for Collaborative Filtering.pdf │ │ │ ├── Chasing Fairness in Graphs: A GNN Architecture Perspective.pdf │ │ │ ├── LGMRec Local and Global Graph Learning for Multimodal Recommendation.pdf │ │ │ ├── Fine-tuning Graph Neural Networks by Preserving Graph Generative.pdf │ │ │ ├── Hierarchical and Incremental Structural Entropy Minimization for Unsupervised Social Event Detection.pdf │ │ │ ├── Coreference Graph Guidance for Mind-Map Generation.pdf │ │ │ ├── Doubly Perturbed Task Free Continual Learning.pdf │ │ │ ├── Explaining Reinforcement Learning Agents Through Counterfactual Action Outcomes.pdf │ │ │ ├── Progressive Poisoned Data Isolation for Training-time Backdoor Attack.pdf │ │ │ ├── COOPER: Coordinating Specialized Agents towards a Complex Dialogue Goal.pdf │ │ │ ├── BadRL:Sparse Targeted Backdoor Attack Against Reinforcement Learning.pdf │ │ │ ├── Learning Multimodal Volumetric Features for Large-Scale Neuron Tracing.pdf │ │ │ ├── Curriculum Learning and Imitation Learning for Model-free Control on Financial Time-series.pdf │ │ │ ├── An Attentive Inductive Bias for Sequential Recommendation.pdf │ │ │ ├── Entropic Open-set Active Learning.pdf │ │ │ ├── EarnHFT Efficient Hierarchical Reinforcement Learning for High Frequency Trading.pdf │ │ │ ├── Distributional Off-Policy Evaluation for Slate Recommendations.pdf │ │ │ ├── Robust Loss Functions for Training Decision Trees with Noisy Labels.pdf │ │ │ ├── VITA ‘Carefully Chosen and Weighted Less’ Is Better.pdf │ │ │ ├── Big Learning Expectation Maximization.pdf │ │ │ ├── Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.pdf │ │ │ ├── Competition among Pairwise Lottery Contests.pdf │ │ │ ├── Envy-free House Allocation under Uncertainty Preferences.pdf │ │ │ ├── Learning Domain-Independent Heuristics for Grounded and Lifted Planning.pdf │ │ │ ├── RadOcc:Learning Cross-Modality Occupancy Knowledge through Rendering.pdf │ │ │ ├── Root Cause Explanation of Outliers under Noisy Mechanisms.pdf │ │ │ ├── Exploring Large Language Model for Graph Data Understanding.pdf │ │ │ ├── Q-SENN: Quantized Self-explaining Neural Networks.pdf │ │ │ ├── Knowledge Graph Error Detection with Contrastive Confidence Adaption.pdf │ │ │ ├── Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition.pdf │ │ │ ├── STEM Unleashing the Power of Embeddings for Multi-task Recommendation.pdf │ │ │ ├── Protect Your Score: Contact Tracing with Differential Privacy.pdf │ │ │ ├── Inducing Point Operator Transformer:A Flexible and Scalable Architecture for Solving PDEs.pdf │ │ │ ├── Weakly Supervised Open-Vocabulary Object Detection.pdf │ │ │ ├── Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning.pdf │ │ │ ├── Ada-Ranker A Data Distribution Adaptive Ranking Paradigm.pdf │ │ │ ├── Topic Shifts as a Proxy for Assessing Politicization in Social Media.pdf │ │ │ ├── No prejudice! Fair Federated Graph Neural Networks for Personalized.pdf │ │ │ ├── Fortify Your Defenses:Strategic Allocation to Enhance Defense Grid.pdf │ │ │ ├── MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities.pdf │ │ │ ├── CI-STHPAN Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph.pdf │ │ │ ├── Towards Efficient Verification of Quantized Neural Networks.pdf │ │ │ ├── On the Role of Server Momentum in Federated Learning.pdf │ │ │ ├── Roll With the Punches:Expansion and Shrinkage of Soft Label Selection.pdf │ │ │ ├── Bi-directional Adapter for Multi-modal Tracking.pdf │ │ │ ├── FontDiffuser: One-Shot Font Generation via Denoising Diffusion with.pdf │ │ │ ├── Signed Graph Neural Ordinary Differential Equation for Modeling.pdf │ │ │ ├── Continuous Time Graph Representation with Sequential Survival Process.pdf │ │ │ ├── FontDiffuser:One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning.pdf │ │ │ ├── Brush Your Text:Synthesize Any Scene Text on Images via Diffusion Model.pdf │ │ │ ├── LAMM:Label Alignment for Multi-Modal Prompt Learning.pdf │ │ ├── 大模型MoE必读论文 │ │ │ ├── 【直播课原文】Pushing Mixture of Experts to the Limit Extremely Parameter Efficient MoE for Instruction Tuning.pdf │ │ ├── LISA:大模型微调40篇 │ │ │ ├── 在大型视觉语言模型中评估物体幻觉.pdf │ │ │ ├── MiniGPT-v2:大型语言模型作为视觉语言多任务学习的统一接口.pdf │ │ │ ├── SPHINX:多模态大型语言模型的权重、任务和视觉嵌入的联合混合.pdf │ │ │ ├── 利用显式推理链和可视化问题生成推进大型多模态模型.pdf │ │ │ ├── 睁大眼睛?探索多模态LLMs的视觉缺陷.pdf │ │ │ ├── LLaMA-VID:在大型语言模型中,一个图像值 2 个令牌.pdf │ │ │ ├── LST:用于参数和内存高效迁移学习的梯形图侧调.pdf │ │ │ ├── VL-PET:通过粒度控制进行视觉和语言参数高效调整.pdf │ │ │ ├── mPLUG-Owl2:通过模态协作彻底改变多模态大型语言模型.pdf │ │ │ ├── CaMML:适用于大型模型的情境感知多模态学习器.pdf │ │ │ ├── Ziya-Visual:通过多任务指令调优的双语大型视觉语言模型.pdf │ │ │ ├── Qwen-VL:用于理解、定位、文本阅读等的多功能视觉语言模型.pdf │ │ │ ├── Lyrics-通过语义感知视觉对象促进细粒度语言-视觉对齐和理解.pdf │ │ │ ├── MMBench:你的多模态模型是一个全能的玩家吗?.pdf │ │ │ ├── OtterHD:高分辨率多模态模型.pdf │ │ │ ├── 通过视觉指令调整改进基线.pdf │ │ │ ├── 可视化指令调优.pdf │ │ │ ├── 对比视觉-语言对齐使教学成为学习者的高效.pdf │ │ │ ├── MiniGPT-4:使用高级大型语言模型增强视觉语言理解.pdf │ │ │ ├── SVIT:扩展可视化指令调优.pdf │ │ │ ├── InfMLLM:可视化语言任务的统一框架.pdf │ │ │ ├── ReForm-Eval:通过统一重新制定面向任务的基准来评估大型视觉语言模型.pdf │ │ │ ├── InstructBLIP:通过指令调整实现通用视觉语言模型.pdf │ │ │ ├── Compacter:高效的低秩超复杂适配器层.pdf │ │ │ ├── Shikra:释放多模态LLM的参照对话魔力.pdf │ │ │ ├── Genixer:将多模态大型语言模型赋能为强大的数据生成器提供支持.pdf │ │ │ ├── 眼见为实:提示 GPT-4V 进行更好的视觉指令调整.pdf │ │ │ ├── SEED-Bench:对多模态LLMs进行生成式理解的基准测试.pdf │ │ │ ├── UniPT:具有高效参数和存储器的迁移学习通用并行调优.pdf │ │ │ ├── LISA: Layerwise Importance Sampling for Memory-efficient Large Language Model Fine-Tuning.pdf │ │ │ ├── GlitchBench:大型多模态模型可以检测视频游戏故障吗?.pdf │ │ │ ├── Video-LLaVA:通过投影前的对齐来学习统一的视觉表示.pdf │ │ │ ├── 视觉语言预训练模型的近似提示调整.pdf │ │ │ ├── VL-ADAPTER:用于视觉和语言任务的参数高效迁移学习.pdf │ │ │ ├── ShareGPT4V:使用更好的字幕改进大型多模态模型.pdf │ │ │ ├── 关于多模态语言模型的性能.pdf │ │ │ ├── Visual Instruction Tuning with Polite Flamingo.pdf │ │ │ ├── MM-Vet:评估大型多模态模型的集成能力.pdf │ │ │ ├── HyperPELT:针对语言和视觉与语言任务的统一参数高效语言模型调优.pdf │ │ │ ├── DoRA- Weight-Decomposed Low-Rank Adaptation.pdf │ │ ├── ECCV24 收录论文83篇(更新中) │ │ │ ├── 推荐工作 │ │ │ │ ├── FontStudio Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation.pdf │ │ │ │ ├── LEGO Learning EGOcentric Action FrameGeneration via Visual Instruction Tuning.pdf │ │ │ │ ├── FSGS Real Time Few shot View Synthesis using Gaussian Splatting.pdf │ │ │ │ ├── Glyph-ByT5 A Customized Text Encoder for Accurate Visual Text Rendering.pdf │ │ │ │ ├── ZipLoRA Any Subject in Any Style by Effectively Merging LoRAs..pdf │ │ │ │ ├── DreamScene360 Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting.pdf │ │ │ │ ├── SwapAnything Enabling Arbitrary Object Swapping in Personalized Visual Editing.pdf │ │ │ │ ├── DiffiT Diffusion Vision Transformers for Image Generation.pdf │ │ │ ├── Contrastive Region Guidance:Improving Grounding in Vision-Language Models without Training.pdf │ │ │ ├── MIPI 2024 Challenge on Demosaic for Hybridevs Camera: Methods and Results.pdf │ │ │ ├── BLINK:Multimodal Large Language Models Can See but Not Perceive.pdf │ │ │ ├── CityGaussian:Real-time High-quality Large-Scale Scene Rendering with Gaussians.pdf │ │ │ ├── Align, Minimize and Diversify A Source-Free Unsupervised Domain Adaptation Method for Handwritten Text Recognition.pdf │ │ │ ├── DATENeRF:Depth-Aware Text-based Editing of NeRFs.pdf │ │ │ ├── Dyadic Interaction Modeling for Social Behavior Generation.pdf │ │ │ ├── DragAnything:Motion Control for Anything.pdf │ │ │ ├── GiT:Towards Generalist Vision Transformer through Universal Language Interface.pdf │ │ │ ├── SuperGaussian:Repurposing Video Models for 3D Super Resolution.pdf │ │ │ ├── EvAC3D From Event-based Apparent Contours to 3D Models via Continuous Visual Hulls.pdf │ │ │ ├── GScream:Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal.pdf │ │ │ ├── N2F2:Hierarchical Scene Understanding with Nested Neural Feature Fields.pdf │ │ │ ├── Object-Centric Diffusion for Efficient Video Editing.pdf │ │ │ ├── SALVe: Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas.pdf │ │ │ ├── Listen to Look into the Future:Audio-Visual Egocentric Gaze Anticipation.pdf │ │ │ ├── MixDQ:Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization.pdf │ │ │ ├── DreamMotion:Space-Time Self-Similarity Score Distillation for Zero-Shot Video Editing.pdf │ │ │ ├── PEAVS:Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.pdf │ │ │ ├── FreeInit:Bridging Initialization Gap in Video Diffusion Models.pdf │ │ │ ├── SpecFormer Guarding Vision Transformer Robustness via Maximum Singular Value Penalization.pdf │ │ │ ├── Empowering 3D Visual Grounding with Reasoning Capabilities.pdf │ │ │ ├── Introducing HOT3D:An Egocentric Dataset for 3D Hand and Object Tracking.pdf │ │ │ ├── Rasterized Edge Gradients:Handling Discontinuities Differentiably.pdf │ │ │ ├── A Task is Worth One Word:Learning with Task Prompts for High-Quality Versatile Image Inpainting.pdf │ │ │ ├── An Image is Worth 1`2 Tokens After Layer 2:Plug and Play Inference Acceleration for Large Vision Language Models.pdf │ │ │ ├── Neural Graphics Texture Compression Supporting Random Access.pdf │ │ │ ├── LA3 Efficient Label-Aware AutoAugment.pdf │ │ │ ├── Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision.pdf │ │ │ ├── Learning Neural Volumetric Pose Features for Camera Localization.pdf │ │ │ ├── UniDream:UnifyingDiffusionPriorsforRelightableText-to-3DGeneration.pdf │ │ │ ├── Prompt Federated Learning for Weather Forecasting:Toward Foundation Models on Meteorological Data.pdf │ │ │ ├── Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance.pdf │ │ │ ├── DGInStyle:Domain Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control.pdf │ │ │ ├── Robo-ABC:Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation.pdf │ │ │ ├── Agent3D-Zero: An automatic agent leverages VLM for zero-shot 3D understanding.pdf │ │ │ ├── Compact3D:Smaller and Faster Gaussian Splatting with Vector Quantization.pdf │ │ │ ├── Pix2Gif:Motion-Guided Diffusion for GIF Generation.pdf │ │ │ ├── TriNeRFLet:A Wavelet Based Multiscale Triplane NeRF Representation.pdf │ │ │ ├── ClusteringSDF:Self-Organized Neural Implicit Surfaces for 3D Decomposition.pdf │ │ │ ├── Map-free Visual Relocalization:Metric Pose Relative to a Single Image.pdf │ │ │ ├── T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy.pdf │ │ │ ├── MVSplat:Efficient 3D Gaussian Splatting from Sparse Multi-View Images.pdf │ │ │ ├── Training Full Spike Neural Networks via Auxiliary Accumulation Pathway.pdf │ │ │ ├── ScanTalk:3D Talking Heads from Unregistered Scans.pdf │ │ │ ├── VITATECS:A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models.pdf │ │ │ ├── DenseNets Reloaded:Paradigm Shift Beyond ResNets and ViTs.pdf │ │ │ ├── HYPE:Hyperbolic Entailment Filtering for Underspecified Images and Texts.pdf │ │ │ ├── Open-Vocabulary SAM:Segment and Recognize Twenty-thousand Classes Interactively.pdf │ │ │ ├── Controllable Human-Object Interaction Synthesis.pdf │ │ │ ├── DragAPart:Learning a Part-Level Motion Prior for Articulated Objects.pdf │ │ │ ├── DragVideo:Interactive Drag-style Video Editing.pdf │ │ │ ├── GalLoP:Learning Global and Local Prompts.pdf │ │ │ ├── GLAD: Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection.pdf │ │ │ ├── CoLLaVO:Crayon Large Language and Vision mOdel.pdf │ │ │ ├── WordRobe:Text-Guided Generation of Textured 3D Garments.pdf │ │ │ ├── AdaDistill:Adaptive Knowledge Distillation for Deep Face Recognition.pdf │ │ │ ├── AnyLens:A Generative Diffusion Model with Any Rendering Lens.pdf │ │ │ ├── PointLLM:Empowering Large Language Models to Understand Point Clouds.pdf │ │ │ ├── E.T. the Exceptional Trajectories:Text-to-camera-trajectory generation with character awareness.pdf │ │ │ ├── DreamReward:Text-to-3D Generation with Human Preference.pdf │ │ │ ├── Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning.pdf │ │ │ ├── Mismatch Quest:Visual and Textual Feedback for Image-Text Misalignment.pdf │ │ │ ├── Pyramid Diffusion for Fine 3D Large Scene Generation.pdf │ │ │ ├── MoAI:Mixture of All Intelligence for Large Language and Vision Models.pdf │ │ │ ├── NIGHT - Non-Line-of-Sight Imaging from Indirect Time of Flight Data.pdf │ │ │ ├── Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge.pdf │ │ │ ├── Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation.pdf │ │ │ ├── PaPr Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference.pdf │ │ │ ├── ZeST:Zero-Shot Material Transfer from a Single Image.pdf │ │ │ ├── GVGEN:A text-to-GS generation framework with volumetric representation.pdf │ │ │ ├── MotionLCM:Real-time Controllable Motion Generation via Latent Consistency Model.pdf │ │ │ ├── ManiGaussian:Dynamic Gaussian Splatting for Multi-task Robotic Manipulation.pdf │ │ │ ├── MOTIONDIRECTOR:MOTION CUSTOMIZATION OF TEXT-TO-VIDEO DIFFUSION MODELS.pdf │ │ ├── Code Llama论文(5月最新+内含24篇) │ │ │ ├── 2 LLaMA 1、2 论文&源码 │ │ │ │ ├── 源码:llama-main.zip │ │ │ │ ├── LLaMA: Open and Efficient Foundation Language Models.pdf │ │ │ │ ├── Llama 2:Open Foundation and Fine-Tuned Chat Models.pdf │ │ │ ├── 3 Code Llama 其他相关论文 │ │ │ │ ├── TinyLlama:An Open-Source Small Language Model.pdf │ │ │ │ ├── S3LLM: Large-Scale Scientific Software Understanding.pdf │ │ │ │ ├── IS SELF-REPAIR A SILVER BULLET FOR CODE GENERATION.pdf │ │ │ │ ├── MFTCODER: BOOSTING CODE LLMS WITH MULTITASK.pdf │ │ │ │ ├── README++:Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment.pdf │ │ │ │ ├── Open-TransMind:A New Baseline and Benchmark for 1st Foundation Model.pdf │ │ │ │ ├── Binary Code Summarization:Benchmarking ChatGPT、GPT-4 and Other Large Language Models.pdf │ │ │ │ ├── LLAMA PRO:Progressive LLaMA with Block Expansion.pdf │ │ │ │ ├── LLaMA-LoRA Neural Prompt Engineering.pdf │ │ │ │ ├── Open-SQL Framework: Enhancing Text-to-SQL on Open-source Large.pdf │ │ │ │ ├── A Comparative Analysis of Large Language Models for Code.pdf │ │ │ │ ├── A Controlled Experiment on the Energy Efficiency of the Source Code Generated by Code Llama.pdf │ │ │ │ ├── CRUXEval:A Benchmark for Code Reasoning.pdf │ │ │ │ ├── LLaMA-Reviewer:Advancing Code Review Automation with Large Language Models through Parameter-Efficient Fine-Tuning.pdf │ │ │ │ ├── LLaMA-Adapter:Efficient Fine-tuning of Language.pdf │ │ │ │ ├── LLaMA-Adapter V2:Parameter-Efficient Visual Instruction Model.pdf │ │ │ │ ├── Semantic Similarity Loss for Neural Source Code.pdf │ │ │ │ ├── Granite Code Models:A Family of Open.pdf │ │ │ │ ├── Evaluating In-Context Learning of Libraries for Code Generation.pdf │ │ │ │ ├── Making Large Language Models A Better Foundation For Dense Retrieval.pdf │ │ │ │ ├── DebugBench:Evaluating Debugging Capability of Large Language Models.pdf │ │ │ ├── 1 Code Llama 论文&源码 │ │ │ │ ├── 源码:codellama-main.zip │ │ │ │ ├── 论文:Code Llama:Open Foundation Models for Code.pdf │ │ ├── ICML 2024 67篇 │ │ │ ├── ICML'23 │ │ │ │ ├── 看不见的概括,逻辑推理和学位课程.pdf │ │ │ │ ├── 适应零和不完全信息博弈中的博弈树.pdf │ │ │ │ ├── 大型语言模型的水印.pdf │ │ │ │ ├── 像素递归神经网络.pdf │ │ │ │ ├── 混淆梯度给人一种虚假的安全感:规避对抗性示例的防御.pdf │ │ │ │ ├── D-Adaptation 的无学习率学习.pdf │ │ │ │ ├── 异质性治疗效果的因果等渗校准.pdf │ │ │ │ ├── 通过影响函数理解黑盒预测.pdf │ │ │ │ ├── Beyond Hawkes:时空点过程的神经多事件预测.pdf │ │ │ │ ├── 用于统一通用逼近的 Leaky-ReLU 神经网络的最小宽度.pdf │ │ │ │ ├── 通过噪声到噪声映射从噪声 3D 点云中学习有符号距离函数.pdf │ │ │ │ ├── 用于子集选择的可解释行列式选择模型.pdf │ │ │ │ ├── 正交解耦高斯过程的球形诱导特征.pdf │ │ │ ├── ICML'24 最佳论文+时间检验奖 │ │ │ │ ├── Scaling Rectified Flow Transformers for High-Resolution Image Synthesis.pdf │ │ │ │ ├── Debating with More Persuasive LLMs Leads to More Truthful Answers.pdf │ │ │ │ ├── Information Complexity of Stochastic Convex OptimizationP:Applications to Generalization, Memorization, and Tracing.pdf │ │ │ │ ├── Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo.pdf │ │ │ │ ├── Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution.pdf │ │ │ │ ├── DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition.pdf │ │ │ │ ├── VideoPoet:A Large Language Model for Zero-Shot Video Generation.pdf │ │ │ │ ├── Stealing part of a production language model.pdf │ │ │ │ ├── Genie:Generative Interactive Environments.pdf │ │ │ │ ├── Considerations for Differentially Private Learning with Large-Scale Public Pretraining.pdf │ │ │ │ ├── Position:Measure Dataset Diversity, Don't Just Claim It.pdf │ │ │ ├── ICML'24 oral(更新中) │ │ │ │ ├── Transformers Learn Nonlinear Features In Context:Nonconvex Mean-field Dynamics on the Attention Landscape.pdf │ │ │ │ ├── HowPrivate are DP-SGD Implementations.pdf │ │ │ │ ├── Monitoring AI-Modified Content at Scale:A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews.pdf │ │ │ │ ├── Hybrid2 Neural ODE Causal Modeling and an Application to Glycemic Response.pdf │ │ │ │ ├── GaLore:Memory-Efficient LLM Training by Gradient Low-Rank Projection.pdf │ │ │ │ ├── PrE-Text:Training Language Models on Private Federated Data in the Age of LLMs.pdf │ │ │ │ ├── FedMBridge:Bridgeable Multimodal Federated Learning.pdf │ │ │ │ ├── Position:Open-Endedness is Essential for Artificial Superhuman Intelligence.pdf │ │ │ │ ├── Less is More:on the Over-Globalizing Problem in Graph Transformers.pdf │ │ │ │ ├── Evolution of Heuristics:Towards Efficient Automatic Algorithm Design Using Large Language Model.pdf │ │ │ │ ├── Expressivity and Generalization:Fragment-Biases for Molecular GNNs.pdf │ │ │ │ ├── Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physics.pdf │ │ │ │ ├── Stop Regressing:Training Value Functions via Classification for Scalable Deep RL.pdf │ │ │ │ ├── Emergent Equivariance in Deep Ensembles.pdf │ │ │ │ ├── Improving Transformers with Dynamically Composable Multi-Head Attention.pdf │ │ │ │ ├── Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling.pdf │ │ │ │ ├── SAPG:Split and Aggregate Policy Gradients.pdf │ │ │ │ ├── Position:Automatic Environment Shaping is the Next Frontier in RL.pdf │ │ │ │ ├── Multiplicative Weights Update, Area Convexity and Random Coordinate Descent for Densest Subgraph Problems.pdf │ │ │ │ ├── Weak-to-Strong Generalization:Eliciting Strong Capabilities With Weak Supervision.pdf │ │ │ │ ├── Discovering Environments with XRM.pdf │ │ │ │ ├── Unified Training of Universal Time Series Forecasting Transformers.pdf │ │ │ │ ├── A Dynamic Algorithm for Weighted Submodular Cover Problem.pdf │ │ │ │ ├── Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution Learnability.pdf │ │ │ │ ├── SceneCraft:An LLM Agent for Synthesizing 3D Scenes as Blender Code.pdf │ │ │ │ ├── Doubly Robust Causal Effect Estimation under Networked Interference via Targeted Learning.pdf │ │ │ │ ├── Robust CLIP:Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models.pdf │ │ │ │ ├── Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks.pdf │ │ │ │ ├── Position:Technical Research and Talent is Needed for Effective AI Governance.pdf │ │ │ │ ├── Position:Opportunities Exist for Machine Learning in Magnetic Fusion Energy.pdf │ │ │ │ ├── Online Matching with Stochastic Rewards:Provable Better Bound via Adversarial Reinforcement Learning.pdf │ │ │ │ ├── How do Large Language Models Navigate Conflicts between Honesty and Helpfulness.pdf │ │ │ │ ├── Is DPO Superior to PPO for LLM Alignment A Comprehensive Study.pdf │ │ │ │ ├── Trained Random Forests Completely Reveal your Dataset.pdf │ │ │ │ ├── Rethinking Data Shapley for Data Selection Tasks:Misleads and Merits.pdf │ │ │ │ ├── Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term Experiments.pdf │ │ │ │ ├── Fast Co-Training under Weak Dependence via Stream-Based Active Learning.pdf │ │ │ │ ├── Learning Useful Representations of Recurrent Neural Network Weight Matrices.pdf │ │ │ │ ├── Bottleneck-Minimal Indexing for Generative Document Retrieval.pdf │ │ │ │ ├── I.O Complexity of Attention or How Optimal is FlashAttention.pdf │ │ │ │ ├── ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization.pdf │ │ │ │ ├── Position:Beyond Personhood:Agency, Accountability, and the Limits of Anthropomorphic Ethical Analysis.pdf │ │ │ │ ├── LoRA Training in the NTK Regime has No Spurious Local Minima.pdf │ │ ├── 100篇大模型必读论文 │ │ │ ├── Solving Quantitative Reasoning Problems with Language Models.pdf │ │ │ ├── A ConvNet for the 2020s..pdf │ │ │ ├── KERPLE Kernelized Relative Positional Embedding for Length Extrapolation.pdf │ │ │ ├── Emergent Abilities of Large Language Models.pdf │ │ │ ├── Red Teaming Language Models with Language Models.pdf │ │ │ ├── GET3D A Generative Model of High Quality 3D Textured Shapes Learned from Images.pdf │ │ │ ├── GLM-130B An Open Bilingual Pre-trained Model.pdf │ │ │ ├── Compositional character models for open vocabulary word representation.pdf │ │ │ ├── Efficient Estimation of Word Representation in Vector Space.pdf │ │ │ ├── Beyond the Imitation Game Quantifying and extrapolating the capabilities of language models.pdf │ │ │ ├── A Survey on Knowledge Graphs Representation, Acquisition, and Applications.pdf │ │ │ ├── Evaluating Large Language Models Trained on Code.pdf │ │ │ ├── Multi-Grained Vision Language Pre-Training Aligning Texts with Visual Concepts.pdf │ │ │ ├── When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations.pdf │ │ │ ├── OFA Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework..pdf │ │ │ ├── COLD A Benchmark for Chinese Offensive Language Detection.pdf │ │ │ ├── Language models generalize beyond natural proteins.pdf │ │ │ ├── High-Resolution Image Synthesis with Latent Diffusion Models.pdf │ │ │ ├── Fine-Tuning Language Models from Human Preferences.pdf │ │ │ ├── Imagen Video High Definition Video Generation with Diffusion Models.pdf │ │ │ ├── No Language Left Behind Scaling Human-Centered Machine Translation.pdf │ │ │ ├── Zero-Shot Video Question Answering via Frozen Bidirectional Language Models.pdf │ │ │ ├── Towards Efficient Post-training Quantization of Pre-trained Language Models.pdf │ │ │ ├── Retrieval Augmented Generation for.pdf │ │ │ ├── Reducing Activation Recomputation in Large Transformer Models.pdf │ │ │ ├── GPT Understands, Too.pdf │ │ │ ├── Transformer-Xl Attentive Language Models Beyond A Fixed-Length Context.pdf │ │ │ ├── InstructPix2Pix Learning to Follow Image Editing Instructions.pdf │ │ │ ├── PPT Pre-trained Prompt Tuning for Few-shot Learning.pdf │ │ │ ├── Generating Training Data with Language Models Towards Zero-Shot Language Understanding.pdf │ │ │ ├── SmoothQuant Accurate and Efficient Post-Training Quantization for Large Language Models.pdf │ │ │ ├── Tensor Programs V Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer.pdf │ │ │ ├── Hierarchical Text-Conditional Image Generation with CLIP Latents.pdf │ │ │ ├── Knowledgeable Prompt-tuning Incorporating Knowledge into Prompt Verbalizer for Text Classification.pdf │ │ │ ├── BLOOM A 176B-Parameter Open-Access Multilingual Language Model.pdf │ │ │ ├── SGM Sequence Generation Model for Multi-label Classification.pdf │ │ │ ├── Pre-train, Prompt, and Predict A Systematic Survey of Prompting Methods in Natural Language Processing.pdf │ │ │ ├── Improving Language Models by Retrieving from Trillions of Tokens.pdf │ │ │ ├── Learning Transferable Visual Models From Natural Language Supervision.pdf │ │ │ ├── BaGuaLu targeting brain scale pretrained models with over 37 million cores.pdf │ │ │ ├── Zero-Shot Text-to-Image Generation.pdf │ │ │ ├── CogView Mastering Text-to-Image Generation via Transformers.pdf │ │ │ ├── Training Language Models with Memory Augmentation.pdf │ │ │ ├── Denoising Diffusion Implicit Models.pdf │ │ │ ├── WebGPT Browser-assisted question-answering with human feedback.pdf │ │ │ ├── Fine-mixing Mitigating Backdoors in Fine-tuned Language Models.pdf │ │ │ ├── GPT-NeoX-20B An Open-Source Autoregressive Language Model.pdf │ │ │ ├── Character-level Convolutional Networks for Text Classification.pdf │ │ │ ├── Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners.pdf │ │ │ ├── FastMoE A Fast Mixture-of-Expert Training System.pdf │ │ │ ├── Autoformalization with Large Language Models.pdf │ │ │ ├── Evolutionary-scale prediction of atomic level protein structure with a language model.pdf │ │ │ ├── Score-Based Generative Modeling through Stochastic Differential Equations.pdf │ │ │ ├── ERNIE 3.0 Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.pdf │ │ │ ├── Versatile Diffusion Text, Images and Variations All in One Diffusion Model.pdf │ │ │ ├── Discrete mean estimates and the Landau-Siegel zero.pdf │ │ │ ├── Training Compute-Optimal Large Language Models.pdf │ │ │ ├── Video PreTraining (VPT) Learning to Act by Watching Unlabeled Online Videos.pdf │ │ │ ├── UnifiedSKG Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models.pdf │ │ │ ├── Foundation Transformers.pdf │ │ │ ├── Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.pdf │ │ │ ├── PAL Program-aided Language Models.pdf │ │ │ ├── GLM General Language Model Pretraining with Autoregressive Blank Infilling.pdf │ │ │ ├── Training language models to follow instructions with human feedback.pdf │ │ │ ├── Colossal-AI A Unified Deep Learning System For Large-Scale Parallel Training.pdf │ │ │ ├── Galactica A Large Language Model for Science.pdf │ │ │ ├── Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval.pdf │ │ │ ├── PaLM Scaling Language Modeling with Pathways.pdf │ │ │ ├── OPT Open Pre-trained Transformer Language Models.pdf │ │ │ ├── Few-shot Learning with Multilingual Language Models.pdf │ │ │ ├── UL2 Unifying Language Learning Paradigms.pdf │ │ │ ├── Prompt-and-Rerank A Method for Zero-Shot and Few-Shot Arbitrary Textual Style Transfer with Small Language Models.pdf │ │ │ ├── InternImage Exploring Large-Scale Vision Foundation Models with Deformable Convolutions.pdf │ │ │ ├── Sequence to Sequence Learning with Neural Networks.pdf │ │ │ ├── AltCLIP Altering the Language Encoder in CLIP for Extended Language Capabilities.pdf │ │ │ ├── Convolutional Neural Network for Sentence Classification.pdf │ │ │ ├── Character-Aware Neural Language Models.pdf │ │ │ ├── Holistic Evaluation of Language Models.pdf │ │ │ ├── CPM A large-scale generative Chinese Pre-trained language model.pdf │ │ │ ├── Language Models are Few-Shot Learners.pdf │ │ │ ├── DiffusionDet Diffusion Model for Object Detection.pdf │ │ │ ├── Improving language understanding by generative pre training.pdf │ │ │ ├── DeepSpeed Data Efficiency Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing.pdf │ │ │ ├── PaLI A Jointly-Scaled Multilingual Language-Image Model.pdf │ │ │ ├── Language Models are Unsupervised Multitask Learners.pdf │ │ │ ├── Git Re-Basin Merging Models modulo Permutation Symmetries.pdf │ │ │ ├── How Much Knowledge Can You Pack Into the Parameters of a Language Model.pdf │ │ │ ├── BLIP Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation..pdf │ │ │ ├── Muse Text-To-Image Generation via Masked Generative Transformers.pdf │ │ │ ├── The Stability-Efficiency Dilemma Investigating Sequence Length Warmup for Training GPT Models.pdf │ │ │ ├── Masked Autoencoders Are Scalable Vision Learners.pdf │ │ │ ├── A Survey on In-context Learning.pdf │ │ │ ├── An Image is Worth 16x16 Words Transformers for Image Recognition at Scale.pdf │ │ │ ├── Learning to summarize from human feedback.pdf │ │ │ ├── ERNIE 3.0 Titan Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.pdf │ │ │ ├── Language Models as Knowledge Bases.pdf │ │ │ ├── CodeGen An Open Large Language Model for Code with Multi-Turn Program Synthesis.pdf │ │ │ ├── LAION-5B An open large-scale dataset for training next generation image-text models.pdf │ │ │ ├── Generating Sequences With Recurrent Neural Networks.pdf │ │ │ ├── Language Models as Zero-Shot Planners Extracting Actionable Knowledge for Embodied Agents.pdf │ │ │ ├── Vision-Language Pre-Training with Triple Contrastive Learning.pdf │ │ │ ├── 01必读.jpg │ │ ├── EMNLP 19篇 │ │ │ ├── 自然语言生成的主动学习.pdf │ │ │ ├── 通过概念化来解释嵌入空间.pdf │ │ │ ├── IMTLab:用于构建、评估和诊断交互式机器翻译系统的开源平台.pdf │ │ │ ├── 驾驭灰色地带:不确定性和过度自信的表达如何影响语言模型.pdf │ │ │ ├── RAPL:一种用于少样本文档级关系提取的关系感知原型学习方法.pdf │ │ │ ├── 重新审视机器翻译的跨语言分类.pdf │ │ │ ├── 视觉、机器人技术及其他领域的语言基础.pdf │ │ │ ├── 通过对NLP领域学术写作的对比分析来解决语言偏见.pdf │ │ │ ├── 了解模型压缩对大型语言模型中社会偏见的影响.pdf │ │ │ ├── 凝聚力:生成文本连贯性的增量与整体评估的新基准.pdf │ │ │ ├── 用语言模型进行推理就是用世界模型进行规划.pdf │ │ │ ├── 使用大型语言模型进行可解释的心理健康分析.pdf │ │ │ ├── TopWORDS-Poetry:基于贝叶斯推理的中国古典诗歌同步文本分割和单词发现.pdf │ │ │ ├── 学习用于多模态失语症类型检测的共同语音手势.pdf │ │ │ ├── 具有 Wasserstein 独立性的公平文本分类.pdf │ │ │ ├── ROBBIE:大型生成语言模型的鲁棒偏差评估.pdf │ │ │ ├── 大型语言模型可以自我改进.pdf │ │ │ ├── SODA:具有社会常识语境化的百万级对话提炼.pdf │ │ │ ├── 混合倒挂索引是用于密集检索的鲁棒加速器.pdf │ │ ├── CVPR 2024 (持续更新) │ │ │ ├── 1 CVPR'24 获奖论文 │ │ │ │ ├── 4 最佳学生论文次优奖 │ │ │ │ │ ├── Objects as volumes: A stochastic geometry view of opaque solids.pdf │ │ │ │ │ ├── Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf │ │ │ │ ├── 3 最佳论文次优奖 │ │ │ │ │ ├── pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf │ │ │ │ ├── 2 最佳学生论文奖 │ │ │ │ │ ├── Mip-Splatting:Alias-free 3D Gaussian Splatting.pdf │ │ │ │ │ ├── BIOCLIP:A Vision Foundation Model for the Tree of Life.pdf │ │ │ │ ├── 1 最佳论文奖 │ │ │ │ │ ├── Generative Image Dynamics.pdf │ │ │ │ │ ├── Rich Human Feedback for Text-to-Image Generation.pdf │ │ │ ├── 3 CVPR'24 oral论文(更新完毕) │ │ │ │ ├── 10 自主导航和自我中心视觉 │ │ │ │ │ ├── EgoGen:An Egocentric Synthetic Data Generator.pdf │ │ │ │ │ ├── SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object Detection.pdf │ │ │ │ │ ├── UnO:Unsupervised Occupancy Fields for Perception and Forecasting.pdf │ │ │ │ ├── 15 低样本学习、自监督学习和半监督学习 │ │ │ │ │ ├── Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps.pdf │ │ │ │ │ ├── CroSel.pdf │ │ │ │ │ ├── LTGC:Long-tail Recognition via Leveraging LLMs-driven Generated Content.pdf │ │ │ │ ├── 13 数据集和评估 │ │ │ │ │ ├── 360+x:A Panoptic Multi-modal Scene Understanding Dataset.pdf │ │ │ │ │ ├── Deep Generative Model based Rate-Distortion for Image Downscaling Assessment.pdf │ │ │ │ │ ├── Ego-Exo4D:Understanding Skilled Human Activity from First- and Third-Person Perspectives.pdf │ │ │ │ ├── 12 动作和运动分析 │ │ │ │ │ ├── An N-Point Linear Solver for Line and Motion Estimation with Event Cameras.pdf │ │ │ │ │ ├── Modeling Multimodal Social Interactions:New Challenges and Baselines with Densely Aligned Representations.pdf │ │ │ │ │ ├── FineParser:A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment.pdf │ │ │ │ │ ├── RoHM:Robust Human Motion Reconstruction via Diffusio.pdf │ │ │ │ ├── 5 深度学习架构与技术 │ │ │ │ │ ├── Learning Structure-from-Motion with Graph Attention Networks.pdf │ │ │ │ │ ├── Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks.pdf │ │ │ │ │ ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf │ │ │ │ │ ├── Neural Lineage.pdf │ │ │ │ │ ├── Neural Redshift:Random Networks are not Random Functions.pdf │ │ │ │ ├── 7 单视角三维技术 │ │ │ │ │ ├── WALT3D:Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion.pdf │ │ │ │ │ ├── EscherNet:A Generative Model for Scalable View Synthesis.pdf │ │ │ │ │ ├── Rethinking Inductive Biases for Surface Normal Estimation.pdf │ │ │ │ ├── 17 图像与视频合成 2 │ │ │ │ │ ├── Visual Anagrams:Generating Multi-View Optical Illusions with Diffusion Models.pdf │ │ │ │ │ ├── Alchemist:Parametric Control of Material Properties with Diffusion Models.pdf │ │ │ │ │ ├── MonoHair:High-Fidelity Hair Modeling from a Monocular Video.pdf │ │ │ │ ├── 1 低层次视觉 │ │ │ │ │ ├── Towards Robust Event-guided Low-Light Image Enhancement.pdf │ │ │ │ │ ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf │ │ │ │ │ ├── Specularity Factorization for Low-Light Enhancement.pdf │ │ │ │ │ ├── FMA-Net:Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf │ │ │ │ │ ├── FlowIE:Efficient Image Enhancement via Rectified Flow.pdf │ │ │ │ ├── 4 图像与视频合成 │ │ │ │ │ ├── FreeU:Free Lunch in Diffusion U-Net.pdf │ │ │ │ │ ├── Attention Calibration for Disentangled Text-to-Image Personalization.pdf │ │ │ │ │ ├── Instruct-Imagen: Image Generation with Multi-modal Instruction.pdf │ │ │ │ │ ├── Ranni:Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf │ │ │ │ │ ├── Style Aligned Image Generation via Shared Attention.pdf │ │ │ │ ├── 18 多模态学习 │ │ │ │ │ ├── NoiseCLR:A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models.pdf │ │ │ │ │ ├── InternVL:Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.pdf │ │ │ │ │ ├── MetaCloak.pdf │ │ │ │ │ ├── Describing Differences in Image Sets with Natural Language.pdf │ │ │ │ ├── 6 多视角三维技术和传感器 │ │ │ │ │ ├── Point Transformer V3:Simpler Faster Stronger.pdf │ │ │ │ │ ├── Steerers:A Framework for Rotation Equivariant Keypoint Descriptors.pdf │ │ │ │ │ ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf │ │ │ │ │ ├── Seeing the World through Your Eyes.pdf │ │ │ │ │ ├── Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences.pdf │ │ │ │ ├── 14 多视角三维技术和传感器 2 │ │ │ │ │ ├── Learning to Produce Semi-dense Correspondences for Visual Localization.pdf │ │ │ │ ├── 3 人类行为和特征 │ │ │ │ │ ├── Semantic Human Mesh Reconstruction with Textures.pdf │ │ │ │ │ ├── Stratified Avatar Generation from Sparse Observations.pdf │ │ │ │ │ ├── MultiPly:Reconstruction of Multiple People from Monocular Video in the Wild.pdf │ │ │ │ │ ├── Relightable Gaussian Codec Avatars.pdf │ │ │ │ │ ├── URHand:Universal Relightable Hands.pdf │ │ │ │ ├── 16 低层次视觉与遥感 │ │ │ │ │ ├── DART:Implicit Doppler Tomography for Radar Novel View Synthesis.pdf │ │ │ │ │ ├── LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network.pdf │ │ │ │ ├── 8 视觉、语言与推理 │ │ │ │ │ ├── Eyes Wide Shut Exploring the Visual Shortcomings of Multimodal LLMs.pdf │ │ │ │ │ ├── Visual Program Distillation:Distilling Tools and Programmatic Reasoning into Vision-Language Models.pdf │ │ │ │ │ ├── LISA:Reasoning Segmentation via Large Language Model.pdf │ │ │ │ ├── 9 医学与物理视觉 │ │ │ │ │ ├── Transcriptomics-guided Slide Representation Learning in Computational Pathology.pdf │ │ │ │ ├── 11三维视觉 │ │ │ │ │ ├── A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion.pdf │ │ │ │ ├── 2 视觉与图形 │ │ │ │ │ ├── Eclipse:Disambiguating Illumination and Materials using Unintended Shadows.pdf │ │ │ │ │ ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf │ │ │ │ │ ├── DiffusionLight:Light Probes for Free by Painting a Chrome Ball.pdf │ │ │ ├── 4 CVPR'24 highlight论文(更新中) │ │ │ │ ├── Learning Structure-from-Motion with Graph Attention Networks.pdf │ │ │ │ ├── CFPL-FAS Class Free Prompt Learning for Generalizable Face Anti-spoofing.pdf │ │ │ │ ├── Efficient Deformable ConvNets Rethinking Dynamic and Sparse Operator for Vision Applications.pdf │ │ │ │ ├── Human Motion Prediction Under Unexpected Perturbation.pdf │ │ │ │ ├── XCube Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies.pdf │ │ │ │ ├── Boosting Neural Representations for Videos with a Conditional Decoder.pdf │ │ │ │ ├── Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations.pdf │ │ │ │ ├── Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.pdf │ │ │ │ ├── ODIN A Single Model for 2D and 3D Segmentation.pdf │ │ │ │ ├── LucidDreamer Towards High-Fidelity Text-to-3D Generation via Interval Score Matching.pdf │ │ │ │ ├── Ranni Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf │ │ │ │ ├── Point2CAD Reverse Engineering CAD Models from 3D Point Clouds.pdf │ │ │ │ ├── ViT-CoMer Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.pdf │ │ │ │ ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf │ │ │ │ ├── Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning.pdf │ │ │ │ ├── FinePOSE Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models.pdf │ │ │ │ ├── HOLD Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Vide.pdf │ │ │ │ ├── Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.pdf │ │ │ │ ├── Relightable and Animatable Neural Avatar from Sparse-View Video.pdf │ │ │ │ ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf │ │ │ │ ├── FMA-Net Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf │ │ │ │ ├── LocLLM Exploiting Generalizable Human Keypoint Localization via Large Language Model.pdf │ │ │ │ ├── DreamPropeller Supercharge Text-to-3D Generation with Parallel Sampling.pdf │ │ │ │ ├── Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes.pdf │ │ │ │ ├── Breathing Life Into Sketches Using Text-to-Video Priors.pdf │ │ │ │ ├── In-Context Matting.pdf │ │ │ │ ├── From Correspondences to Pose Non-minimal Certifiably Optimal Relative Pose without Disambiguation.pdf │ │ │ │ ├── Neural Redshift Random Networks are not Random Functions.pdf │ │ │ │ ├── 3D Human Pose Perception from Egocentric Stereo Videos.pdf │ │ │ │ ├── pix2gestalt Amodal Segmentation by Synthesizing Wholes.pdf │ │ │ │ ├── Frequency-Adaptive Dilated Convolution for Semantic Segmentation.pdf │ │ │ │ ├── HandDiff 3D Hand Pose Estimation with Diffusion on Image-Point Cloud.pdf │ │ │ │ ├── RAVE Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models.pdf │ │ │ │ ├── 4D-DRESS A 4D Dataset of Real-world Human Clothing with Semantic Annotations.pdf │ │ │ │ ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf │ │ │ │ ├── Real-Time Simulated Avatar from Head-Mounted Sensors.pdf │ │ │ │ ├── Tri-Modal Motion Retrieval by Learning a Joint Embedding Space.pdf │ │ │ │ ├── DiffusionLight Light Probes for Free by Painting a Chrome Ball.pdf │ │ │ │ ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf │ │ │ │ ├── FreeU Free Lunch in Diffusion U-Net.pdf │ │ │ │ ├── MMM Generative Masked Motion Model.pdf │ │ │ │ ├── Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis.pdf │ │ │ │ ├── Attention-Propagation Network for Egocentric Heatmap to 3D.pdf │ │ │ │ ├── GraCo Granularity-Controllable Interactive Segmentation.pdf │ │ │ │ ├── No Time to Train Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation.pdf │ │ │ │ ├── HashPoint Accelerated Point Searching and Sampling for Neural Rendering.pdf │ │ │ │ ├── CAD-SIGNet CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention.pdf │ │ │ │ ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf │ │ │ │ ├── Move as You Say, Interact as You Can Language-guided Human Motion Generation with Scene Affordance.pdf │ │ │ │ ├── Seeing the World through Your Eyes.pdf │ │ │ │ ├── Enforcing Geometric and Physical Priors.pdf │ │ │ │ ├── CAT-Seg Cost Aggregation for Open-Vocabulary Semantic Segmentation.pdf │ │ │ │ ├── Suppress and Rebalance Towards Generalized Multi-Modal Face Anti-Spoofing.pdf │ │ │ │ ├── Unbiased Estimator for Distorted Conics in Camera Calibration.pdf │ │ │ │ ├── Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation.pdf │ │ │ │ ├── 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation.pdf │ │ │ │ ├── Scaling Up Dynamic Human-Scene Interaction Modeling.pdf │ │ │ │ ├── General Object Foundation Model for Images and Videos at Scale.pdf │ │ │ │ ├── Putting the Object Back into Video Object Segmentation.pdf │ │ │ │ ├── Time-, Memory- and Parameter-Efficient Visual Adaptation.pdf │ │ │ │ ├── Towards Robust Event-guided Low-Light Image Enhancement A Large-Scale Real-World Event-Image Dataset and Novel Approach.pdf │ │ │ │ ├── GAvatar Animatable 3D Gaussian Avatars with Implicit Mesh Learning.pdf │ │ │ │ ├── EAGLE Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation.pdf │ │ │ │ ├── Point Transformer V3 Simpler, Faster, Stronger.pdf │ │ │ │ ├── CADTalk An Algorithm and Benchmark for Semantic Commenting of CAD Programs.pdf │ │ │ │ ├── Steerers A framework for rotation equivariant keypoint descriptors.pdf │ │ │ │ ├── PhysGaussian Physics-Integrated 3D Gaussians for Generative Dynamics.pdf │ │ │ │ ├── Specularity Factorization for Low-Light Enhancement.pdf │ │ │ │ ├── Objects as volumes A stochastic geometry view of opaque solids.pdf │ │ │ │ ├── LeGO Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example.pdf │ │ │ │ ├── Semantic-aware SAM for Point-Prompted Instance Segmentation.pdf │ │ │ │ ├── Restoration by Generation with Constrained Priors.pdf │ │ │ │ ├── Multi-view Aggregation Network for Dichotomous Image Segmentation.pdf │ │ │ │ ├── Fantastic Animals and Where to Find Them Segment Any Marine Animal with Dual SAM.pdf │ │ │ │ ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf │ │ │ │ ├── Self-Supervised Dual Contouring.pdf │ │ │ │ ├── NRDF Neural Riemannian Distance Fields for Learning Articulated Pose Priors.pdf │ │ │ │ ├── Matching 2D Images in 3D Metric Relative Pose from Metric Correspondences.pdf │ │ │ │ ├── Eclipse Disambiguating Illumination and Materials using Unintended Shadows.pdf │ │ │ ├── 2 CVPR'24 最佳论文提名(更新完毕) │ │ │ │ ├── 2 开源代码 │ │ │ │ │ ├── spider-match-main.zip │ │ │ │ │ ├── PlatoNeRF-main.zip │ │ │ │ │ ├── Registration-CorrMLP-master.zip │ │ │ │ │ ├── pixelsplat-main.zip │ │ │ │ │ ├── PaSCo-main.zip │ │ │ │ │ ├── NVlabs-edm2-main.zip │ │ │ │ │ ├── NeRF-HuGS-master.zip │ │ │ │ │ ├── MMMU-main.zip │ │ │ │ │ ├── Marigold-main.zip │ │ │ │ │ ├── MemSAM-main.zip │ │ │ │ │ ├── mip-splatting-main.zip │ │ │ │ │ ├── lambda_vit-main mlp.zip │ │ │ │ │ ├── MapUncertaintyPrediction-main.zip │ │ │ │ │ ├── egtr-main.zip │ │ │ │ │ ├── bioclip-main.zip │ │ │ │ ├── 1 提名论文 │ │ │ │ │ ├── 9 Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation.pdf │ │ │ │ │ ├── 8 PlatoNeRF 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar.pdf │ │ │ │ │ ├── 6 Producing and Leveraging Online Map Uncertainty in Trajectory Prediction.pdf │ │ │ │ │ ├── 7 PaSCo:Urban 3D Panoptic Scene Completion with Uncertainty Awareness.pdf │ │ │ │ │ ├── 5 Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration.pdf │ │ │ │ │ ├── 4 MMMU A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.pdf │ │ │ │ │ ├── 3 Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf │ │ │ │ │ ├── 2 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation.pdf │ │ │ │ │ ├── 19 EGTR:Extracting Graph from Transformer for Scene Graph Generation.pdf │ │ │ │ │ ├── 18 Analyzing and Improving the Training Dynamics of Diffusion Models.pdf │ │ │ │ │ ├── 17 Generative Image Dynamics.pdf │ │ │ │ │ ├── 16 MLPCanBeAGoodTransformer Learner.pdf │ │ │ │ │ ├── 14 Mip-Splatting:Alias-free 3D Gaussian Splatting.pdf │ │ │ │ │ ├── 15 pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf │ │ │ │ │ ├── 13 NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes.pdf │ │ │ │ │ ├── 12 Grounding and Enhancing Grid-based Models for Neural Fields.pdf │ │ │ │ │ ├── 11 BIOCLIP:A Vision Foundation Model for the Tree of Life.pdf │ │ │ │ │ ├── 10 Rich Human Feedback for Text-to-Image Generation.pdf │ │ │ │ │ ├── 1 Objects as volumes: A stochastic geometry view of opaque solids.pdf │ ├── 小黄搞AI大模型面试目录 │ │ ├── 小黄搞AI_大模型面试100问(PDF更新至90).pdf │ │ ├── 小黄搞AI_大模型面试100问(PDF更新至74).pdf │ │ ├── 小黄搞AI_大模型面试100问(PDF更新至107).pdf │ ├── 大模型书籍 │ │ ├── Mastering Transformers_ Build state-of-the-art models from -- .pdf │ │ ├── 预训练语言模型 2021 (邵浩 刘一烽) .pdf │ │ ├── BERT基础教程:Transformer大模型实战 (苏达哈尔桑·拉维昌迪兰) .azw3 │ │ ├── 自然语言处理:基于预训练模型的方法_2021.pdf │ │ ├── 精通Transformer:从零开始构建最先进的NLP模型_2023.epub │ │ ├── Mastering NLP from Foundations to LLMs_ Apply advanced.pdf │ │ ├── Building LLM Apps Create Intelligent Apps and Agents with Large Language Models_2024 .pdf │ │ ├── 大规模语言模型:从理论到实践_2023.pdf │ │ ├── 大语言模型_2024.pdf │ │ ├── HuggingFace自然语言处理详解:基于BERT中文模型的任务实战.epub │ │ ├── 大语言模型:基础与前沿_2024.epub │ │ ├── 面向开发者的 LLM 入门课.pdf │ │ ├── Transformer, BERT, and GPT:Including ChatGPT and Prompt Engineering_2024.pdf │ │ ├── 大语言模型:基础与前沿_2024.pdf │ │ ├── Transformers for Natural Language Processing Build, train, and fine-tune deep neural network architectures for NLP with... (--).pdf │ │ ├── 扩散模型从原理到实战.epub │ │ ├── Natural Language Processing with Transformers Building Language Applications with Hugging Face.pdf │ │ ├── 中国人工智能系列白皮书——大模型技术(2023 版).pdf │ │ ├── Transformers in Action (MEAP v7) _2024 .pdf │ │ ├── Transformers生成式AI实用指南(提前发售 GPT双语) _2023 .epub │ │ ├── 自然语言处理:原理、方法与应用.zip │ │ ├── HuggingFace自然语言处理详解:基于BERT中文模型的任务实战.pdf │ │ ├── Mastering Large Language Models Advanced techniques, applications, cutting-edge methods, and top LLMs_2024 .pdf │ │ ├── 自然语言处理导论 2023 张奇.pdf │ │ ├── Modern Generative AI with ChatGPT and OpenAI Models.pdf │ │ ├── Generative AI with LangChain_ Build large language model.pdf │ │ ├── BERT基础教程:Transformer大模型实战_2023.zip │ │ ├── 精通Transformer:从零开始构建最先进的NLP模型_2023.pdf │ │ ├── Getting Started with Google BERT_ Build and train .pdf │ │ ├── 自然语言处理:原理、方法与应用 2023 (王志立 雷鹏斌 吴宇凡) .epub │ │ ├── LLM Prompt Engineering For Developers The Art and Science of Unlocking LLMs True Potential_2024 .epub │ │ ├── Mastering Large Language Models Advanced techniques, applications, cutting-edge methods, and top LLMs_2024 .epub │ │ ├── Transformer自然语言处理实战:使用Hugging-Face-Transformers库构建NLP应用_2024.pdf │ ├── 面试八股文 │ │ ├── 大模型校招面试题.pdf │ │ ├── LLMs大模型面试问题和答案(97).pdf │ │ ├── 大模型常见面试题及解答1.pdf │ │ ├── 大模型 LLM 最全八股和答案.pdf │ │ ├── AI大模型面试题(102).pdf │ │ ├── 大模型岗位面试全纪录.pdf │ │ ├── 大模型常考面试题总结(含答案).pdf │ │ ├── 大模型常见面试题及解答2.pdf │ │ ├── 大模型LLMS.pdf │ │ ├── 从零开始大模型开发与微调基于PyTorch与ChatGLM.pdf │ │ ├── 大模型常见面试题3.pdf │ │ ├── 大模型落地应用案例集.pdf │ ├── 大模型面试题 │ │ ├── 大模型(LLMs)参数高效微调(PEFT)面 │ │ │ ├── 适配器微调(Adapter-tuning)篇.pdf │ │ │ ├── LoRA篇.pdf │ │ │ ├── 参数高效微调篇PRFT.pdf │ │ │ ├── 提示学习(Prompting)篇.pdf │ │ ├── 大模型(LLMs)langchain面 │ │ │ ├── 基于LLM+向量库的文档对话经验面.pdf │ │ │ ├── 大模型(LLMs)langchain面.pdf │ │ ├── 31-LLM-Interview-Plus │ │ │ ├── 大模型(LLMs)推理加速篇.pdf │ │ │ ├── 大模型(LLMs)Tokenizer篇.pdf │ │ │ ├── 多模态常见面试题.pdf │ │ │ ├── 大模型校招面试题.pdf │ │ │ ├── 大模型(LLMs)面试题答案Plus.pdf │ │ │ ├── 大模型(LLMs)蒸馏面.pdf │ │ │ ├── 大模型(LLMs)幻觉面.pdf │ │ │ ├── 大模型(LLMs)分布式训练面.pdf │ │ │ ├── 大模型(LLMs)显存问题面.pdf │ │ │ ├── 大模型 RAG 检索增强生成面.pdf │ │ │ ├── 大模型(LLMs)增量预训练篇.pdf │ │ ├── 大模型(LLMs)强化学习—— PPO 面.pdf │ │ ├── 大模型(LLMs)基础面.pdf │ │ ├── 大模型(LLMs)强化学习——RLHF及其变种面.pdf │ │ ├── 大模型(LLMs)训练集面.pdf │ │ ├── 大模型(LLMs)进阶面.pdf │ │ ├── 大模型(LLMs)评测面.pdf │ │ ├── 大模型(LLMs)agent 面.pdf │ │ ├── 大模型(LLMs)推理面.pdf │ │ ├── 大模型(LLMs)幻觉面.pdf │ │ ├── 大模型(LLMs)微调面.pdf