LLM Medical Q&A Chatbot Project Based on LangChain and Knowledge Graphs - 优优资源站

Resource directory:
├── LLM Medical Q&A Chatbot Project Based on LangChain and Knowledge Graphs
│   ├── Source code
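│   ├── LLM in Practice P25 LangChain - Adding Memory to an Agent.mp4
│   ├── LLM in Practice P11 LangChain - Prompt and LLMChain.mp4
│   ├── LLM in Practice P45 Q&A Chatbot Project Interview Key Points Summary.mp4
│   ├── LLM in Practice P36 Extracting Named-Entity Slots from User Questions.mp4
│   ├── LLM in Practice P37 CQL Slot Filling and Related-Question Filtering.mp4
│   ├── LLM in Practice P1 LangChain and Knowledge Graph Q&A Chatbot Project.mp4
│   ├── LLM in Practice P13 LangChain - FewShotPrompt.mp4
│   ├── LLM in Practice P41 Completing and Summarizing User Messages.mp4
│   ├── LLM in Practice P48 Quickly Integrating the Baichuan and Claude LLMs.mp4
│   ├── Neo4j in Practice P7-1 Installing the Neo4j Database Locally on Windows and Mac.mp4
│   ├── LLM in Practice P44 LangChain Framework Version Upgrade.mp4
│   ├── LLM in Practice P24 LangChain - Multi-Agent Collaboration.mp4
│   ├── LLM in Practice P12 LangChain - Multiple Parameters and LCEL.mp4
│   ├── LLM in Practice P32 Defining Environment Variables and the Model Getter Function.mp4
│   ├── LLM in Practice P47 A Method for Fixing Slow Agent Responses.mp4
│   ├── LLM in Practice P19 LangChain - FAISS Document Retrieval.mp4
│   ├── LLM in Practice P28 LangChain - GraphCypherQAChain.mp4
│   ├── LLM in Practice P31 Project LangChain Agent Architecture Overview.mp4
│   ├── LLM in Practice P40 Chaining Business-Logic Functions with an Agent.mp4
│   ├── LLM in Practice P43 Monitoring LLM Applications with LangSmith.mp4
│   ├── LLM in Practice P20 LangChain - Document Loading and Splitting.mp4
│   ├── LLM in Practice P30 Gradio - ChatInterface Chat UI.mp4
│   ├── LLM in Practice P15 LangChain - ConversationChain.mp4
│   ├── LLM in Practice P27 LangChain - Rewriting the Output Prompt.mp4
│   ├── LLM in Practice P8 Implementing Text Embeddings with the OpenAI API.mp4
│   ├── LLM in Practice P18 LangChain - Question-Answering QAChain.mp4
│   ├── LLM in Practice P9 Retrieving Similar Text with OpenAI Sentence Embeddings.mp4
│   ├── LLM in Practice P26 LangChain - Named Entity Recognition.mp4
│   ├── LLM in Practice P6 Token Counting for OpenAI API Calls.mp4
│   ├── LLM in Practice P46 Common Issue Fixes and Consolidated Q&A.mp4
│   ├── LLM in Practice P39 Answering Out-of-Database Questions with Google Search.mp4
│   ├── LLM in Practice P10 LangChain Introduction and First Look.mp4
│   ├── LLM in Practice P16 LangChain - Memory.mp4
│   ├── LLM in Practice P17 LangChain - LLMRequestsChain.mp4
│   ├── LLM in Practice P2 Overview of the Foundation and Project Courses.mp4
│   ├── LLM in Practice P21 LangChain - Document Retrieval Q&A.mp4
│   ├── LLM in Practice P7 Implementing Multi-Turn Conversation with the OpenAI API.mp4
│   ├── Neo4j in Practice P7-2 Installing the Neo4j Database Locally on Windows and Mac.mp4
│   ├── Medical Q&A P7 Importing CSV Files into the Neo4j Database.mp4
│   ├── LLM in Practice P22 LangChain - Saving and Loading Vectors.mp4
│   ├── LLM in Practice P5 Optimizing the OpenAI Chat API Code.mp4
│   ├── LLM in Practice P3 LLM Fundamentals and Course Preparation.mp4
│   ├── LLM in Practice P42 Modifying and Testing the Gradio Chat Window.mp4
│   ├── LLM in Practice P29 Gradio Introduction and First Look.mp4
│   ├── LLM in Practice P14 LangChain - SequentialChain.mp4
│   ├── LLM in Practice P38 Querying Neo4j to Answer Medical Questions.mp4
│   ├── LLM in Practice P35 Answering Company-Related Questions with Chroma Retrieval.mp4
│   ├── LLM in Practice P34 Answering Everyday Conversational Questions with a General-Purpose LLM.mp4
│   ├── LLM in Practice P33 Vectorizing and Storing Company Documents.mp4
│   ├── LLM in Practice P4 Basic Usage of the OpenAI Chat API.mp4
│   ├── LLM in Practice P23 LangChain - Agent and Custom Tool.mp4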
├── LLM Interview Notes and Books
│   ├── LLM Papers
│   │   ├── CVPR 2024 (best + oral + highlight) (continuously updated)
│   │   │   ├── 1 CVPR'24 Award-Winning Papers
│   │   │   │   ├── 4 Best Student Paper Runners-Up
│   │   │   │   │   ├── Objects as volumes： A stochastic geometry view of opaque solids.pdf
│   │   │   │   │   ├── Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf
│   │   │   │   ├── 2 Best Student Paper Award
│   │   │   │   │   ├── BIOCLIP：A Vision Foundation Model for the Tree of Life.pdf
│   │   │   │   │   ├── Mip-Splatting：Alias-free 3D Gaussian Splatting.pdf
│   │   │   │   ├── 3 Best Paper Runners-Up
│   │   │   │   │   ├── pixelSplat. 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction.pdf
│   │   │   │   ├── 1 Best Paper Award
│   │   │   │   │   ├── Rich Human Feedback for Text-to-Image Generation.pdf
│   │   │   │   │   ├── Generative Image Dynamics.pdf
│   │   │   ├── 3 CVPR'24 Oral Papers (complete)
│   │   │   │   ├── 18 Multimodal Learning
│   │   │   │   │   ├── Describing Differences in Image Sets with Natural Language.pdf
│   │   │   │   │   ├── NoiseCLR：A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models.pdf
│   │   │   │   │   ├── MetaCloak.pdf
│   │   │   │   │   ├── InternVL：Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.pdf
│   │   │   │   ├── 1 Low-Level Vision
│   │   │   │   │   ├── Specularity Factorization for Low-Light Enhancement.pdf
│   │   │   │   │   ├── FMA-Net：Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf
│   │   │   │   │   ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf
│   │   │   │   │   ├── FlowIE：Efficient Image Enhancement via Rectified Flow.pdf
│   │   │   │   │   ├── Towards Robust Event-guided Low-Light Image Enhancement.pdf
│   │   │   │   ├── 11 3D Vision
│   │   │   │   │   ├── A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion.pdf
│   │   │   │   ├── 16 Low-Level Vision and Remote Sensing
│   │   │   │   │   ├── DART：Implicit Doppler Tomography for Radar Novel View Synthesis.pdf
│   │   │   │   │   ├── LDP： Language-driven Dual-Pixel Image Defocus Deblurring Network.pdf
│   │   │   │   ├── 14 Multi-View 3D Techniques and Sensors 2
│   │   │   │   │   ├── Learning to Produce Semi-dense Correspondences for Visual Localization.pdf
│   │   │   │   ├── 15 Few-Shot, Self-Supervised, and Semi-Supervised Learning
│   │   │   │   │   ├── CroSel.pdf
│   │   │   │   │   ├── LTGC：Long-tail Recognition via Leveraging LLMs-driven Generated Content.pdf
│   │   │   │   │   ├── Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps.pdf
│   │   │   │   ├── 6 Multi-View 3D Techniques and Sensors
│   │   │   │   │   ├── Seeing the World through Your Eyes.pdf
│   │   │   │   │   ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf
│   │   │   │   │   ├── Steerers：A Framework for Rotation Equivariant Keypoint Descriptors.pdf
│   │   │   │   │   ├── Point Transformer V3：Simpler Faster Stronger.pdf
│   │   │   │   │   ├── Matching 2D Images in 3D： Metric Relative Pose from Metric Correspondences.pdf
│   │   │   │   ├── 5 Deep Learning Architectures and Techniques
│   │   │   │   │   ├── Neural Lineage.pdf
│   │   │   │   │   ├── Learning Structure-from-Motion with Graph Attention Networks.pdf
│   │   │   │   │   ├── Neural Redshift：Random Networks are not Random Functions.pdf
│   │   │   │   │   ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf
│   │   │   │   │   ├── Florence-2： Advancing a Unified Representation for a Variety of Vision Tasks.pdf
│   │   │   │   ├── 7 Single-View 3D Techniques
│   │   │   │   │   ├── WALT3D：Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion.pdf
│   │   │   │   │   ├── EscherNet：A Generative Model for Scalable View Synthesis.pdf
│   │   │   │   │   ├── Rethinking Inductive Biases for Surface Normal Estimation.pdf
│   │   │   │   ├── 10 Autonomous Navigation and Egocentric Vision
│   │   │   │   │   ├── SAFDNet： A Simple and Effective Network for Fully Sparse 3D Object Detection.pdf
│   │   │   │   │   ├── EgoGen：An Egocentric Synthetic Data Generator.pdf
│   │   │   │   │   ├── UnO：Unsupervised Occupancy Fields for Perception and Forecasting.pdf
│   │   │   │   ├── 3 Human Behavior and Characteristics
│   │   │   │   │   ├── Stratified Avatar Generation from Sparse Observations.pdf
│   │   │   │   │   ├── Semantic Human Mesh Reconstruction with Textures.pdf
│   │   │   │   │   ├── URHand：Universal Relightable Hands.pdf
│   │   │   │   │   ├── MultiPly：Reconstruction of Multiple People from Monocular Video in the Wild.pdf
│   │   │   │   │   ├── Relightable Gaussian Codec Avatars.pdf
│   │   │   │   ├── 2 Vision and Graphics
│   │   │   │   │   ├── Eclipse：Disambiguating Illumination and Materials using Unintended Shadows.pdf
│   │   │   │   │   ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf
│   │   │   │   │   ├── DiffusionLight：Light Probes for Free by Painting a Chrome Ball.pdf
│   │   │   │   ├── 9 Medical and Physics-Based Vision
│   │   │   │   │   ├── Transcriptomics-guided Slide Representation Learning in Computational Pathology.pdf
│   │   │   │   ├── 17 Image and Video Synthesis 2
│   │   │   │   │   ├── MonoHair：High-Fidelity Hair Modeling from a Monocular Video.pdf
│   │   │   │   │   ├── Alchemist：Parametric Control of Material Properties with Diffusion Models.pdf
│   │   │   │   │   ├── Visual Anagrams：Generating Multi-View Optical Illusions with Diffusion Models.pdf
│   │   │   │   ├── 8 Vision, Language, and Reasoning
│   │   │   │   │   ├── Visual Program Distillation：Distilling Tools and Programmatic Reasoning into Vision-Language Models.pdf
│   │   │   │   │   ├── LISA：Reasoning Segmentation via Large Language Model.pdf
│   │   │   │   │   ├── Eyes Wide Shut  Exploring the Visual Shortcomings of Multimodal LLMs.pdf
│   │   │   │   ├── 12 Action and Motion Analysis
│   │   │   │   │   ├── An N-Point Linear Solver for Line and Motion Estimation with Event Cameras.pdf
│   │   │   │   │   ├── FineParser：A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment.pdf
│   │   │   │   │   ├── Modeling Multimodal Social Interactions：New Challenges and Baselines with Densely Aligned Representations.pdf
│   │   │   │   │   ├── RoHM：Robust Human Motion Reconstruction via Diffusio.pdf
│   │   │   │   ├── 4 Image and Video Synthesis
│   │   │   │   │   ├── Ranni：Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf
│   │   │   │   │   ├── Attention Calibration for Disentangled Text-to-Image Personalization.pdf
│   │   │   │   │   ├── FreeU：Free Lunch in Diffusion U-Net.pdf
│   │   │   │   │   ├── Instruct-Imagen： Image Generation with Multi-modal Instruction.pdf
│   │   │   │   │   ├── Style Aligned Image Generation via Shared Attention.pdf
│   │   │   │   ├── 13 Datasets and Evaluation
│   │   │   │   │   ├── 360+x：A Panoptic Multi-modal Scene Understanding Dataset.pdf
│   │   │   │   │   ├── Deep Generative Model based Rate-Distortion for Image Downscaling Assessment.pdf
│   │   │   │   │   ├── Ego-Exo4D：Understanding Skilled Human Activity from First- and Third-Person Perspectives.pdf
│   │   │   ├── 4 CVPR'24 Highlight Papers (being updated)
│   │   │   │   ├── ODIN  A Single Model for 2D and 3D Segmentation.pdf
│   │   │   │   ├── Enforcing Geometric and Physical Priors.pdf
│   │   │   │   ├── Scaling Up Dynamic Human-Scene Interaction Modeling.pdf
│   │   │   │   ├── CADTalk An Algorithm and Benchmark for Semantic Commenting of CAD Programs.pdf
│   │   │   │   ├── LucidDreamer Towards High-Fidelity Text-to-3D Generation via Interval Score Matching.pdf
│   │   │   │   ├── pix2gestalt  Amodal Segmentation by Synthesizing Wholes.pdf
│   │   │   │   ├── Semantic-aware SAM for Point-Prompted Instance Segmentation.pdf
│   │   │   │   ├── Self-Supervised Dual Contouring.pdf
│   │   │   │   ├── Multi-view Aggregation Network for Dichotomous Image Segmentation.pdf
│   │   │   │   ├── From Correspondences to Pose  Non-minimal Certifiably Optimal Relative Pose without Disambiguation.pdf
│   │   │   │   ├── 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation.pdf
│   │   │   │   ├── Suppress and Rebalance  Towards Generalized Multi-Modal Face Anti-Spoofing.pdf
│   │   │   │   ├── GraCo Granularity-Controllable Interactive Segmentation.pdf
│   │   │   │   ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf
│   │   │   │   ├── RAVE  Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models.pdf
│   │   │   │   ├── DiffusionLight Light Probes for Free by Painting a Chrome Ball.pdf
│   │   │   │   ├── Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.pdf
│   │   │   │   ├── Towards Robust Event-guided Low-Light Image Enhancement  A Large-Scale Real-World Event-Image Dataset and Novel Approach.pdf
│   │   │   │   ├── Eclipse Disambiguating Illumination and Materials using Unintended Shadows.pdf
│   │   │   │   ├── Boosting Neural Representations for Videos with a Conditional Decoder.pdf
│   │   │   │   ├── Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation.pdf
│   │   │   │   ├── LocLLM  Exploiting Generalizable Human Keypoint Localization via Large Language Model.pdf
│   │   │   │   ├── HandDiff 3D Hand Pose Estimation with Diffusion on Image-Point Cloud.pdf
│   │   │   │   ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf
│   │   │   │   ├── ViT-CoMer Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.pdf
│   │   │   │   ├── NRDF Neural Riemannian Distance Fields for Learning Articulated Pose Priors.pdf
│   │   │   │   ├── Unbiased Estimator for Distorted Conics in Camera Calibration.pdf
│   │   │   │   ├── Restoration by Generation with Constrained Priors.pdf
│   │   │   │   ├── From Activation to Initialization  Scaling Insights for Optimizing Neural Fields.pdf
│   │   │   │   ├── Time-, Memory- and Parameter-Efficient Visual Adaptation.pdf
│   │   │   │   ├── FreeU Free Lunch in Diffusion U-Net.pdf
│   │   │   │   ├── EAGLE  Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation.pdf
│   │   │   │   ├── Human Motion Prediction Under Unexpected Perturbation.pdf
│   │   │   │   ├── XCube Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies.pdf
│   │   │   │   ├── Relightable and Animatable Neural Avatar from Sparse-View Video.pdf
│   │   │   │   ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf
│   │   │   │   ├── Breathing Life Into Sketches Using Text-to-Video Priors.pdf
│   │   │   │   ├── Efficient Deformable ConvNets  Rethinking Dynamic and Sparse Operator for Vision Applications.pdf
│   │   │   │   ├── Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.pdf
│   │   │   │   ├── HOLD  Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Vide.pdf
│   │   │   │   ├── DreamPropeller  Supercharge Text-to-3D Generation with Parallel Sampling.pdf
│   │   │   │   ├── Ranni Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf
│   │   │   │   ├── Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes.pdf
│   │   │   │   ├── Specularity Factorization for Low-Light Enhancement.pdf
│   │   │   │   ├── Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis.pdf
│   │   │   │   ├── HashPoint Accelerated Point Searching and Sampling for Neural Rendering.pdf
│   │   │   │   ├── 3D Human Pose Perception from Egocentric Stereo Videos.pdf
│   │   │   │   ├── Tri-Modal Motion Retrieval by Learning a Joint Embedding Space.pdf
│   │   │   │   ├── Learning Structure-from-Motion with Graph Attention Networks.pdf
│   │   │   │   ├── Real-Time Simulated Avatar from Head-Mounted Sensors.pdf
│   │   │   │   ├── Frequency-Adaptive Dilated Convolution for Semantic Segmentation.pdf
│   │   │   │   ├── Move as You Say, Interact as You Can  Language-guided Human Motion Generation with Scene Affordance.pdf
│   │   │   │   ├── FinePOSE Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models.pdf
│   │   │   │   ├── 4D-DRESS A 4D Dataset of Real-world Human Clothing with Semantic Annotations.pdf
│   │   │   │   ├── PhysGaussian  Physics-Integrated 3D Gaussians for Generative Dynamics.pdf
│   │   │   │   ├── GAvatar Animatable 3D Gaussian Avatars with Implicit Mesh Learning.pdf
│   │   │   │   ├── Fantastic Animals and Where to Find Them Segment Any Marine Animal with Dual SAM.pdf
│   │   │   │   ├── General Object Foundation Model for Images and Videos at Scale.pdf
│   │   │   │   ├── FMA-Net Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf
│   │   │   │   ├── Objects as volumes  A stochastic geometry view of opaque solids.pdf
│   │   │   │   ├── Point Transformer V3 Simpler, Faster, Stronger.pdf
│   │   │   │   ├── CFPL-FAS Class Free Prompt Learning for Generalizable Face Anti-spoofing.pdf
│   │   │   │   ├── Seeing the World through Your Eyes.pdf
│   │   │   │   ├── Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning.pdf
│   │   │   │   ├── Steerers  A framework for rotation equivariant keypoint descriptors.pdf
│   │   │   │   ├── In-Context Matting.pdf
│   │   │   │   ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf
│   │   │   │   ├── Matching 2D Images in 3D  Metric Relative Pose from Metric Correspondences.pdf
│   │   │   │   ├── Point2CAD  Reverse Engineering CAD Models from 3D Point Clouds.pdf
│   │   │   │   ├── Putting the Object Back into Video Object Segmentation.pdf
│   │   │   │   ├── MMM  Generative Masked Motion Model.pdf
│   │   │   │   ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf
│   │   │   │   ├── CAT-Seg  Cost Aggregation for Open-Vocabulary Semantic Segmentation.pdf
│   │   │   │   ├── Neural Redshift  Random Networks are not Random Functions.pdf
│   │   │   │   ├── Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations.pdf
│   │   │   │   ├── No Time to Train  Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation.pdf
│   │   │   │   ├── LeGO  Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example.pdf
│   │   │   │   ├── Attention-Propagation Network for Egocentric Heatmap to 3D.pdf
│   │   │   │   ├── CAD-SIGNet CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention.pdf
│   │   │   ├── 2 CVPR'24 Best Paper Nominees (complete)
│   │   │   │   ├── 2 Open-Source Code
│   │   │   │   │   ├── Marigold-main.zip
│   │   │   │   │   ├── egtr-main.zip
│   │   │   │   │   ├── pixelsplat-main.zip
│   │   │   │   │   ├── mip-splatting-main.zip
│   │   │   │   │   ├── lambda_vit-main mlp.zip
│   │   │   │   │   ├── Registration-CorrMLP-master.zip
│   │   │   │   │   ├── PlatoNeRF-main.zip
│   │   │   │   │   ├── NVlabs-edm2-main.zip
│   │   │   │   │   ├── MemSAM-main.zip
│   │   │   │   │   ├── PaSCo-main.zip
│   │   │   │   │   ├── MMMU-main.zip
│   │   │   │   │   ├── bioclip-main.zip
│   │   │   │   │   ├── MapUncertaintyPrediction-main.zip
│   │   │   │   │   ├── NeRF-HuGS-master.zip
│   │   │   │   │   ├── spider-match-main.zip
│   │   │   │   ├── 1 Nominated Papers
│   │   │   │   │   ├── 19 EGTR：Extracting Graph from Transformer for Scene Graph Generation.pdf
│   │   │   │   │   ├── 12 Grounding and Enhancing Grid-based Models for Neural Fields.pdf
│   │   │   │   │   ├── 2 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation.pdf
│   │   │   │   │   ├── 4 MMMU  A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.pdf
│   │   │   │   │   ├── 14 Mip-Splatting：Alias-free 3D Gaussian Splatting.pdf
│   │   │   │   │   ├── 11 BIOCLIP：A Vision Foundation Model for the Tree of Life.pdf
│   │   │   │   │   ├── 15 pixelSplat. 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction.pdf
│   │   │   │   │   ├── 13 NeRF-HuGS： Improved Neural Radiance Fields in Non-static Scenes.pdf
│   │   │   │   │   ├── 1 Objects as volumes： A stochastic geometry view of opaque solids.pdf
│   │   │   │   │   ├── 18 Analyzing and Improving the Training Dynamics of Diffusion Models.pdf
│   │   │   │   │   ├── 8 PlatoNeRF 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar.pdf
│   │   │   │   │   ├── 16 MLP Can Be A Good Transformer Learner.pdf
│   │   │   │   │   ├── 5 Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration.pdf
│   │   │   │   │   ├── 9 Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation.pdf
│   │   │   │   │   ├── 6 Producing and Leveraging Online Map Uncertainty in Trajectory Prediction.pdf
│   │   │   │   │   ├── 3 Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf
│   │   │   │   │   ├── 10 Rich Human Feedback for Text-to-Image Generation.pdf
│   │   │   │   │   ├── 17 Generative Image Dynamics.pdf
│   │   │   │   │   ├── 7 PaSCo：Urban 3D Panoptic Scene Completion with Uncertainty Awareness.pdf
│   │   ├── 50 Must-Read Papers on LLM Prompt Engineering
│   │   │   ├── Prompting in Autoregressive Large Language.pdf
│   │   │   ├── Exploring Visual Prompts for Adapting Large-Scale Models.pdf
│   │   │   ├── Large Language Models Understand and Can Be Enhanced by Emotional Stimuli.pdf
│   │   │   ├── LPML  LLM-PROMPTING MARKUP LANGUAGE FOR.pdf
│   │   │   ├── Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.pdf
│   │   │   ├── Joint Prompt Optimization of Stacked LLMs.pdf
│   │   │   ├── Contrastive Chain-of-Thought Prompting.pdf
│   │   │   ├── TAKE A STEP BACK- EVOKING REASONING VIA ABSTRACTION IN LARGE LANGUAGE MODELS.pdf
│   │   │   ├── Reprompting  Automated Chain-of-Thought Prompt.pdf
│   │   │   ├── Program of Thoughts Prompting- Disentangling Computation from Reasoning for Numerical Reasoning Tasks.pdf
│   │   │   ├── LARGE LANGUAGE MODELS AS TOOL MAKERS.pdf
│   │   │   ├── A Systematic Survey of Prompt Engineering in Large Language Models- Techniques and Applications.pdf
│   │   │   ├── Rephrase and Respond- Let Large Language Models Ask Better Questions for Themselves.pdf
│   │   │   ├── CHAIN-OF-NOTE- ENHANCING ROBUSTNESS IN RETRIEVAL-AUGMENTED LANGUAGE MODELS.pdf
│   │   │   ├── PROMPTBREEDER.pdf
│   │   │   ├── Prompt Engineering Through the Lens of Optimal.pdf
│   │   │   ├── Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
│   │   │   ├── SELF-CONSISTENCY IMPROVES CHAIN OF THOUGHT REASONING IN LANGUAGE MODELS.pdf
│   │   │   ├── Prompting Is Programming  A Query Language for.pdf
│   │   │   ├── Chain of Code- Reasoning with a Language Model-Augmented Code Emulator.pdf
│   │   │   ├── ART- Automatic multi-step reasoning and tool-use for large language models.pdf
│   │   │   ├── Visual ChatGPT- Talking, Drawing and Editing with Visual Foundation Models.pdf
│   │   │   ├── Structured Chain-of-Thought Prompting for Code Generation.pdf
│   │   │   ├── Unleashing the potential of prompt engineering in Large Language Models- a comprehensive review.pdf
│   │   │   ├── Active Prompting with Chain-of-Thought for Large Language Models.pdf
│   │   │   ├── CHAIN-OF-SYMBOL PROMPTING FOR SPATIAL RELATIONSHIPS IN LARGE LANGUAGE MODELS.pdf
│   │   │   ├── Language Models are Few-Shot Learners.pdf
│   │   │   ├── Thread of Thought Unraveling Chaotic Contexts.pdf
│   │   │   ├── Pre-train, Prompt, and Predict- A Systematic Survey of Prompting Methods in Natural Language Processing.pdf
│   │   │   ├── Chain of Code  Reasoning with.pdf
│   │   │   ├── REACT- SYNERGIZING REASONING AND ACTING IN LANGUAGE MODELS.pdf
│   │   │   ├── CHAIN-OF-VERIFICATION REDUCES HALLUCINATION IN LARGE LANGUAGE MODELS.pdf
│   │   │   ├── Large Language Model Guided Tree-of-Thought.pdf
│   │   │   ├── CHAIN-OF-KNOWLEDGE- GROUNDING LARGE LANGUAGE MODELS VIA DYNAMIC KNOWLEDGE ADAPTING OVER HETEROGENEOUS SOURCES.pdf
│   │   │   ├── System 2 Attention (is something you might need too).pdf
│   │   │   ├── UPAR  A KANTIAN-INSPIRED PROMPTING FRAME.pdf
│   │   │   ├── A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.pdf
│   │   │   ├── CHAIN-OF-TABLE- EVOLVING TABLES IN THE REASONING CHAIN FOR TABLE UNDERSTANDING.pdf
│   │   │   ├── OlaGPT Empowering LLMs With Human-like Problem-Solving.pdf
│   │   │   ├── A Systematic Survey of Prompt Engineering in Large Language Models Techniques and Applications.pdf
│   │   │   ├── Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models.pdf
│   │   │   ├── Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic.pdf
│   │   │   ├── Boosting Logical Reasoning in Large Language Models through a New.pdf
│   │   │   ├── SHOW YOUR WORK- SCRATCHPADS FOR INTERMEDIATE COMPUTATION WITH LANGUAGE MODELS.pdf
│   │   │   ├── IMPLICIT CHAIN OF THOUGHT REASONING.pdf
│   │   │   ├── Tree of Thoughts- Deliberate Problem Solving with Large Language Models.pdf
│   │   │   ├── A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models.pdf
│   │   │   ├── LARGE LANGUAGE MODELS ARE HUMAN-LEVEL PROMPT ENGINEERS.pdf
│   │   │   ├── AUTOMATIC CHAIN OF THOUGHT PROMPTING IN LARGE LANGUAGE MODELS.pdf
│   │   │   ├── LARGE LANGUAGE MODELS AS OPTIMIZERS.pdf
│   │   ├── ICLR 2024 (being updated)
│   │   │   ├── The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation.pdf
│   │   │   ├── Memory Efficient Optimizers with 4-bit States.pdf
│   │   │   ├── Language Is Not All You Need：Aligning Perception with Language Models.pdf
│   │   │   ├── Is Your Code Generated by ChatGPT Really Correct Rigorous Evaluation of Large Language Models for Code Generation.pdf
│   │   │   ├── Fine-Tuning Language Models with Just Forward Passes.pdf
│   │   │   ├── Hierarchical Integration Diffusion Model for Realistic Image Deblurring.pdf
│   │   │   ├── Textually Pretrained Speech Language Models.pdf
│   │   │   ├── VisionLLM：Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks.pdf
│   │   │   ├── Cappy：Outperforming and Boosting Large Multi-Task LMs with a Small Scorer.pdf
│   │   │   ├── One-2-3-45：Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization.pdf
│   │   │   ├── Direct Preference Optimization：Your Language Model is Secretly a Reward Model.pdf
│   │   │   ├── SimMTM：A Simple Pre-Training Framework for Masked Time-Series Modeling.pdf
│   │   │   ├── ProPILE：Probing Privacy Leakage in Large Language Models.pdf
│   │   │   ├── SnapFusion：Text-to-Image Diffusion Model on Mobile Devices within Two Seconds.pdf
│   │   │   ├── Efficient Diffusion Policies for Offline Reinforcement Learning.pdf
│   │   │   ├── Focused Transformer：Contrastive Training for Context Scaling.pdf
│   │   │   ├── LayoutPrompter：Awaken the Design Ability of Large Language Models.pdf
│   │   │   ├── Segment Everything Everywhere All at Once.pdf
│   │   │   ├── RAPHAEL：Text-to-Image Generation via Large Mixture of Diffusion Paths.pdf
│   │   │   ├── Towards Revealing the Mystery behind Chain of Thought：a Theoretical Perspective.pdf
│   │   │   ├── Elastic Decision Transformer.pdf
│   │   │   ├── Training Transformers with 4-bit Integers.pdf
│   │   │   ├── In-Context Impersonation Reveals Large Language Models' Strengths and Biases.pdf
│   │   │   ├── DaTaSeg：Taming a Universal Multi-Dataset Multi-Task Segmentation Model.pdf
│   │   │   ├── How to Turn Your Knowledge Graph Embeddings into Generative Models.pdf
│   │   │   ├── EvoPrompting：Language Models for Code-Level Neural Architecture Search.pdf
│   │   │   ├── Learning to Tokenize for Generative Retrieval.pdf
│   │   │   ├── VanillaNet：the Power of Minimalism in Deep Learning.pdf
│   │   │   ├── Unlimiformer：Long-Range Transformers with Unlimited Length Input.pdf
│   │   │   ├── RRHF：Rank Responses to Align Language Models with Human Feedback without tears.pdf
│   │   │   ├── Language Models Meet World Models：Embodied Experiences Enhance Language Models.pdf
│   │   │   ├── Does Graph Distillation See Like Vision Dataset Counterpart.pdf
│   │   │   ├── Stable and low-precision training for large-scale vision-language models.pdf
│   │   │   ├── Towards Label Position Bias in Graph Neural Networks.pdf
│   │   │   ├── Guiding Large Language Models via Directional Stimulus Prompting.pdf
│   │   │   ├── Bridging Discrete and Backpropagation：Straight-Through and Beyond.pdf
│   │   │   ├── Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization.pdf
│   │   │   ├── Foundation Model is Efficient Multimodal Multitask Model Selector.pdf
│   │   │   ├── Scaling Data-Constrained Language Models.pdf
│   │   │   ├── Differentiable Blocks World：Qualitative 3D Decomposition by Rendering Primitives.pdf
│   │   │   ├── MVDiffusion：Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion.pdf
│   │   │   ├── Chameleon：Plug-and-Play Compositional Reasoning with Large Language Models.pdf
│   │   │   ├── Vision-Flan：Scaling Human-Labeled Tasks in Visual Instruction Tuning.pdf
│   │   │   ├── MarioGPT：Open-Ended Text2Level Generation through Large Language Models.pdf
│   │   │   ├── Recommender Systems with Generative Retrieval.pdf
│   │   │   ├── AlpacaFarm：A Simulation Framework for Methods that Learn from Human Feedback.pdf
│   │   │   ├── Grammar Prompting for Domain-Specific Language Generation with Large Language Models.pdf
│   │   │   ├── QLoRA：Efficient Finetuning of Quantized LLMs.pdf
│   │   │   ├── Can Language Models Solve Graph Problems in Natural Language.pdf
│   │   │   ├── DPM-Solver-v3：Improved Diffusion ODE Solver with Empirical Model Statistics.pdf
│   │   │   ├── 3D-LLM：Injecting the 3D World into Large Language Models.pdf
│   │   │   ├── ToolkenGPT：Augmenting Frozen Language Models with Massive Tools via Tool Embeddings.pdf
│   │   │   ├── HuggingGPT：Solving AI Tasks with ChatGPT and its Friends in HuggingFace.pdf
│   │   │   ├── Sample-efficient Multi-objective Molecular Optimization with GFlowNets.pdf
│   │   │   ├── Tailoring Self-Attention for Graph via Rooted Subtrees.pdf
│   │   │   ├── SheetCopilot：Bringing Software Productivity to the Next Level through Large Language Models.pdf
│   │   │   ├── MotionGPT：Human Motion as a Foreign Language.pdf
│   │   │   ├── Fine-Grained Human Feedback Gives Better Rewards for Language Model Training.pdf
│   │   │   ├── Learning Large Graph Property Prediction via Graph Segment Training.pdf
│   │   │   ├── White-Box Transformers via Sparse Rate Reduction.pdf
│   │   │   ├── Meta In-Context Learning：Harnessing Large Language Models for Electrical Data Classification.pdf
│   │   │   ├── Deductive Verification of Chain-of-Thought Reasoning.pdf
│   │   │   ├── Fairness-guided Few-shot Prompting for Large Language Models.pdf
│   │   │   ├── No Train No Gain：Revisiting Efficient Training Algorithms For Transformer-based Language Models.pdf
│   │   │   ├── ImageReward：Learning and Evaluating Human Preferences for Text-to-Image Generation.pdf
│   │   │   ├── Are aligned neural networks adversarially aligned.pdf
│   │   │   ├── Convolutions Die Hard：Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP.pdf
│   │   │   ├── Large Language Models of Code Fail at Completing Code with Potential Bugs.pdf
│   │   │   ├── A Decomposable Causal View of Compositional Zero-Shot Learning.pdf
│   │   │   ├── HyenaDNA：Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution.pdf
│   │   │   ├── Tree of Thoughts：Deliberate Problem Solving with Large Language Models.pdf
│   │   │   ├── LIMA：Less Is More for Alignment.pdf
│   │   │   ├── Improving CLIP Training with Language Rewrites.pdf
│   │   │   ├── Language models are weak learners.pdf
│   │   │   ├── Reverse Engineering Self-Supervised Learning.pdf
│   │   │   ├── ProlificDreamer：High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation.pdf
│   │   │   ├── Large Language Models as Commonsense Knowledge for Large-Scale Task Planning.pdf
│   │   │   ├── AR-Diffusion：Auto-Regressive Diffusion Model for Text Generation.pdf
│   │   │   ├── Reflexion：language agents with verbal reinforcement learning.pdf
│   │   │   ├── Symbolic Discovery of Optimization Algorithms.pdf
│   │   │   ├── Language Models Don't Always Say What They Think：Unfaithful Explanations in Chain-of-Thought Prompting.pdf
│   │   │   ├── InstructBLIP：Towards General-purpose Vision-Language Models with Instruction Tuning.pdf
│   │   │   ├── Cheap and Quick：Efficient Vision-Language Instruction Tuning for Large Language Models.pdf
│   │   │   ├── Inference-Time Intervention：Eliciting Truthful Answers from a Language Model.pdf
│   │   │   ├── DoReMi：Optimizing Data Mixtures Speeds Up Language Model Pretraining.pdf
│   │   │   ├── Toolformer：Language Models Can Teach Themselves to Use Tools.pdf
│   │   │   ├── Transformers learn through gradual rank increase.pdf
│   │   │   ├── Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning.pdf
│   │   │   ├── GPT4Tools：Teaching Large Language Model to Use Tools via Self-instruction.pdf
│   │   │   ├── STEVE-1：A Generative Model for Text-to-Behavior in Minecraft.pdf
│   │   │   ├── Self-Refine：Iterative Refinement with Self-Feedback.pdf
│   │   │   ├── Are Emergent Abilities of Large Language Models a Mirage.pdf
│   │   │   ├── Augmenting Language Models with Long-Term Memory.pdf
│   │   │   ├── UniControl：A Unified Diffusion Model for Controllable Visual Generation In the Wild.pdf
│   │   │   ├── DiffComplete：Diffusion-based Generative 3D Shape Completion.pdf
│   │   │   ├── Any-to-Any Generation via Composable Diffusion.pdf
│   │   │   ├── SANeRF-HQ：Segment Anything for NeRF in High Quality.pdf
│   │   │   ├── Voicebox：Text-Guided Multilingual Universal Speech Generation at Scale.pdf
│   │   │   ├── MEGABYTE：Predicting Million-byte Sequences with Multiscale Transformers.pdf
│   │   │   ├── VisorGPT：Learning Visual Prior via Generative Pre-Training.pdf
│   │   │   ├── Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition.pdf
│   │   │   ├── Simple and Controllable Music Generation.pdf
│   │   │   ├── Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models.pdf
│   │   │   ├── Flocks of Stochastic Parrots：Differentially Private Prompt Learning for Large Language Models.pdf
│   │   │   ├── SwiftSage：A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks.pdf
│   │   │   ├── EmbodiedGPT：Vision-Language Pre-Training via Embodied Chain of Thought.pdf
│   │   ├── 20篇llm必读
│   │   │   ├── AWQ Activation-aware Weight Quantization.pdf
│   │   │   ├── The Internal State of an LLM Knows When It’s Lying.pdf
│   │   │   ├── OpenAGI When LLM Meets Domain Experts.pdf
│   │   │   ├── X-LLM.pdf
│   │   │   ├── Wider and Deeper LLM Networks.pdf
│   │   │   ├── Judging LLM-as-a-Judge.pdf
│   │   │   ├── Jailbroken How Does LLM Safety Training Fail.pdf
│   │   │   ├── Can LLM Already Serve as A Database Interface.pdf
│   │   │   ├── LLM-grounded Diffusion Enhancing Prompt Understanding of.pdf
│   │   │   ├── Why Johnny Can’t Prompt.pdf
│   │   │   ├── NExT-GPT Any-to-Any Multimodal LLM.pdf
│   │   │   ├── Large Language Models are Few-shot Testers.pdf
│   │   │   ├── AutoGen Enabling Next-Gen LLM.pdf
│   │   │   ├── Song_LLM-Planner_Few-Shot_Grounded_Planning_for_Embodied_Agents_with_Large_Language_ICCV_2023_paper.pdf
│   │   │   ├── CHATEVAL TOWARDS BETTER LLM-BASED EVALUATORS THROUGH MULTI-AGENT DEBATE.pdf
│   │   │   ├── Large language models (LLM) and ChatGPT what will the impact.pdf
│   │   │   ├── LLM-Pruner On the Structural Pruning.pdf
│   │   │   ├── The RefinedWeb Dataset for Falcon LLM.pdf
│   │   │   ├── LLM-Blender Ensembling Large Language Models.pdf
│   │   │   ├── LLM-Adapters An Adapter Family for Parameter-Efficient Fine-Tuning of.pdf
│   │   ├── ICLR 2024
│   │   │   ├── 【时间检验奖】Auto-Encoding Variational Bayes.pdf
│   │   ├── AAAI 2024 111篇
│   │   │   ├── Parallel Ranking of Ads and Creative Services for Real-time.pdf
│   │   │   ├── AT4CTR Auxiliary Match Tasks for Enhancing Click-Through Rate Prediction.pdf
│   │   │   ├── Upper Bounding Barlow Twins：A Novel Filter for Multi-relational.pdf
│   │   │   ├── Non-Excludable Bilateral Trade Between Groups.pdf
│   │   │   ├── Identification of Causal Structure in the Presence of Missing Data with Additive.pdf
│   │   │   ├── Few-shot Part Segmentation Reveals Compositional Logic for Industrial.pdf
│   │   │   ├── Learning Human-like Representations to Enable Learning Human Values.pdf
│   │   │   ├── OVD-Explorer：Optimism Should Not Be the Sole Pursuit of Exploration.pdf
│   │   │   ├── Federated Learning with Extremely Noisy Clients via Negative Distillation.pdf
│   │   │   ├── EarthVQA：Towards Queryable Earth via Relational Reasoning-Based Remote.pdf
│   │   │   ├── MDGNN：Multi-Relational Dynamic Graph Neural Network for Comprehensive and Dynamic Stock Investment Prediction.pdf
│   │   │   ├── Towards Fairness in Online Service with k Servers and its Application.pdf
│   │   │   ├── Unified framework for diffusion generative models in SO(3).pdf
│   │   │   ├── Text2Analysis：A Benchmark of Table Question Answering with Advanced.pdf
│   │   │   ├── Spectral-based Graph Neutral Networks for Complementary Item.pdf
│   │   │   ├── ECHO-GL Earnings Calls-Driven Heterogeneous Graph Learning for Stock.pdf
│   │   │   ├── Point Cloud Part Editing：Segmentation, Generation, Assembly, and.pdf
│   │   │   ├── IS-DARTS：Stabilizing DARTS through Precise Measurement.pdf
│   │   │   ├── Robust Active Measuring under Model Uncertainty.pdf
│   │   │   ├── MASTER：Market-Guided Stock Transformer for Stock Price Forecasting.pdf
│   │   │   ├── Provably Convergent Federated Trilevel Learning.pdf
│   │   │   ├── Exploring Gradient Explosion in Generative Adversarial Imitation.pdf
│   │   │   ├── AE-NeRF：Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis.pdf
│   │   │   ├── Learning Fair Policies for Multi-stage Problem Solving from.pdf
│   │   │   ├── AI-Based Energy Transportation Safety：Pipeline Radial Threat.pdf
│   │   │   ├── EFFECT SIZE ESTIMATION FOR DURATION RECOMMENDATION.pdf
│   │   │   ├── When Model Meets New Normals：Test-Time Adaptation for Unsupervised.pdf
│   │   │   ├── Fluctuation-based Adaptive Structured Pruning for Large Language.pdf
│   │   │   ├── ContraNovo：A Contrastive Learning Approach to Enhance De Novo Peptide.pdf
│   │   │   ├── CR-SAM： Curvature Regularized Sharpness-aware Minimization.pdf
│   │   │   ├── HuTuMotion：Human-Tuned Motion of Latent Motion Diffusions with.pdf
│   │   │   ├── Enhancing Job Recommendation through.pdf
│   │   │   ├── H-ensemble： An Information Theoretic Approach to Reliable Few-Shot.pdf
│   │   │   ├── Temporally and Distributionally Robust Optimization for Cold-start.pdf
│   │   │   ├── Structure-CLIP：Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations.pdf
│   │   │   ├── Probabilistic Offline Policy Ranking with Approximate Bayesian.pdf
│   │   │   ├── Foreseeing Reconstruction Quality of Gradient Inversion.pdf
│   │   │   ├── Successive POI Recommendation via Brain-inspired Spatiotemporal Aware Representation.pdf
│   │   │   ├── No More Shortcuts：Realizing the Potential of Temporal Self-Supervision.pdf
│   │   │   ├── PPEA-Depth：Progressive Parameter-efficient Adaptation for.pdf
│   │   │   ├── FedDiv：Collaborative Noise Filtering for Federated Learning with Noisy Labels.pdf
│   │   │   ├── Cached Transformers：Improving Transformers with Differentiable Memory.pdf
│   │   │   ├── Market-GAN Adding Control to Financial Market Data Generation with.pdf
│   │   │   ├── CORECODE： A Common Sense Annotated Dialogue Dataset with Benchmark.pdf
│   │   │   ├── Uncertainty Quantification for Data-Driven Change-Point Learning via.pdf
│   │   │   ├── Regulating Intermediate 3D Features for Vision-Centric Autonomous.pdf
│   │   │   ├── Imitation of Life：A Search Engine for Biologically Inspired Design.pdf
│   │   │   ├── Blind-Touch：Homomorphic Encryption-Based Distributed Neural Network.pdf
│   │   │   ├── Domain Invariant Learning for Gaussian Processes and Bayesian.pdf
│   │   │   ├── Effectiveness of Constant Stepsize in Markovian LSA and Statistical.pdf
│   │   │   ├── On Partial Optimal Transport：Revising the Infeasibility of Sinkhorn.pdf
│   │   │   ├── Peer Learning Learning Complex Policies in Groups from Scratch via Action.pdf
│   │   │   ├── Identification of Causal Structure with Latent Variables Based on Higher Order Cumulants.pdf
│   │   │   ├── MmAP：Multi-modal Alignment Prompt for Cross-domain Multi-task Learning.pdf
│   │   │   ├── DataElixir：Purifying Poisoned Dataset to Mitigate Backdoor Attacks.pdf
│   │   │   ├── Estimation of individual causal effects in network setup for multiple.pdf
│   │   │   ├── VITA：Carefully Chosen and Weighted Less Is Better in Medication.pdf
│   │   │   ├── SeGA：Preference-Aware Self-Contrasting Learning with Prompts for.pdf
│   │   │   ├── Scalable Geometric Fracture Assembly via Co-creation Space among Assemblers.pdf
│   │   │   ├── Fine-Grained Knowledge Selection and Restoration for Non-exemplar.pdf
│   │   │   ├── Augmented Negative Sampling for Collaborative Filtering.pdf
│   │   │   ├── Chasing Fairness in Graphs： A GNN Architecture Perspective.pdf
│   │   │   ├── LGMRec Local and Global Graph Learning for Multimodal Recommendation.pdf
│   │   │   ├── Fine-tuning Graph Neural Networks by Preserving Graph Generative.pdf
│   │   │   ├── Hierarchical and Incremental Structural Entropy Minimization for Unsupervised Social Event Detection.pdf
│   │   │   ├── Coreference Graph Guidance for Mind-Map Generation.pdf
│   │   │   ├── Doubly Perturbed Task Free Continual Learning.pdf
│   │   │   ├── Explaining Reinforcement Learning Agents Through Counterfactual Action Outcomes.pdf
│   │   │   ├── Progressive Poisoned Data Isolation for Training-time Backdoor Attack.pdf
│   │   │   ├── COOPER： Coordinating Specialized Agents towards a Complex Dialogue Goal.pdf
│   │   │   ├── BadRL：Sparse Targeted Backdoor Attack Against Reinforcement Learning.pdf
│   │   │   ├── Learning Multimodal Volumetric Features for Large-Scale Neuron Tracing.pdf
│   │   │   ├── Curriculum Learning and Imitation Learning for Model-free Control on Financial Time-series.pdf
│   │   │   ├── An Attentive Inductive Bias for Sequential Recommendation.pdf
│   │   │   ├── Entropic Open-set Active Learning.pdf
│   │   │   ├── EarnHFT Efficient Hierarchical Reinforcement Learning for High Frequency Trading.pdf
│   │   │   ├── Distributional Off-Policy Evaluation for Slate Recommendations.pdf
│   │   │   ├── Robust Loss Functions for Training Decision Trees with Noisy Labels.pdf
│   │   │   ├── VITA ‘Carefully Chosen and Weighted Less’ Is Better.pdf
│   │   │   ├── Big Learning Expectation Maximization.pdf
│   │   │   ├── Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling.pdf
│   │   │   ├── Competition among Pairwise Lottery Contests.pdf
│   │   │   ├── Envy-free House Allocation under Uncertainty Preferences.pdf
│   │   │   ├── Learning Domain-Independent Heuristics for Grounded and Lifted Planning.pdf
│   │   │   ├── RadOcc：Learning Cross-Modality Occupancy Knowledge through Rendering.pdf
│   │   │   ├── Root Cause Explanation of Outliers under Noisy Mechanisms.pdf
│   │   │   ├── Exploring Large Language Model for Graph Data Understanding.pdf
│   │   │   ├── Q-SENN： Quantized Self-explaining Neural Networks.pdf
│   │   │   ├── Knowledge Graph Error Detection with Contrastive Confidence Adaption.pdf
│   │   │   ├── Token-Level Contrastive Learning with Modality-Aware Prompting for Multimodal Intent Recognition.pdf
│   │   │   ├── STEM Unleashing the Power of Embeddings for Multi-task Recommendation.pdf
│   │   │   ├── Protect Your Score： Contact Tracing with Differential Privacy.pdf
│   │   │   ├── Inducing Point Operator Transformer：A Flexible and Scalable Architecture for Solving PDEs.pdf
│   │   │   ├── Weakly Supervised Open-Vocabulary Object Detection.pdf
│   │   │   ├── Cautiously-Optimistic Knowledge Sharing for Cooperative Multi-Agent Reinforcement Learning.pdf
│   │   │   ├── Ada-Ranker A Data Distribution Adaptive Ranking Paradigm.pdf
│   │   │   ├── Topic Shifts as a Proxy for Assessing Politicization in Social Media.pdf
│   │   │   ├── No prejudice! Fair Federated Graph Neural Networks for Personalized.pdf
│   │   │   ├── Fortify Your Defenses：Strategic Allocation to Enhance Defense Grid.pdf
│   │   │   ├── MESED： A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities.pdf
│   │   │   ├── CI-STHPAN Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph.pdf
│   │   │   ├── Towards Efficient Verification of Quantized Neural Networks.pdf
│   │   │   ├── On the Role of Server Momentum in Federated Learning.pdf
│   │   │   ├── Roll With the Punches：Expansion and Shrinkage of Soft Label Selection.pdf
│   │   │   ├── Bi-directional Adapter for Multi-modal Tracking.pdf
│   │   │   ├── FontDiffuser： One-Shot Font Generation via Denoising Diffusion with.pdf
│   │   │   ├── Signed Graph Neural Ordinary Differential Equation for Modeling.pdf
│   │   │   ├── Continuous Time Graph Representation with Sequential Survival Process.pdf
│   │   │   ├── FontDiffuser：One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning.pdf
│   │   │   ├── Brush Your Text：Synthesize Any Scene Text on Images via Diffusion Model.pdf
│   │   │   ├── LAMM：Label Alignment for Multi-Modal Prompt Learning.pdf
│   │   ├── 大模型MoE必读论文
│   │   │   ├── 【直播课原文】Pushing Mixture of Experts to the Limit Extremely Parameter Efficient MoE for Instruction Tuning.pdf
│   │   ├── LISA：大模型微调40篇
│   │   │   ├── 在大型视觉语言模型中评估物体幻觉.pdf
│   │   │   ├── MiniGPT-v2：大型语言模型作为视觉语言多任务学习的统一接口.pdf
│   │   │   ├── SPHINX：多模态大型语言模型的权重、任务和视觉嵌入的联合混合.pdf
│   │   │   ├── 利用显式推理链和可视化问题生成推进大型多模态模型.pdf
│   │   │   ├── 睁大眼睛？探索多模态LLMs的视觉缺陷.pdf
│   │   │   ├── LLaMA-VID：在大型语言模型中，一个图像值 2 个令牌.pdf
│   │   │   ├── LST：用于参数和内存高效迁移学习的梯形图侧调.pdf
│   │   │   ├── VL-PET：通过粒度控制进行视觉和语言参数高效调整.pdf
│   │   │   ├── mPLUG-Owl2：通过模态协作彻底改变多模态大型语言模型.pdf
│   │   │   ├── CaMML：适用于大型模型的情境感知多模态学习器.pdf
│   │   │   ├── Ziya-Visual：通过多任务指令调优的双语大型视觉语言模型.pdf
│   │   │   ├── Qwen-VL：用于理解、定位、文本阅读等的多功能视觉语言模型.pdf
│   │   │   ├── Lyrics-通过语义感知视觉对象促进细粒度语言-视觉对齐和理解.pdf
│   │   │   ├── MMBench：你的多模态模型是一个全能的玩家吗？.pdf
│   │   │   ├── OtterHD：高分辨率多模态模型.pdf
│   │   │   ├── 通过视觉指令调整改进基线.pdf
│   │   │   ├── 可视化指令调优.pdf
│   │   │   ├── 对比视觉-语言对齐使教学成为学习者的高效.pdf
│   │   │   ├── MiniGPT-4：使用高级大型语言模型增强视觉语言理解.pdf
│   │   │   ├── SVIT：扩展可视化指令调优.pdf
│   │   │   ├── InfMLLM：可视化语言任务的统一框架.pdf
│   │   │   ├── ReForm-Eval：通过统一重新制定面向任务的基准来评估大型视觉语言模型.pdf
│   │   │   ├── InstructBLIP：通过指令调整实现通用视觉语言模型.pdf
│   │   │   ├── Compacter：高效的低秩超复杂适配器层.pdf
│   │   │   ├── Shikra：释放多模态LLM的参照对话魔力.pdf
│   │   │   ├── Genixer：将多模态大型语言模型赋能为强大的数据生成器提供支持.pdf
│   │   │   ├── 眼见为实：提示 GPT-4V 进行更好的视觉指令调整.pdf
│   │   │   ├── SEED-Bench：对多模态LLMs进行生成式理解的基准测试.pdf
│   │   │   ├── UniPT：具有高效参数和存储器的迁移学习通用并行调优.pdf
│   │   │   ├── LISA： Layerwise Importance Sampling for Memory-efficient Large Language Model Fine-Tuning.pdf
│   │   │   ├── GlitchBench：大型多模态模型可以检测视频游戏故障吗？.pdf
│   │   │   ├── Video-LLaVA：通过投影前的对齐来学习统一的视觉表示.pdf
│   │   │   ├── 视觉语言预训练模型的近似提示调整.pdf
│   │   │   ├── VL-ADAPTER：用于视觉和语言任务的参数高效迁移学习.pdf
│   │   │   ├── ShareGPT4V：使用更好的字幕改进大型多模态模型.pdf
│   │   │   ├── 关于多模态语言模型的性能.pdf
│   │   │   ├── Visual Instruction Tuning with Polite Flamingo.pdf
│   │   │   ├── MM-Vet：评估大型多模态模型的集成能力.pdf
│   │   │   ├── HyperPELT：针对语言和视觉与语言任务的统一参数高效语言模型调优.pdf
│   │   │   ├── DoRA- Weight-Decomposed Low-Rank Adaptation.pdf
│   │   ├── ECCV24 收录论文83篇（更新中）
│   │   │   ├── 推荐工作
│   │   │   │   ├── FontStudio  Shape-Adaptive Diffusion Model for Coherent and Consistent Font Effect Generation.pdf
│   │   │   │   ├── LEGO Learning EGOcentric Action Frame Generation via Visual Instruction Tuning.pdf
│   │   │   │   ├── FSGS Real Time Few shot View Synthesis using Gaussian Splatting.pdf
│   │   │   │   ├── Glyph-ByT5  A Customized Text Encoder for Accurate Visual Text Rendering.pdf
│   │   │   │   ├── ZipLoRA  Any Subject in Any Style by Effectively Merging LoRAs..pdf
│   │   │   │   ├── DreamScene360 Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting.pdf
│   │   │   │   ├── SwapAnything  Enabling Arbitrary Object Swapping in Personalized Visual Editing.pdf
│   │   │   │   ├── DiffiT  Diffusion Vision Transformers for Image Generation.pdf
│   │   │   ├── Contrastive Region Guidance：Improving Grounding in Vision-Language Models without Training.pdf
│   │   │   ├── MIPI 2024 Challenge on Demosaic for Hybridevs Camera： Methods and Results.pdf
│   │   │   ├── BLINK：Multimodal Large Language Models Can See but Not Perceive.pdf
│   │   │   ├── CityGaussian：Real-time High-quality Large-Scale Scene Rendering with Gaussians.pdf
│   │   │   ├── Align, Minimize and Diversify  A Source-Free Unsupervised Domain Adaptation Method for Handwritten Text Recognition.pdf
│   │   │   ├── DATENeRF：Depth-Aware Text-based Editing of NeRFs.pdf
│   │   │   ├── Dyadic Interaction Modeling for Social Behavior Generation.pdf
│   │   │   ├── DragAnything：Motion Control for Anything.pdf
│   │   │   ├── GiT：Towards Generalist Vision Transformer through Universal Language Interface.pdf
│   │   │   ├── SuperGaussian：Repurposing Video Models for 3D Super Resolution.pdf
│   │   │   ├── EvAC3D  From Event-based Apparent Contours to 3D Models via Continuous Visual Hulls.pdf
│   │   │   ├── GScream：Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal.pdf
│   │   │   ├── N2F2：Hierarchical Scene Understanding with Nested Neural Feature Fields.pdf
│   │   │   ├── Object-Centric Diffusion for Efficient Video Editing.pdf
│   │   │   ├── SALVe： Semantic Alignment Verification for Floorplan Reconstruction from Sparse Panoramas.pdf
│   │   │   ├── Listen to Look into the Future：Audio-Visual Egocentric Gaze Anticipation.pdf
│   │   │   ├── MixDQ：Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization.pdf
│   │   │   ├── DreamMotion：Space-Time Self-Similarity Score Distillation for Zero-Shot Video Editing.pdf
│   │   │   ├── PEAVS：Perceptual Evaluation of Audio-Visual Synchrony Grounded in Viewers' Opinion Scores.pdf
│   │   │   ├── FreeInit：Bridging Initialization Gap in Video Diffusion Models.pdf
│   │   │   ├── SpecFormer Guarding Vision Transformer Robustness via Maximum Singular Value Penalization.pdf
│   │   │   ├── Empowering 3D Visual Grounding with Reasoning Capabilities.pdf
│   │   │   ├── Introducing HOT3D：An Egocentric Dataset for 3D Hand and Object Tracking.pdf
│   │   │   ├── Rasterized Edge Gradients：Handling Discontinuities Differentiably.pdf
│   │   │   ├── A Task is Worth One Word：Learning with Task Prompts for High-Quality Versatile Image Inpainting.pdf
│   │   │   ├── An Image is Worth 1`2 Tokens After Layer 2：Plug and Play Inference Acceleration for Large Vision Language Models.pdf
│   │   │   ├── Neural Graphics Texture Compression Supporting Random Access.pdf
│   │   │   ├── LA3  Efficient Label-Aware AutoAugment.pdf
│   │   │   ├── Towards Multimodal Open-Set Domain Generalization and Adaptation through Self-supervision.pdf
│   │   │   ├── Learning Neural Volumetric Pose Features for Camera Localization.pdf
│   │   │   ├── UniDream：Unifying Diffusion Priors for Relightable Text-to-3D Generation.pdf
│   │   │   ├── Prompt Federated Learning for Weather Forecasting：Toward Foundation Models on Meteorological Data.pdf
│   │   │   ├── Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance.pdf
│   │   │   ├── DGInStyle：Domain Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control.pdf
│   │   │   ├── Robo-ABC：Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipulation.pdf
│   │   │   ├── Agent3D-Zero： An automatic agent leverages VLM for zero-shot 3D understanding.pdf
│   │   │   ├── Compact3D：Smaller and Faster Gaussian Splatting with Vector Quantization.pdf
│   │   │   ├── Pix2Gif：Motion-Guided Diffusion for GIF Generation.pdf
│   │   │   ├── TriNeRFLet：A Wavelet Based Multiscale Triplane NeRF Representation.pdf
│   │   │   ├── ClusteringSDF：Self-Organized Neural Implicit Surfaces for 3D Decomposition.pdf
│   │   │   ├── Map-free Visual Relocalization：Metric Pose Relative to a Single Image.pdf
│   │   │   ├── T-Rex2： Towards Generic Object Detection via Text-Visual Prompt Synergy.pdf
│   │   │   ├── MVSplat：Efficient 3D Gaussian Splatting from Sparse Multi-View Images.pdf
│   │   │   ├── Training Full Spike Neural Networks via Auxiliary Accumulation Pathway.pdf
│   │   │   ├── ScanTalk：3D Talking Heads from Unregistered Scans.pdf
│   │   │   ├── VITATECS：A Diagnostic Dataset for Temporal Concept Understanding of Video-Language Models.pdf
│   │   │   ├── DenseNets Reloaded：Paradigm Shift Beyond ResNets and ViTs.pdf
│   │   │   ├── HYPE：Hyperbolic Entailment Filtering for Underspecified Images and Texts.pdf
│   │   │   ├── Open-Vocabulary SAM：Segment and Recognize Twenty-thousand Classes Interactively.pdf
│   │   │   ├── Controllable Human-Object Interaction Synthesis.pdf
│   │   │   ├── DragAPart：Learning a Part-Level Motion Prior for Articulated Objects.pdf
│   │   │   ├── DragVideo：Interactive Drag-style Video Editing.pdf
│   │   │   ├── GalLoP：Learning Global and Local Prompts.pdf
│   │   │   ├── GLAD： Towards Better Reconstruction with Global and Local Adaptive Diffusion Models for Unsupervised Anomaly Detection.pdf
│   │   │   ├── CoLLaVO：Crayon Large Language and Vision mOdel.pdf
│   │   │   ├── WordRobe：Text-Guided Generation of Textured 3D Garments.pdf
│   │   │   ├── AdaDistill：Adaptive Knowledge Distillation for Deep Face Recognition.pdf
│   │   │   ├── AnyLens：A Generative Diffusion Model with Any Rendering Lens.pdf
│   │   │   ├── PointLLM：Empowering Large Language Models to Understand Point Clouds.pdf
│   │   │   ├── E.T. the Exceptional Trajectories：Text-to-camera-trajectory generation with character awareness.pdf
│   │   │   ├── DreamReward：Text-to-3D Generation with Human Preference.pdf
│   │   │   ├── Discovering Novel Actions from Open World Egocentric Videos with Object-Grounded Visual Commonsense Reasoning.pdf
│   │   │   ├── Mismatch Quest：Visual and Textual Feedback for Image-Text Misalignment.pdf
│   │   │   ├── Pyramid Diffusion for Fine 3D Large Scene Generation.pdf
│   │   │   ├── MoAI：Mixture of All Intelligence for Large Language and Vision Models.pdf
│   │   │   ├── NIGHT  - Non-Line-of-Sight Imaging from Indirect Time of Flight Data.pdf
│   │   │   ├── Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge.pdf
│   │   │   ├── Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation.pdf
│   │   │   ├── PaPr  Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster Inference.pdf
│   │   │   ├── ZeST：Zero-Shot Material Transfer from a Single Image.pdf
│   │   │   ├── GVGEN：A text-to-GS generation framework with volumetric representation.pdf
│   │   │   ├── MotionLCM：Real-time Controllable Motion Generation via Latent Consistency Model.pdf
│   │   │   ├── ManiGaussian：Dynamic Gaussian Splatting for Multi-task Robotic Manipulation.pdf
│   │   │   ├── MOTIONDIRECTOR：MOTION CUSTOMIZATION OF TEXT-TO-VIDEO DIFFUSION MODELS.pdf
│   │   ├── Code Llama论文（5月最新+内含24篇）
│   │   │   ├── 2 LLaMA 1、2 论文&源码
│   │   │   │   ├── 源码：llama-main.zip
│   │   │   │   ├── LLaMA： Open and Efficient Foundation Language Models.pdf
│   │   │   │   ├── Llama 2：Open Foundation and Fine-Tuned Chat Models.pdf
│   │   │   ├── 3 Code Llama 其他相关论文
│   │   │   │   ├── TinyLlama：An Open-Source Small Language Model.pdf
│   │   │   │   ├── S3LLM： Large-Scale Scientific Software Understanding.pdf
│   │   │   │   ├── IS SELF-REPAIR A SILVER BULLET FOR CODE GENERATION.pdf
│   │   │   │   ├── MFTCODER： BOOSTING CODE LLMS WITH MULTITASK.pdf
│   │   │   │   ├── README++：Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment.pdf
│   │   │   │   ├── Open-TransMind：A New Baseline and Benchmark for 1st Foundation Model.pdf
│   │   │   │   ├── Binary Code Summarization：Benchmarking ChatGPT、GPT-4 and Other Large Language Models.pdf
│   │   │   │   ├── LLAMA PRO：Progressive LLaMA with Block Expansion.pdf
│   │   │   │   ├── LLaMA-LoRA Neural Prompt Engineering.pdf
│   │   │   │   ├── Open-SQL Framework： Enhancing Text-to-SQL on Open-source Large.pdf
│   │   │   │   ├── A Comparative Analysis of Large Language Models for Code.pdf
│   │   │   │   ├── A Controlled Experiment on the Energy Efficiency of the Source Code Generated by Code Llama.pdf
│   │   │   │   ├── CRUXEval：A Benchmark for Code Reasoning.pdf
│   │   │   │   ├── LLaMA-Reviewer：Advancing Code Review Automation with Large Language Models through Parameter-Efficient Fine-Tuning.pdf
│   │   │   │   ├── LLaMA-Adapter：Efficient Fine-tuning of Language.pdf
│   │   │   │   ├── LLaMA-Adapter V2：Parameter-Efficient Visual Instruction Model.pdf
│   │   │   │   ├── Semantic Similarity Loss for Neural Source Code.pdf
│   │   │   │   ├── Granite Code Models：A Family of Open.pdf
│   │   │   │   ├── Evaluating In-Context Learning of Libraries for Code Generation.pdf
│   │   │   │   ├── Making Large Language Models A Better Foundation For Dense Retrieval.pdf
│   │   │   │   ├── DebugBench：Evaluating Debugging Capability of Large Language Models.pdf
│   │   │   ├── 1 Code Llama 论文&源码
│   │   │   │   ├── 源码：codellama-main.zip
│   │   │   │   ├── 论文：Code Llama：Open Foundation Models for Code.pdf
│   │   ├── ICML 2024 67篇
│   │   │   ├── ICML'23
│   │   │   │   ├── 看不见的概括，逻辑推理和学位课程.pdf
│   │   │   │   ├── 适应零和不完全信息博弈中的博弈树.pdf
│   │   │   │   ├── 大型语言模型的水印.pdf
│   │   │   │   ├── 像素递归神经网络.pdf
│   │   │   │   ├── 混淆梯度给人一种虚假的安全感：规避对抗性示例的防御.pdf
│   │   │   │   ├── D-Adaptation 的无学习率学习.pdf
│   │   │   │   ├── 异质性治疗效果的因果等渗校准.pdf
│   │   │   │   ├── 通过影响函数理解黑盒预测.pdf
│   │   │   │   ├── Beyond Hawkes：时空点过程的神经多事件预测.pdf
│   │   │   │   ├── 用于统一通用逼近的 Leaky-ReLU 神经网络的最小宽度.pdf
│   │   │   │   ├── 通过噪声到噪声映射从噪声 3D 点云中学习有符号距离函数.pdf
│   │   │   │   ├── 用于子集选择的可解释行列式选择模型.pdf
│   │   │   │   ├── 正交解耦高斯过程的球形诱导特征.pdf
│   │   │   ├── ICML'24 最佳论文+时间检验奖
│   │   │   │   ├── Scaling Rectified Flow Transformers for High-Resolution Image Synthesis.pdf
│   │   │   │   ├── Debating with More Persuasive LLMs Leads to More Truthful Answers.pdf
│   │   │   │   ├── Information Complexity of Stochastic Convex Optimization：Applications to Generalization, Memorization, and Tracing.pdf
│   │   │   │   ├── Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo.pdf
│   │   │   │   ├── Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution.pdf
│   │   │   │   ├── DeCAF： A Deep Convolutional Activation Feature for Generic Visual Recognition.pdf
│   │   │   │   ├── VideoPoet：A Large Language Model for Zero-Shot Video Generation.pdf
│   │   │   │   ├── Stealing part of a production language model.pdf
│   │   │   │   ├── Genie：Generative Interactive Environments.pdf
│   │   │   │   ├── Considerations for Differentially Private Learning with Large-Scale Public Pretraining.pdf
│   │   │   │   ├── Position：Measure Dataset Diversity, Don't Just Claim It.pdf
│   │   │   ├── ICML'24 oral（更新中）
│   │   │   │   ├── Transformers Learn Nonlinear Features In Context：Nonconvex Mean-field Dynamics on the Attention Landscape.pdf
│   │   │   │   ├── How Private are DP-SGD Implementations.pdf
│   │   │   │   ├── Monitoring AI-Modified Content at Scale：A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews.pdf
│   │   │   │   ├── Hybrid2 Neural ODE Causal Modeling and an Application to Glycemic Response.pdf
│   │   │   │   ├── GaLore：Memory-Efficient LLM Training by Gradient Low-Rank Projection.pdf
│   │   │   │   ├── PrE-Text：Training Language Models on Private Federated Data in the Age of LLMs.pdf
│   │   │   │   ├── FedMBridge：Bridgeable Multimodal Federated Learning.pdf
│   │   │   │   ├── Position：Open-Endedness is Essential for Artificial Superhuman Intelligence.pdf
│   │   │   │   ├── Less is More：on the Over-Globalizing Problem in Graph Transformers.pdf
│   │   │   │   ├── Evolution of Heuristics：Towards Efficient Automatic Algorithm Design Using Large Language Model.pdf
│   │   │   │   ├── Expressivity and Generalization：Fragment-Biases for Molecular GNNs.pdf
│   │   │   │   ├── Locality-Sensitive Hashing-Based Efficient Point Transformer with Applications in High-Energy Physics.pdf
│   │   │   │   ├── Stop Regressing：Training Value Functions via Classification for Scalable Deep RL.pdf
│   │   │   │   ├── Emergent Equivariance in Deep Ensembles.pdf
│   │   │   │   ├── Improving Transformers with Dynamically Composable Multi-Head Attention.pdf
│   │   │   │   ├── Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling.pdf
│   │   │   │   ├── SAPG：Split and Aggregate Policy Gradients.pdf
│   │   │   │   ├── Position：Automatic Environment Shaping is the Next Frontier in RL.pdf
│   │   │   │   ├── Multiplicative Weights Update, Area Convexity and Random Coordinate Descent for Densest Subgraph Problems.pdf
│   │   │   │   ├── Weak-to-Strong Generalization：Eliciting Strong Capabilities With Weak Supervision.pdf
│   │   │   │   ├── Discovering Environments with XRM.pdf
│   │   │   │   ├── Unified Training of Universal Time Series Forecasting Transformers.pdf
│   │   │   │   ├── A Dynamic Algorithm for Weighted Submodular Cover Problem.pdf
│   │   │   │   ├── Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution Learnability.pdf
│   │   │   │   ├── SceneCraft：An LLM Agent for Synthesizing 3D Scenes as Blender Code.pdf
│   │   │   │   ├── Doubly Robust Causal Effect Estimation under Networked Interference via Targeted Learning.pdf
│   │   │   │   ├── Robust CLIP：Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models.pdf
│   │   │   │   ├── Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks.pdf
│   │   │   │   ├── Position：Technical Research and Talent is Needed for Effective AI Governance.pdf
│   │   │   │   ├── Position：Opportunities Exist for Machine Learning in Magnetic Fusion Energy.pdf
│   │   │   │   ├── Online Matching with Stochastic Rewards：Provable Better Bound via Adversarial Reinforcement Learning.pdf
│   │   │   │   ├── How do Large Language Models Navigate Conflicts between Honesty and Helpfulness.pdf
│   │   │   │   ├── Is DPO Superior to PPO for LLM Alignment  A Comprehensive Study.pdf
│   │   │   │   ├── Trained Random Forests Completely Reveal your Dataset.pdf
│   │   │   │   ├── Rethinking Data Shapley for Data Selection Tasks：Misleads and Merits.pdf
│   │   │   │   ├── Inferring the Long-Term Causal Effects of Long-Term Treatments from Short-Term Experiments.pdf
│   │   │   │   ├── Fast Co-Training under Weak Dependence via Stream-Based Active Learning.pdf
│   │   │   │   ├── Learning Useful Representations of Recurrent Neural Network Weight Matrices.pdf
│   │   │   │   ├── Bottleneck-Minimal Indexing for Generative Document Retrieval.pdf
│   │   │   │   ├── I.O Complexity of Attention or How Optimal is FlashAttention.pdf
│   │   │   │   ├── ACE：Off-Policy Actor-Critic with Causality-Aware Entropy Regularization.pdf
│   │   │   │   ├── Position：Beyond Personhood：Agency, Accountability, and the Limits of Anthropomorphic Ethical Analysis.pdf
│   │   │   │   ├── LoRA Training in the NTK Regime has No Spurious Local Minima.pdf
│   │   ├── 100篇大模型必读论文
│   │   │   ├── Solving Quantitative Reasoning Problems with Language Models.pdf
│   │   │   ├── A ConvNet for the 2020s..pdf
│   │   │   ├── KERPLE Kernelized Relative Positional Embedding for Length Extrapolation.pdf
│   │   │   ├── Emergent Abilities of Large Language Models.pdf
│   │   │   ├── Red Teaming Language Models with Language Models.pdf
│   │   │   ├── GET3D A Generative Model of High Quality 3D Textured Shapes Learned from Images.pdf
│   │   │   ├── GLM-130B An Open Bilingual Pre-trained Model.pdf
│   │   │   ├── Compositional character models for open vocabulary word representation.pdf
│   │   │   ├── Efficient Estimation of Word Representation in Vector Space.pdf
│   │   │   ├── Beyond the Imitation Game Quantifying and extrapolating the capabilities of language models.pdf
│   │   │   ├── A Survey on Knowledge Graphs Representation, Acquisition, and Applications.pdf
│   │   │   ├── Evaluating Large Language Models Trained on Code.pdf
│   │   │   ├── Multi-Grained Vision Language Pre-Training Aligning Texts with Visual Concepts.pdf
│   │   │   ├── When Vision Transformers Outperform ResNets without Pre-training or Strong Data Augmentations.pdf
│   │   │   ├── OFA Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework..pdf
│   │   │   ├── COLD A Benchmark for Chinese Offensive Language Detection.pdf
│   │   │   ├── Language models generalize beyond natural proteins.pdf
│   │   │   ├── High-Resolution Image Synthesis with Latent Diffusion Models.pdf
│   │   │   ├── Fine-Tuning Language Models from Human Preferences.pdf
│   │   │   ├── Imagen Video High Definition Video Generation with Diffusion Models.pdf
│   │   │   ├── No Language Left Behind Scaling Human-Centered Machine Translation.pdf
│   │   │   ├── Zero-Shot Video Question Answering via Frozen Bidirectional Language Models.pdf
│   │   │   ├── Towards Efficient Post-training Quantization of Pre-trained Language Models.pdf
│   │   │   ├── Retrieval Augmented Generation for.pdf
│   │   │   ├── Reducing Activation Recomputation in Large Transformer Models.pdf
│   │   │   ├── GPT Understands, Too.pdf
│   │   │   ├── Transformer-Xl Attentive Language Models Beyond A Fixed-Length Context.pdf
│   │   │   ├── InstructPix2Pix Learning to Follow Image Editing Instructions.pdf
│   │   │   ├── PPT Pre-trained Prompt Tuning for Few-shot Learning.pdf
│   │   │   ├── Generating Training Data with Language Models Towards Zero-Shot Language Understanding.pdf
│   │   │   ├── SmoothQuant Accurate and Efficient Post-Training Quantization for Large Language Models.pdf
│   │   │   ├── Tensor Programs V Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer.pdf
│   │   │   ├── Hierarchical Text-Conditional Image Generation with CLIP Latents.pdf
│   │   │   ├── Knowledgeable Prompt-tuning Incorporating Knowledge into Prompt Verbalizer for Text Classification.pdf
│   │   │   ├── BLOOM A 176B-Parameter Open-Access Multilingual Language Model.pdf
│   │   │   ├── SGM Sequence Generation Model for Multi-label Classification.pdf
│   │   │   ├── Pre-train, Prompt, and Predict A Systematic Survey of Prompting Methods in Natural Language Processing.pdf
│   │   │   ├── Improving Language Models by Retrieving from Trillions of Tokens.pdf
│   │   │   ├── Learning Transferable Visual Models From Natural Language Supervision.pdf
│   │   │   ├── BaGuaLu targeting brain scale pretrained models with over 37 million cores.pdf
│   │   │   ├── Zero-Shot Text-to-Image Generation.pdf
│   │   │   ├── CogView Mastering Text-to-Image Generation via Transformers.pdf
│   │   │   ├── Training Language Models with Memory Augmentation.pdf
│   │   │   ├── Denoising Diffusion Implicit Models.pdf
│   │   │   ├── WebGPT Browser-assisted question-answering with human feedback.pdf
│   │   │   ├── Fine-mixing Mitigating Backdoors in Fine-tuned Language Models.pdf
│   │   │   ├── GPT-NeoX-20B An Open-Source Autoregressive Language Model.pdf
│   │   │   ├── Character-level Convolutional Networks for Text Classification.pdf
│   │   │   ├── Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners.pdf
│   │   │   ├── FastMoE A Fast Mixture-of-Expert Training System.pdf
│   │   │   ├── Autoformalization with Large Language Models.pdf
│   │   │   ├── Evolutionary-scale prediction of atomic level protein structure with a language model.pdf
│   │   │   ├── Score-Based Generative Modeling through Stochastic Differential Equations.pdf
│   │   │   ├── ERNIE 3.0 Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.pdf
│   │   │   ├── Versatile Diffusion Text, Images and Variations All in One Diffusion Model.pdf
│   │   │   ├── Discrete mean estimates and the Landau-Siegel zero.pdf
│   │   │   ├── Training Compute-Optimal Large Language Models.pdf
│   │   │   ├── Video PreTraining (VPT) Learning to Act by Watching Unlabeled Online Videos.pdf
│   │   │   ├── UnifiedSKG Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models.pdf
│   │   │   ├── Foundation Transformers.pdf
│   │   │   ├── Chain-of-Thought Prompting Elicits Reasoning in Large Language Models.pdf
│   │   │   ├── PAL Program-aided Language Models.pdf
│   │   │   ├── GLM General Language Model Pretraining with Autoregressive Blank Infilling.pdf
│   │   │   ├── Training language models to follow instructions with human feedback.pdf
│   │   │   ├── Colossal-AI A Unified Deep Learning System For Large-Scale Parallel Training.pdf
│   │   │   ├── Galactica A Large Language Model for Science.pdf
│   │   │   ├── Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval.pdf
│   │   │   ├── PaLM Scaling Language Modeling with Pathways.pdf
│   │   │   ├── OPT Open Pre-trained Transformer Language Models.pdf
│   │   │   ├── Few-shot Learning with Multilingual Language Models.pdf
│   │   │   ├── UL2 Unifying Language Learning Paradigms.pdf
│   │   │   ├── Prompt-and-Rerank A Method for Zero-Shot and Few-Shot Arbitrary Textual Style Transfer with Small Language Models.pdf
│   │   │   ├── InternImage Exploring Large-Scale Vision Foundation Models with Deformable Convolutions.pdf
│   │   │   ├── Sequence to Sequence Learning with Neural Networks.pdf
│   │   │   ├── AltCLIP Altering the Language Encoder in CLIP for Extended Language Capabilities.pdf
│   │   │   ├── Convolutional Neural Network for Sentence Classification.pdf
│   │   │   ├── Character-Aware Neural Language Models.pdf
│   │   │   ├── Holistic Evaluation of Language Models.pdf
│   │   │   ├── CPM A large-scale generative Chinese Pre-trained language model.pdf
│   │   │   ├── Language Models are Few-Shot Learners.pdf
│   │   │   ├── DiffusionDet Diffusion Model for Object Detection.pdf
│   │   │   ├── Improving language understanding by generative pre training.pdf
│   │   │   ├── DeepSpeed Data Efficiency Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing.pdf
│   │   │   ├── PaLI A Jointly-Scaled Multilingual Language-Image Model.pdf
│   │   │   ├── Language Models are Unsupervised Multitask Learners.pdf
│   │   │   ├── Git Re-Basin Merging Models modulo Permutation Symmetries.pdf
│   │   │   ├── How Much Knowledge Can You Pack Into the Parameters of a Language Model.pdf
│   │   │   ├── BLIP Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation..pdf
│   │   │   ├── Muse Text-To-Image Generation via Masked Generative Transformers.pdf
│   │   │   ├── The Stability-Efficiency Dilemma Investigating Sequence Length Warmup for Training GPT Models.pdf
│   │   │   ├── Masked Autoencoders Are Scalable Vision Learners.pdf
│   │   │   ├── A Survey on In-context Learning.pdf
│   │   │   ├── An Image is Worth 16x16 Words Transformers for Image Recognition at Scale.pdf
│   │   │   ├── Learning to summarize from human feedback.pdf
│   │   │   ├── ERNIE 3.0 Titan Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation.pdf
│   │   │   ├── Language Models as Knowledge Bases.pdf
│   │   │   ├── CodeGen An Open Large Language Model for Code with Multi-Turn Program Synthesis.pdf
│   │   │   ├── LAION-5B An open large-scale dataset for training next generation image-text models.pdf
│   │   │   ├── Generating Sequences With Recurrent Neural Networks.pdf
│   │   │   ├── Language Models as Zero-Shot Planners Extracting Actionable Knowledge for Embodied Agents.pdf
│   │   │   ├── Vision-Language Pre-Training with Triple Contrastive Learning.pdf
│   │   │   ├── 01必读.jpg
│   │   ├── EMNLP 19篇
│   │   │   ├── 自然语言生成的主动学习.pdf
│   │   │   ├── 通过概念化来解释嵌入空间.pdf
│   │   │   ├── IMTLab：用于构建、评估和诊断交互式机器翻译系统的开源平台.pdf
│   │   │   ├── 驾驭灰色地带：不确定性和过度自信的表达如何影响语言模型.pdf
│   │   │   ├── RAPL：一种用于少样本文档级关系提取的关系感知原型学习方法.pdf
│   │   │   ├── 重新审视机器翻译的跨语言分类.pdf
│   │   │   ├── 视觉、机器人技术及其他领域的语言基础.pdf
│   │   │   ├── 通过对NLP领域学术写作的对比分析来解决语言偏见.pdf
│   │   │   ├── 了解模型压缩对大型语言模型中社会偏见的影响.pdf
│   │   │   ├── 凝聚力：生成文本连贯性的增量与整体评估的新基准.pdf
│   │   │   ├── 用语言模型进行推理就是用世界模型进行规划.pdf
│   │   │   ├── 使用大型语言模型进行可解释的心理健康分析.pdf
│   │   │   ├── TopWORDS-Poetry：基于贝叶斯推理的中国古典诗歌同步文本分割和单词发现.pdf
│   │   │   ├── 学习用于多模态失语症类型检测的共同语音手势.pdf
│   │   │   ├── 具有 Wasserstein 独立性的公平文本分类.pdf
│   │   │   ├── ROBBIE：大型生成语言模型的鲁棒偏差评估.pdf
│   │   │   ├── 大型语言模型可以自我改进.pdf
│   │   │   ├── SODA：具有社会常识语境化的百万级对话提炼.pdf
│   │   │   ├── 混合倒挂索引是用于密集检索的鲁棒加速器.pdf
│   │   ├── CVPR 2024 (持续更新）
│   │   │   ├── 1 CVPR'24 获奖论文
│   │   │   │   ├── 4 最佳学生论文次优奖
│   │   │   │   │   ├── Objects as volumes： A stochastic geometry view of opaque solids.pdf
│   │   │   │   │   ├── Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf
│   │   │   │   ├── 3 最佳论文次优奖
│   │   │   │   │   ├── pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf
│   │   │   │   ├── 2 最佳学生论文奖
│   │   │   │   │   ├── Mip-Splatting：Alias-free 3D Gaussian Splatting.pdf
│   │   │   │   │   ├── BIOCLIP：A Vision Foundation Model for the Tree of Life.pdf
│   │   │   │   ├── 1 最佳论文奖
│   │   │   │   │   ├── Generative Image Dynamics.pdf
│   │   │   │   │   ├── Rich Human Feedback for Text-to-Image Generation.pdf
│   │   │   ├── 3 CVPR'24 oral论文（更新完毕）
│   │   │   │   ├── 10 自主导航和自我中心视觉
│   │   │   │   │   ├── EgoGen：An Egocentric Synthetic Data Generator.pdf
│   │   │   │   │   ├── SAFDNet： A Simple and Effective Network for Fully Sparse 3D Object Detection.pdf
│   │   │   │   │   ├── UnO：Unsupervised Occupancy Fields for Perception and Forecasting.pdf
│   │   │   │   ├── 15 低样本学习、自监督学习和半监督学习
│   │   │   │   │   ├── Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps.pdf
│   │   │   │   │   ├── CroSel.pdf
│   │   │   │   │   ├── LTGC：Long-tail Recognition via Leveraging LLMs-driven Generated Content.pdf
│   │   │   │   ├── 13 数据集和评估
│   │   │   │   │   ├── 360+x：A Panoptic Multi-modal Scene Understanding Dataset.pdf
│   │   │   │   │   ├── Deep Generative Model based Rate-Distortion for Image Downscaling Assessment.pdf
│   │   │   │   │   ├── Ego-Exo4D：Understanding Skilled Human Activity from First- and Third-Person Perspectives.pdf
│   │   │   │   ├── 12 动作和运动分析
│   │   │   │   │   ├── An N-Point Linear Solver for Line and Motion Estimation with Event Cameras.pdf
│   │   │   │   │   ├── Modeling Multimodal Social Interactions：New Challenges and Baselines with Densely Aligned Representations.pdf
│   │   │   │   │   ├── FineParser：A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment.pdf
│   │   │   │   │   ├── RoHM：Robust Human Motion Reconstruction via Diffusio.pdf
│   │   │   │   ├── 5 深度学习架构与技术
│   │   │   │   │   ├── Learning Structure-from-Motion with Graph Attention Networks.pdf
│   │   │   │   │   ├── Florence-2： Advancing a Unified Representation for a Variety of Vision Tasks.pdf
│   │   │   │   │   ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf
│   │   │   │   │   ├── Neural Lineage.pdf
│   │   │   │   │   ├── Neural Redshift：Random Networks are not Random Functions.pdf
│   │   │   │   ├── 7 单视角三维技术
│   │   │   │   │   ├── WALT3D：Generating Realistic Training Data from Time-Lapse Imagery for Reconstructing Dynamic Objects Under Occlusion.pdf
│   │   │   │   │   ├── EscherNet：A Generative Model for Scalable View Synthesis.pdf
│   │   │   │   │   ├── Rethinking Inductive Biases for Surface Normal Estimation.pdf
│   │   │   │   ├── 17 图像与视频合成 2
│   │   │   │   │   ├── Visual Anagrams：Generating Multi-View Optical Illusions with Diffusion Models.pdf
│   │   │   │   │   ├── Alchemist：Parametric Control of Material Properties with Diffusion Models.pdf
│   │   │   │   │   ├── MonoHair：High-Fidelity Hair Modeling from a Monocular Video.pdf
│   │   │   │   ├── 1 低层次视觉
│   │   │   │   │   ├── Towards Robust Event-guided Low-Light Image Enhancement.pdf
│   │   │   │   │   ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf
│   │   │   │   │   ├── Specularity Factorization for Low-Light Enhancement.pdf
│   │   │   │   │   ├── FMA-Net：Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf
│   │   │   │   │   ├── FlowIE：Efficient Image Enhancement via Rectified Flow.pdf
│   │   │   │   ├── 4 图像与视频合成
│   │   │   │   │   ├── FreeU：Free Lunch in Diffusion U-Net.pdf
│   │   │   │   │   ├── Attention Calibration for Disentangled Text-to-Image Personalization.pdf
│   │   │   │   │   ├── Instruct-Imagen： Image Generation with Multi-modal Instruction.pdf
│   │   │   │   │   ├── Ranni：Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf
│   │   │   │   │   ├── Style Aligned Image Generation via Shared Attention.pdf
│   │   │   │   ├── 18 多模态学习
│   │   │   │   │   ├── NoiseCLR：A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models.pdf
│   │   │   │   │   ├── InternVL：Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks.pdf
│   │   │   │   │   ├── MetaCloak.pdf
│   │   │   │   │   ├── Describing Differences in Image Sets with Natural Language.pdf
│   │   │   │   ├── 6 多视角三维技术和传感器
│   │   │   │   │   ├── Point Transformer V3：Simpler Faster Stronger.pdf
│   │   │   │   │   ├── Steerers：A Framework for Rotation Equivariant Keypoint Descriptors.pdf
│   │   │   │   │   ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf
│   │   │   │   │   ├── Seeing the World through Your Eyes.pdf
│   │   │   │   │   ├── Matching 2D Images in 3D： Metric Relative Pose from Metric Correspondences.pdf
│   │   │   │   ├── 14 多视角三维技术和传感器 2
│   │   │   │   │   ├── Learning to Produce Semi-dense Correspondences for Visual Localization.pdf
│   │   │   │   ├── 3 人类行为和特征
│   │   │   │   │   ├── Semantic Human Mesh Reconstruction with Textures.pdf
│   │   │   │   │   ├── Stratified Avatar Generation from Sparse Observations.pdf
│   │   │   │   │   ├── MultiPly：Reconstruction of Multiple People from Monocular Video in the Wild.pdf
│   │   │   │   │   ├── Relightable Gaussian Codec Avatars.pdf
│   │   │   │   │   ├── URHand：Universal Relightable Hands.pdf
│   │   │   │   ├── 16 低层次视觉与遥感
│   │   │   │   │   ├── DART：Implicit Doppler Tomography for Radar Novel View Synthesis.pdf
│   │   │   │   │   ├── LDP： Language-driven Dual-Pixel Image Defocus Deblurring Network.pdf
│   │   │   │   ├── 8 视觉、语言与推理
│   │   │   │   │   ├── Eyes Wide Shut  Exploring the Visual Shortcomings of Multimodal LLMs.pdf
│   │   │   │   │   ├── Visual Program Distillation：Distilling Tools and Programmatic Reasoning into Vision-Language Models.pdf
│   │   │   │   │   ├── LISA：Reasoning Segmentation via Large Language Model.pdf
│   │   │   │   ├── 9 医学与物理视觉
│   │   │   │   │   ├── Transcriptomics-guided Slide Representation Learning in Computational Pathology.pdf
│   │   │   │   ├── 11三维视觉
│   │   │   │   │   ├── A Subspace-Constrained Tyler’s Estimator and its Applications to Structure from Motion.pdf
│   │   │   │   ├── 2 视觉与图形
│   │   │   │   │   ├── Eclipse：Disambiguating Illumination and Materials using Unintended Shadows.pdf
│   │   │   │   │   ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf
│   │   │   │   │   ├── DiffusionLight：Light Probes for Free by Painting a Chrome Ball.pdf
│   │   │   ├── 4 CVPR'24 highlight论文（更新中）
│   │   │   │   ├── Learning Structure-from-Motion with Graph Attention Networks.pdf
│   │   │   │   ├── CFPL-FAS Class Free Prompt Learning for Generalizable Face Anti-spoofing.pdf
│   │   │   │   ├── Efficient Deformable ConvNets  Rethinking Dynamic and Sparse Operator for Vision Applications.pdf
│   │   │   │   ├── Human Motion Prediction Under Unexpected Perturbation.pdf
│   │   │   │   ├── XCube Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies.pdf
│   │   │   │   ├── Boosting Neural Representations for Videos with a Conditional Decoder.pdf
│   │   │   │   ├── Deciphering 'What' and 'Where' Visual Pathways from Spectral Clustering of Layer-Distributed Neural Representations.pdf
│   │   │   │   ├── Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation.pdf
│   │   │   │   ├── ODIN  A Single Model for 2D and 3D Segmentation.pdf
│   │   │   │   ├── LucidDreamer Towards High-Fidelity Text-to-3D Generation via Interval Score Matching.pdf
│   │   │   │   ├── Ranni Taming Text-to-Image Diffusion for Accurate Instruction Following.pdf
│   │   │   │   ├── Point2CAD  Reverse Engineering CAD Models from 3D Point Clouds.pdf
│   │   │   │   ├── ViT-CoMer Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.pdf
│   │   │   │   ├── Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation.pdf
│   │   │   │   ├── Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning.pdf
│   │   │   │   ├── FinePOSE Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models.pdf
│   │   │   │   ├── HOLD  Category-agnostic 3D Reconstruction of Interacting Hands and Objects from Vide.pdf
│   │   │   │   ├── Enhancing Video Super-Resolution via Implicit Resampling-based Alignment.pdf
│   │   │   │   ├── Relightable and Animatable Neural Avatar from Sparse-View Video.pdf
│   │   │   │   ├── In Search of a Data Transformation That Accelerates Neural Field Training.pdf
│   │   │   │   ├── FMA-Net Flow-Guided Dynamic Filtering and Iterative Feature Refinement with Multi-Attention for Joint Video Super-Resolution and Deblurring.pdf
│   │   │   │   ├── LocLLM  Exploiting Generalizable Human Keypoint Localization via Large Language Model.pdf
│   │   │   │   ├── DreamPropeller  Supercharge Text-to-3D Generation with Parallel Sampling.pdf
│   │   │   │   ├── Vanishing-Point-Guided Video Semantic Segmentation of Driving Scenes.pdf
│   │   │   │   ├── Breathing Life Into Sketches Using Text-to-Video Priors.pdf
│   │   │   │   ├── In-Context Matting.pdf
│   │   │   │   ├── From Correspondences to Pose  Non-minimal Certifiably Optimal Relative Pose without Disambiguation.pdf
│   │   │   │   ├── Neural Redshift  Random Networks are not Random Functions.pdf
│   │   │   │   ├── 3D Human Pose Perception from Egocentric Stereo Videos.pdf
│   │   │   │   ├── pix2gestalt  Amodal Segmentation by Synthesizing Wholes.pdf
│   │   │   │   ├── Frequency-Adaptive Dilated Convolution for Semantic Segmentation.pdf
│   │   │   │   ├── HandDiff 3D Hand Pose Estimation with Diffusion on Image-Point Cloud.pdf
│   │   │   │   ├── RAVE  Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models.pdf
│   │   │   │   ├── 4D-DRESS A 4D Dataset of Real-world Human Clothing with Semantic Annotations.pdf
│   │   │   │   ├── Bilateral Event Mining and Complementary for Event Stream Super-Resolution.pdf
│   │   │   │   ├── Real-Time Simulated Avatar from Head-Mounted Sensors.pdf
│   │   │   │   ├── Tri-Modal Motion Retrieval by Learning a Joint Embedding Space.pdf
│   │   │   │   ├── DiffusionLight Light Probes for Free by Painting a Chrome Ball.pdf
│   │   │   │   ├── From Activation to Initialization Scaling Insights for Optimizing Neural Fields.pdf
│   │   │   │   ├── FreeU Free Lunch in Diffusion U-Net.pdf
│   │   │   │   ├── MMM  Generative Masked Motion Model.pdf
│   │   │   │   ├── Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis.pdf
│   │   │   │   ├── Attention-Propagation Network for Egocentric Heatmap to 3D.pdf
│   │   │   │   ├── GraCo Granularity-Controllable Interactive Segmentation.pdf
│   │   │   │   ├── No Time to Train  Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation.pdf
│   │   │   │   ├── HashPoint Accelerated Point Searching and Sampling for Neural Rendering.pdf
│   │   │   │   ├── CAD-SIGNet CAD Language Inference from Point Clouds using Layer-wise Sketch Instance Guided Attention.pdf
│   │   │   │   ├── Tri-Perspective View Decomposition for Geometry-Aware Depth Completion.pdf
│   │   │   │   ├── Move as You Say, Interact as You Can  Language-guided Human Motion Generation with Scene Affordance.pdf
│   │   │   │   ├── Seeing the World through Your Eyes.pdf
│   │   │   │   ├── Enforcing Geometric and Physical Priors.pdf
│   │   │   │   ├── CAT-Seg  Cost Aggregation for Open-Vocabulary Semantic Segmentation.pdf
│   │   │   │   ├── Suppress and Rebalance  Towards Generalized Multi-Modal Face Anti-Spoofing.pdf
│   │   │   │   ├── Unbiased Estimator for Distorted Conics in Camera Calibration.pdf
│   │   │   │   ├── Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation.pdf
│   │   │   │   ├── 3D Face Reconstruction with the Geometric Guidance of Facial Part Segmentation.pdf
│   │   │   │   ├── Scaling Up Dynamic Human-Scene Interaction Modeling.pdf
│   │   │   │   ├── General Object Foundation Model for Images and Videos at Scale.pdf
│   │   │   │   ├── Putting the Object Back into Video Object Segmentation.pdf
│   │   │   │   ├── Time-, Memory- and Parameter-Efficient Visual Adaptation.pdf
│   │   │   │   ├── Towards Robust Event-guided Low-Light Image Enhancement  A Large-Scale Real-World Event-Image Dataset and Novel Approach.pdf
│   │   │   │   ├── GAvatar Animatable 3D Gaussian Avatars with Implicit Mesh Learning.pdf
│   │   │   │   ├── EAGLE  Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation.pdf
│   │   │   │   ├── Point Transformer V3 Simpler, Faster, Stronger.pdf
│   │   │   │   ├── CADTalk An Algorithm and Benchmark for Semantic Commenting of CAD Programs.pdf
│   │   │   │   ├── Steerers  A framework for rotation equivariant keypoint descriptors.pdf
│   │   │   │   ├── PhysGaussian  Physics-Integrated 3D Gaussians for Generative Dynamics.pdf
│   │   │   │   ├── Specularity Factorization for Low-Light Enhancement.pdf
│   │   │   │   ├── Objects as volumes  A stochastic geometry view of opaque solids.pdf
│   │   │   │   ├── LeGO  Leveraging a Surface Deformation Network for Animatable Stylized Face Generation with One Example.pdf
│   │   │   │   ├── Semantic-aware SAM for Point-Prompted Instance Segmentation.pdf
│   │   │   │   ├── Restoration by Generation with Constrained Priors.pdf
│   │   │   │   ├── Multi-view Aggregation Network for Dichotomous Image Segmentation.pdf
│   │   │   │   ├── Fantastic Animals and Where to Find Them Segment Any Marine Animal with Dual SAM.pdf
│   │   │   │   ├── From Activation to Initialization  Scaling Insights for Optimizing Neural Fields.pdf
│   │   │   │   ├── Self-Supervised Dual Contouring.pdf
│   │   │   │   ├── NRDF Neural Riemannian Distance Fields for Learning Articulated Pose Priors.pdf
│   │   │   │   ├── Matching 2D Images in 3D  Metric Relative Pose from Metric Correspondences.pdf
│   │   │   │   ├── Eclipse Disambiguating Illumination and Materials using Unintended Shadows.pdf
│   │   │   ├── 2 CVPR'24 最佳论文提名（更新完毕）
│   │   │   │   ├── 2 开源代码
│   │   │   │   │   ├── spider-match-main.zip
│   │   │   │   │   ├── PlatoNeRF-main.zip
│   │   │   │   │   ├── Registration-CorrMLP-master.zip
│   │   │   │   │   ├── pixelsplat-main.zip
│   │   │   │   │   ├── PaSCo-main.zip
│   │   │   │   │   ├── NVlabs-edm2-main.zip
│   │   │   │   │   ├── NeRF-HuGS-master.zip
│   │   │   │   │   ├── MMMU-main.zip
│   │   │   │   │   ├── Marigold-main.zip
│   │   │   │   │   ├── MemSAM-main.zip
│   │   │   │   │   ├── mip-splatting-main.zip
│   │   │   │   │   ├── lambda_vit-main mlp.zip
│   │   │   │   │   ├── MapUncertaintyPrediction-main.zip
│   │   │   │   │   ├── egtr-main.zip
│   │   │   │   │   ├── bioclip-main.zip
│   │   │   │   ├── 1 提名论文
│   │   │   │   │   ├── 9 Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation.pdf
│   │   │   │   │   ├── 8 PlatoNeRF 3D Reconstruction in Plato’s Cave via Single-View Two-Bounce Lidar.pdf
│   │   │   │   │   ├── 6 Producing and Leveraging Online Map Uncertainty in Trajectory Prediction.pdf
│   │   │   │   │   ├── 7 PaSCo：Urban 3D Panoptic Scene Completion with Uncertainty Awareness.pdf
│   │   │   │   │   ├── 5 Correlation-aware Coarse-to-fine MLPs for Deformable Medical Image Registration.pdf
│   │   │   │   │   ├── 4 MMMU  A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI.pdf
│   │   │   │   │   ├── 3 Comparing the Decision-Making Mechanisms by Transformers and CNNs.pdf
│   │   │   │   │   ├── 2 Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation.pdf
│   │   │   │   │   ├── 19 EGTR：Extracting Graph from Transformer for Scene Graph Generation.pdf
│   │   │   │   │   ├── 18 Analyzing and Improving the Training Dynamics of Diffusion Models.pdf
│   │   │   │   │   ├── 17 Generative Image Dynamics.pdf
│   │   │   │   │   ├── 16 MLPCanBeAGoodTransformer Learner.pdf
│   │   │   │   │   ├── 14 Mip-Splatting：Alias-free 3D Gaussian Splatting.pdf
│   │   │   │   │   ├── 15 pixelSplat. 3D Gaussian Splats from lmage Pairs for Scalable Generalizable 3D Reconstruction.pdf
│   │   │   │   │   ├── 13 NeRF-HuGS： Improved Neural Radiance Fields in Non-static Scenes.pdf
│   │   │   │   │   ├── 12 Grounding and Enhancing Grid-based Models for Neural Fields.pdf
│   │   │   │   │   ├── 11 BIOCLIP：A Vision Foundation Model for the Tree of Life.pdf
│   │   │   │   │   ├── 10 Rich Human Feedback for Text-to-Image Generation.pdf
│   │   │   │   │   ├── 1 Objects as volumes： A stochastic geometry view of opaque solids.pdf
│   ├── 小黄搞AI大模型面试目录
│   │   ├── 小黄搞AI_大模型面试100问（PDF更新至90）.pdf
│   │   ├── 小黄搞AI_大模型面试100问（PDF更新至74）.pdf
│   │   ├── 小黄搞AI_大模型面试100问（PDF更新至107）.pdf
│   ├── 大模型书籍
│   │   ├── Mastering Transformers_ Build state-of-the-art models from -- .pdf
│   │   ├── 预训练语言模型 2021 (邵浩 刘一烽) .pdf
│   │   ├── BERT基础教程：Transformer大模型实战   (苏达哈尔桑·拉维昌迪兰) .azw3
│   │   ├── 自然语言处理：基于预训练模型的方法_2021.pdf
│   │   ├── 精通Transformer：从零开始构建最先进的NLP模型_2023.epub
│   │   ├── Mastering NLP from Foundations to LLMs_ Apply advanced.pdf
│   │   ├── Building LLM Apps Create Intelligent Apps and Agents with Large Language Models_2024 .pdf
│   │   ├── 大规模语言模型：从理论到实践_2023.pdf
│   │   ├── 大语言模型_2024.pdf
│   │   ├── HuggingFace自然语言处理详解：基于BERT中文模型的任务实战.epub
│   │   ├── 大语言模型：基础与前沿_2024.epub
│   │   ├── 面向开发者的 LLM 入门课.pdf
│   │   ├── Transformer, BERT, and GPT：Including ChatGPT and Prompt Engineering_2024.pdf
│   │   ├── 大语言模型：基础与前沿_2024.pdf
│   │   ├── Transformers for Natural Language Processing Build, train, and fine-tune deep neural network architectures for NLP with... (--).pdf
│   │   ├── 扩散模型从原理到实战.epub
│   │   ├── Natural Language Processing with Transformers Building Language Applications with Hugging Face.pdf
│   │   ├── 中国人工智能系列白皮书——大模型技术（2023 版）.pdf
│   │   ├── Transformers in Action (MEAP v7) _2024 .pdf
│   │   ├── Transformers生成式AI实用指南（提前发售 GPT双语） _2023 .epub
│   │   ├── 自然语言处理：原理、方法与应用.zip
│   │   ├── HuggingFace自然语言处理详解：基于BERT中文模型的任务实战.pdf
│   │   ├── Mastering Large Language Models Advanced techniques, applications, cutting-edge methods, and top LLMs_2024 .pdf
│   │   ├── 自然语言处理导论 2023 张奇.pdf
│   │   ├── Modern Generative AI with ChatGPT and OpenAI Models.pdf
│   │   ├── Generative AI with LangChain_ Build large language model.pdf
│   │   ├── BERT基础教程：Transformer大模型实战_2023.zip
│   │   ├── 精通Transformer：从零开始构建最先进的NLP模型_2023.pdf
│   │   ├── Getting Started with Google BERT_ Build and train .pdf
│   │   ├── 自然语言处理：原理、方法与应用 2023 (王志立  雷鹏斌  吴宇凡) .epub
│   │   ├── LLM Prompt Engineering For Developers  The Art and Science of Unlocking LLMs True Potential_2024 .epub
│   │   ├── Mastering Large Language Models Advanced techniques, applications, cutting-edge methods, and top LLMs_2024 .epub
│   │   ├── Transformer自然语言处理实战：使用Hugging-Face-Transformers库构建NLP应用_2024.pdf
│   ├── 面试八股文
│   │   ├── 大模型校招面试题.pdf
│   │   ├── LLMs大模型面试问题和答案（97）.pdf
│   │   ├── 大模型常见面试题及解答1.pdf
│   │   ├── 大模型 LLM 最全八股和答案.pdf
│   │   ├── AI大模型面试题(102).pdf
│   │   ├── 大模型岗位面试全纪录.pdf
│   │   ├── 大模型常考面试题总结（含答案）.pdf
│   │   ├── 大模型常见面试题及解答2.pdf
│   │   ├── 大模型LLMS.pdf
│   │   ├── 从零开始大模型开发与微调基于PyTorch与ChatGLM.pdf
│   │   ├── 大模型常见面试题3.pdf
│   │   ├── 大模型落地应用案例集.pdf
│   ├── 大模型面试题
│   │   ├── 大模型（LLMs）参数高效微调(PEFT)面
│   │   │   ├── 适配器微调（Adapter-tuning）篇.pdf
│   │   │   ├── LoRA篇.pdf
│   │   │   ├── 参数高效微调篇PRFT.pdf
│   │   │   ├── 提示学习（Prompting）篇.pdf
│   │   ├── 大模型（LLMs）langchain面
│   │   │   ├── 基于LLM+向量库的文档对话经验面.pdf
│   │   │   ├── 大模型（LLMs）langchain面.pdf
│   │   ├── 31-LLM-Interview-Plus
│   │   │   ├── 大模型（LLMs）推理加速篇.pdf
│   │   │   ├── 大模型（LLMs）Tokenizer篇.pdf
│   │   │   ├── 多模态常见面试题.pdf
│   │   │   ├── 大模型校招面试题.pdf
│   │   │   ├── 大模型（LLMs）面试题答案Plus.pdf
│   │   │   ├── 大模型（LLMs）蒸馏面.pdf
│   │   │   ├── 大模型（LLMs）幻觉面.pdf
│   │   │   ├── 大模型（LLMs）分布式训练面.pdf
│   │   │   ├── 大模型（LLMs）显存问题面.pdf
│   │   │   ├── 大模型 RAG 检索增强生成面.pdf
│   │   │   ├── 大模型（LLMs）增量预训练篇.pdf
│   │   ├── 大模型（LLMs）强化学习—— PPO 面.pdf
│   │   ├── 大模型（LLMs）基础面.pdf
│   │   ├── 大模型（LLMs）强化学习——RLHF及其变种面.pdf
│   │   ├── 大模型（LLMs）训练集面.pdf
│   │   ├── 大模型（LLMs）进阶面.pdf
│   │   ├── 大模型（LLMs）评测面.pdf
│   │   ├── 大模型（LLMs）agent 面.pdf
│   │   ├── 大模型（LLMs）推理面.pdf
│   │   ├── 大模型（LLMs）幻觉面.pdf
│   │   ├── 大模型（LLMs）微调面.pdf