_ | reinforcement | _ |
3DCNN-DQN-RNN: A Deep | reinforcement | Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds |
3M-RL: Multi-Resolution, Multi-Agent, Mean-Field | reinforcement | Learning for Autonomous UAV Routing |
A2-RL: Aesthetics Aware | reinforcement | Learning for Image Cropping |
Abnormal Behavior Recognition for Human Motion Based on Improved Deep | reinforcement | Learning |
Action Parsing-Driven Video Summarization Based on | reinforcement | Learning |
Action Recognition Using Visual Attention with | reinforcement | Learning |
Action-Decision Networks for Visual Tracking with Deep | reinforcement | Learning |
ActionSpotter: Deep | reinforcement | Learning Framework for Temporal Action Spotting in Videos |
Active Action Proposal Method Based on | reinforcement | Learning, An |
Active Object Localization with Deep | reinforcement | Learning |
AdaConfigure: | reinforcement | Learning-Based Adaptive Configuration for Video Analytics Services |
AdaPool: A Diurnal-Adaptive Fleet Management Framework Using Model-Free Deep | reinforcement | Learning and Change Point Detection |
Adaptive cooperative exploration for | reinforcement | learning from imperfect demonstrations |
Adaptive Data Collection and Offloading in Multi-UAV-Assisted Maritime IoT Systems: A Deep | reinforcement | Learning Approach |
Adaptive Deep | reinforcement | Learning-Based In-Loop Filter for VVC |
Adaptive Fusion by | reinforcement | Learning for Distributed Detection Systems |
Adaptive Metro Service Schedule and Train Composition With a Proximal Policy Optimization Approach Based on Deep | reinforcement | Learning |
Adaptive Neural Network Control of AUVs With Control Input Nonlinearities Using | reinforcement | Learning |
Adaptive Partial | reinforcement | Learning Neural Network-Based Tracking Control for Wheeled Mobile Robotic Systems |
Adaptive Road Configurations for Improved Autonomous Vehicle-Pedestrian Interactions Using | reinforcement | Learning |
Adaptive ROI generation for video object segmentation using | reinforcement | learning |
Adaptive Safe | reinforcement | Learning With Full-State Constraints and Constrained Adaptation for Autonomous Vehicles |
Adaptive Spotting: Deep | reinforcement | Object Search in 3d Point Clouds |
Adaptive Streaming of 360-Degree Videos with | reinforcement | Learning |
Adaptive Target Recognition Using | reinforcement | Learning |
Adaptive Traffic Light Control With Deep | reinforcement | Learning: An Evaluation of Traffic Flow and Energy Consumption |
Adaptive traffic signal control system using composite reward architecture based deep | reinforcement | learning |
Adaptive Traffic Signal Control With Deep | reinforcement | Learning and High Dimensional Sensory Inputs: Case Study and Comprehensive Sensitivity Analyses |
Adjacency Constraint for Efficient Hierarchical | reinforcement | Learning |
Advanced Deep Network with Attention and Genetic-Driven | reinforcement | Learning Layer for an Efficient Cancer Treatment Outcome Prediction |
Adversarial approach to domain adaptation for | reinforcement | learning on dialog systems |
Adversarial | reinforcement | Learning for Unsupervised Domain Adaptation |
Adversarial | reinforcement | Learning With Object-Scene Relational Graph for Video Captioning |
Aerial Image Dehazing Using | reinforcement | Learning |
Aesthetic Photo Collage With Deep | reinforcement | Learning |
AgentI2P: Optimizing Image-to-Point Cloud Registration via Behaviour Cloning and | reinforcement | Learning |
Aircraft Detection Framework Based on | reinforcement | Learning and Convolutional Neural Networks in Remote Sensing Images, An |
Altruistic cooperative adaptive cruise control of mixed traffic platoon based on deep | reinforcement | learning |
Anomaly Detection and Correction of Optimizing Autonomous Systems With Inverse | reinforcement | Learning |
Application of Relaxation to Edge | reinforcement | , An |
approach to the design of | reinforcement | functions in real world, agent-based applications, An |
Approach to Tune Fuzzy Controllers Based on | reinforcement | Learning for Autonomous Vehicle Control, An |
Artist Agent: A | reinforcement | Learning Approach to Automatic Stroke Generation in Oriental Ink Painting |
Assessing Transferability From Simulation to Reality for | reinforcement | Learning |
Asynchronous Deep | reinforcement | Learning for Collaborative Task Computing and On-Demand Resource Allocation in Vehicular Edge Computing |
Asynchronous Federated Deep | reinforcement | Learning-Based URLLC-Aware Computation Offloading in Space-Assisted Vehicular Networks |
Asynchronous Multithreading | reinforcement | -Learning-Based Path Planning and Tracking for Unmanned Underwater Vehicle |
Attention control with | reinforcement | learning for face recognition under partial occlusion |
Attention-Aware Deep | reinforcement | Learning for Video Face Recognition |
Attention-Aware Face Hallucination via Deep | reinforcement | Learning |
Attention-Based Deep | reinforcement | Learning for Virtual Cinematography of 360° Videos |
Auto uning of price prediction models for high-frequency trading via | reinforcement | learning |
Auto-Driving Policies in Highway based on Distributional Deep | reinforcement | Learning |
AutoCoMet: Smart Neural Architecture Search via Co-Regulated Shaping | reinforcement | |
Automated aerial suspended cargo delivery through | reinforcement | learning |
Automatic Face Aging in Videos via Deep | reinforcement | Learning |
Automatic generation of optimal road trajectory for the rescue vehicle in case of emergency on mountain freeway using | reinforcement | learning approach |
Automatic Itinerary Planning Using Triple-Agent Deep | reinforcement | Learning |
Autonomous Generation of Service Strategy for Household Tasks: A Progressive Learning Method With A Priori Knowledge and | reinforcement | Learning |
Autonomous Planetary Landing via Deep | reinforcement | Learning and Transfer Learning |
Autonomous Vehicle Cut-In Algorithm for Lane-Merging Scenarios via Policy-Based | reinforcement | Learning Nested Within Finite-State Machine |
Avalanche RL: A Continual | reinforcement | Learning Library |
AVD-Net: Attention Value Decomposition Network For Deep Multi-Agent | reinforcement | Learning |
Batch | reinforcement | Learning With a Nonparametric Off-Policy Policy Gradient |
Bayesian Approach to | reinforcement | Learning of Vision-Based Vehicular Control, A |
Bayesian Nonparametric Methods for Partially-Observable | reinforcement | Learning |
Beyond Greedy Search: Tracking by Multi-Agent | reinforcement | Learning-Based Beam Search |
BiPR-RL: Portrait relighting via bi-directional consistent deep | reinforcement | learning |
Blind decision making: | reinforcement | learning with delayed observations |
Blockchain and Deep | reinforcement | Learning Empowered Spatial Crowdsourcing in Software-Defined Internet of Vehicles |
Blockchain-Integrated Multiagent Deep | reinforcement | Learning for Securing Cooperative Adaptive Cruise Control |
Boundary-Aware Supervoxel-Level Iteratively Refined Interactive 3D Image Segmentation With Multi-Agent | reinforcement | Learning |
Can | reinforcement | Learning Lead to Healthy Life?: Simulation Study Based on User Activity Logs |
Can We Learn Heuristics for Graphical Model Inference Using | reinforcement | Learning? |
Catch: Context-based Meta | reinforcement | Learning for Transferrable Architecture Search |
Causal Meta- | reinforcement | Learning for Multimodal Remote Sensing Data Classification |
Challenges and Opportunities of Applying | reinforcement | Learning to Autonomous Racing |
Channel Pruning via Lookahead Search Guided | reinforcement | Learning |
Characteristic Views Extraction Modal Based-on Deep | reinforcement | Learning for 3D Model Retrieval |
CIRL: Controllable Imitative | reinforcement | Learning for Vision-Based Self-driving |
Class-wise Attention | reinforcement | for Semi-supervised Meta-Learning |
CLASTER: Clustering with | reinforcement | Learning for Zero-Shot Action Recognition |
Close-Loop Object Recognition Using | reinforcement | Learning |
Cloud Detection of SuperView-1 Remote Sensing Images Based on Genetic | reinforcement | Learning |
Clustering experience replay for the effective exploitation in | reinforcement | learning |
Co-reranking by mutual | reinforcement | for image search |
Co-speech Gesture Synthesis by | reinforcement | Learning with Contrastive Pretrained Rewards |
Collaborative Deep | reinforcement | Learning for Joint Object Search |
Collaborative Deep | reinforcement | Learning for Multi-object Tracking |
Collision Anticipation via Deep | reinforcement | Learning for Visual Navigation |
Color Feature | reinforcement | for Cosaliency Detection Without Single Saliency Residuals |
Combining Decision Making and Trajectory Planning for Lane Changing Using Deep | reinforcement | Learning |
Combining | reinforcement | Learning and Belief Revision: A Learning System for Active Vision |
Combining Semantic Guidance and Deep | reinforcement | Learning For Generating Human Level Paintings |
Complete | reinforcement | -Learning-Based Framework for Urban-Safety Perception, A |
Comprehensive Ocean Information-Enabled AUV Motion Planning Based on | reinforcement | Learning |
Computation Offloading and Resource Allocation in MEC-Enabled Integrated Aerial-Terrestrial Vehicular Networks: A | reinforcement | Learning Approach |
Computing Offloading With Fairness Guarantee: A Deep | reinforcement | Learning Method |
Computing on Wheels: A Deep | reinforcement | Learning-Based Approach |
Conditional Predictive Behavior Planning With Inverse | reinforcement | Learning for Human-Like Autonomous Driving |
Confidence-Aware | reinforcement | Learning for Self-Driving Cars |
Consensus-Agent Deep | reinforcement | Learning for Face Aging |
Constrained Policy Optimization Algorithm for Autonomous Driving via | reinforcement | Learning |
Context-Aware Taxi Dispatching at City-Scale Using Deep | reinforcement | Learning |
Continual | reinforcement | Learning in 3D Non-stationary Environments |
Continuous Action | reinforcement | Learning From a Mixture of Interpretable Experts |
Continuous Sign Language Recognition via | reinforcement | Learning |
Control Double Inverted Pendulum by | reinforcement | Learning with Double CMAC Network |
Cooperative Adaptive Cruise Control: A | reinforcement | Learning Approach |
Cooperative behavior acquisition for mobile robots in dynamically changing real worlds via vision-based | reinforcement | learning and development |
Coordinated Control of Urban Expressway Integrating Adjacent Signalized Intersections Using Adversarial Network Based | reinforcement | Learning Method |
Coordinated multi-agent hierarchical deep | reinforcement | learning to solve multi-trip vehicle routing problems with soft time windows |
Coordination Control Strategy for Human-Machine Cooperative Steering of Intelligent Vehicles: A | reinforcement | Learning Approach |
Correlation Filter Selection for Visual Tracking Using | reinforcement | Learning |
CoTV: Cooperative Control for Traffic Light Signals and Connected Autonomous Vehicles Using Deep | reinforcement | Learning |
Crack Detection and Refinement Via Deep | reinforcement | Learning |
Crafting a Toolchain for Image Restoration by Deep | reinforcement | Learning |
Cross-Task Multimodal | reinforcement | for Long Tail Next POI Recommendation |
cuRL: A Generic Framework for Bi-Criteria Optimum Path-Finding Based on Deep | reinforcement | Learning |
Curriculum-Based Asymmetric Multi-Task | reinforcement | Learning |
Data Freshness and Energy-Efficient UAV Navigation Optimization: A Deep | reinforcement | Learning Approach |
Data-Efficient Deep | reinforcement | Learning with Symmetric Consistency |
DDoS Mitigation Based on Space-Time Flow Regularities in IoV: A Feature Adaption | reinforcement | Learning Approach |
DECORE: Deep Compression with | reinforcement | Learning |
Deep Inverse | reinforcement | Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion |
Deep Multi-Agent | reinforcement | Learning for Highway On-Ramp Merging in Mixed Traffic |
Deep Progressive | reinforcement | Learning for Skeleton-Based Action Recognition |
Deep Q-CapsNet | reinforcement | Learning Framework for Intrauterine Cavity Segmentation in TTTS Fetal Surgery Planning |
Deep | reinforcement | Active Learning for Human-in-the-Loop Person Re-Identification |
Deep | reinforcement | Clustering |
Deep | reinforcement | hashing with redundancy elimination for effective image retrieval |
Deep | reinforcement | Image Matching with Self-Termination |
Deep | reinforcement | Learning and NOMA-Based Multi-Objective RIS-Assisted IS-UAV-TNs: Trajectory Optimization and Beamforming Design |
Deep | reinforcement | Learning Approach for Airport Departure Metering Under Spatial-Temporal Airside Interactions, A |
Deep | reinforcement | Learning Approach for Emergency Response Management |
Deep | reinforcement | Learning Approach for V2X Managed Intersections of Connected Vehicles |
deep | reinforcement | learning approach to character segmentation of license plate images, A |
Deep | reinforcement | Learning Approach to Multiple Streams' Joint Bitrate Allocation, A |
Deep | reinforcement | Learning Approach to Traffic Signal Control With Temporal Traffic Pattern Mining, A |
Deep | reinforcement | Learning Assisted Beam Tracking and Data Transmission for 5G V2X Networks |
Deep | reinforcement | learning based active safety control for distributed drive electric vehicles |
Deep | reinforcement | learning based conflict detection and resolution in air traffic control |
Deep | reinforcement | Learning Based Freshness-Aware Path Planning for UAV-Assisted Edge Computing Networks with Device Mobility |
Deep | reinforcement | Learning Based Real-Time Solution Policy for the Traveling Salesman Problem, A |
Deep | reinforcement | learning collision avoidance using policy gradient optimisation and Q-learning |
Deep | reinforcement | Learning Designed Shinnar-Le Roux RF Pulse Using Root-Flipping: DeepRFSLR |
Deep | reinforcement | Learning for Automatic Thumbnail Generation |
Deep | reinforcement | Learning for Autonomous Driving by Transferring Visual Features |
Deep | reinforcement | Learning for Autonomous Driving: A Survey |
Deep | reinforcement | Learning for Computation and Communication Resource Allocation in Multiaccess MEC Assisted Railway IoT Networks |
Deep | reinforcement | Learning for Event-Driven Multi-Agent Decision Processes |
Deep | reinforcement | Learning for Exact Combinatorial Optimization: Learning to Branch |
Deep | reinforcement | Learning for Flipper Control of Tracked Robots in Urban Rescuing Environments |
Deep | reinforcement | Learning for Image Hashing |
Deep | reinforcement | Learning for Intelligent Transportation Systems: A Survey |
Deep | reinforcement | Learning for Playing 2.5D Fighting Games |
Deep | reinforcement | Learning for Semisupervised Hyperspectral Band Selection |
Deep | reinforcement | Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem |
Deep | reinforcement | Learning for the Electric Vehicle Routing Problem With Time Windows |
Deep | reinforcement | Learning for Video Prediction |
Deep | reinforcement | Learning for Weak Human Activity Localization |
Deep | reinforcement | Learning Method For Multimodal Data Fusion in Action Recognition, A |
Deep | reinforcement | Learning of Region Proposal Networks for Object Detection |
Deep | reinforcement | Learning of Volume-Guided Progressive View Inpainting for 3D Point Scene Completion From a Single Depth Image |
Deep | reinforcement | Learning on a Budget: 3D Control and Reasoning Without a Supercomputer |
Deep | reinforcement | Learning Two-Way Transit Signal Priority Algorithm for Optimizing Headway Adherence and Speed |
Deep | reinforcement | learning with credit assignment for combinatorial optimization |
Deep | reinforcement | Learning With Graph Representation for Vehicle Repositioning |
Deep | reinforcement | Learning with Iterative Shift for Visual Tracking |
Deep | reinforcement | Learning-Based Adaptive Modulation for Underwater Acoustic Communication with Outdated Channel State Information |
Deep | reinforcement | Learning-Based Image Captioning with Embedding Reward |
Deep | reinforcement | learning-based patch selection for illuminant estimation |
Deep | reinforcement | Learning-Based Resource Management Game in Vehicular Edge Computing, A |
Deep | reinforcement | Learning-Based Traffic Light Scheduling Framework for SDN-Enabled Smart Transportation System |
Deep | reinforcement | Learning: A Brief Survey |
Deep | reinforcement | Polishing Network for Video Captioning |
Deep | reinforcement | -learning-based driving policy for autonomous road vehicles |
Deep Variation-Structured | reinforcement | Learning for Visual Relationship and Attribute Detection |
Deep- | reinforcement | -Learning-Based Energy Management Strategy for Supercapacitor Energy Storage Systems in Urban Rail Transit |
DeepPool: Distributed Model-Free Algorithm for Ride-Sharing Using Deep | reinforcement | Learning |
Delay-Aware Content Delivery With Deep | reinforcement | Learning in Internet of Vehicles |
Delayed | reinforcement | Learning for Closed-Loop Object Recognition |
Design of Safe Optimal Guidance With Obstacle Avoidance Using Control Barrier Function-Based Actor-Critic | reinforcement | Learning |
Detecting and adapting to crisis pattern with context based Deep | reinforcement | Learning |
Detecting State of Charge False Reporting Attacks via | reinforcement | Learning Approach |
Development of an Efficient Driving Strategy for Connected and Automated Vehicles at Signalized Intersections: A | reinforcement | Learning Approach |
Diagnostics of | reinforcement | Conditions in Concrete Structures by GPR, Impact-Echo Method and Metal Magnetic Memory Method |
Differentiable Logic Policy for Interpretable Deep | reinforcement | Learning: A Study From an Optimization Perspective |
Digital Twin for Transportation Big Data: A | reinforcement | Learning-Based Network Traffic Prediction Approach |
Discrete space | reinforcement | learning algorithm based on support vector machine classification |
Discrete space | reinforcement | learning algorithm based on twin support vector machine classification |
Discriminative sampling via deep | reinforcement | learning for kinship verification |
DISeR: Designing Imaging Systems with | reinforcement | Learning |
Distort-and-Recover: Color Enhancement Using Deep | reinforcement | Learning |
Distributed Model-Free Algorithm for Multi-Hop Ride-Sharing Using Deep | reinforcement | Learning, A |
Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching Using Deep | reinforcement | Learning, A |
Distributed Multi-Agent | reinforcement | Learning With Graph Decomposition Approach for Large-Scale Adaptive Traffic Signal Control, A |
Distributed predictive cruise control based on | reinforcement | learning and validation on microscopic traffic simulation |
Distributed Signal Control of Arterial Corridors Using Multi-Agent Deep | reinforcement | Learning |
Double branch synergies with modal | reinforcement | for weakly supervised temporal action detection |
Drift-Proof Tracking With Deep | reinforcement | Learning |
Driving Behavior Modeling Using Naturalistic Human Driving Data With Inverse | reinforcement | Learning |
Driving policies of V2X autonomous vehicles based on | reinforcement | learning methods |
Driving-Behavior-Aware Optimal Energy Management Strategy for Multi-Source Fuel Cell Hybrid Electric Vehicles Based on Adaptive Soft Deep- | reinforcement | Learning |
DRLE: Decentralized | reinforcement | Learning at the Edge for Traffic Light Control in the IoV |
DSORL: Data Source Optimization With | reinforcement | Learning Scheme for Vehicular Named Data Networks |
Dual | reinforcement | Learning Framework for Weakly Supervised Phrase Grounding, A |
Dual-Agent Deep | reinforcement | Learning for Deformable Face Tracking |
Dynamic Deep | reinforcement | Learning-Bayesian Framework for Anomaly Detection, A |
Dynamic Edge Computation Offloading for Internet of Vehicles With Deep | reinforcement | Learning |
Dynamic Face Video Segmentation via | reinforcement | Learning |
Dynamic Pricing for Differentiated PEV Charging Services Using Deep | reinforcement | Learning |
Dynamic speed harmonization for mixed traffic flow on the freeway using deep | reinforcement | learning |
Dynamic traffic signal control using mean field multi-agent | reinforcement | learning in large scale road-networks |
Dynamical Hyperparameter Optimization via Deep | reinforcement | Learning in Tracking |
EarlGAN: An enhanced actor-critic | reinforcement | learning agent-driven GAN for de novo drug design |
Early Action Recognition With Category Exclusion Using Policy-Based | reinforcement | Learning |
Edge | reinforcement | Using Parameterized Relaxation Labeling |
Effect of Multi-step Methods on Overestimation in Deep | reinforcement | Learning, The |
Effective Charging Planning Based on Deep | reinforcement | Learning for Electric Vehicles |
Efficient and Private Scheduling of Wireless Electric Vehicles Charging Using | reinforcement | Learning |
Efficient Halftoning via Deep | reinforcement | Learning |
Efficient Human Activity Classification from Egocentric Videos Incorporating Actor-Critic | reinforcement | Learning |
Efficient Object Detection in Large Images Using Deep | reinforcement | Learning |
Emotion Attention-Aware Collaborative Deep | reinforcement | Learning for Image Cropping |
Emotion Detection for Conversations Based on | reinforcement | Learning Framework |
Emotional Contagion-Aware Deep | reinforcement | Learning for Antagonistic Crowd Simulation |
End-to-End Active Object Tracking and Its Real-World Deployment via | reinforcement | Learning |
End-to-End Driving in a Realistic Racing Game with Deep | reinforcement | Learning |
End-to-End Model-Free | reinforcement | Learning for Urban Driving Using Implicit Affordances |
End-to-End Urban Driving by Imitating a | reinforcement | Learning Coach |
End-to-End Video Captioning With Multitask | reinforcement | Learning |
Enhanced Bayesian Compression via Deep | reinforcement | Learning |
Enhancing Representation Learning With Spatial Transformation and Early Convolution for | reinforcement | Learning-Based Small Object Detection |
Enhancing Transferability of Deep | reinforcement | Learning-Based Variable Speed Limit Control Using Transfer Learning |
Ensemble Quantile Networks: Uncertainty-Aware | reinforcement | Learning With Applications in Autonomous Driving |
Environment Agnostic Representation for Visual | reinforcement | learning |
Environment Upgrade | reinforcement | Learning for Non-differentiable Multi-stage Pipelines |
Erlang planning network: An iterative model-based | reinforcement | learning with multi-perspective |
Error Bounds of Imitating Policies and Environments for | reinforcement | Learning |
Experience-Driven Power Allocation Using Multi-Agent Deep | reinforcement | Learning for Millimeter-Wave High-Speed Railway Systems |
Expert Level Control of Ramp Metering Based on Multi-Task Deep | reinforcement | Learning |
Expert Level Control of Ramp Metering Based on Multi-Task Deep | reinforcement | Learning |
Face Hallucination by Attentive Sequence Optimization with | reinforcement | Learning |
Face recognition using | reinforcement | learning |
Factor Selection for | reinforcement | Learning in HTTP Adaptive Streaming |
Fair Loss: Margin-Aware | reinforcement | Learning for Deep Face Recognition |
False Correlation Reduction for Offline | reinforcement | Learning |
Fast A3RL: Aesthetics-Aware Adversarial | reinforcement | Learning for Image Cropping |
Fast and low-complexity | reinforcement | learning for delay-sensitive energy harvesting wireless visual sensing systems |
Fast and Robust Algorithm with | reinforcement | Learning for Large UAV Cluster Mission Planning, A |
Fear-Neuro-Inspired | reinforcement | Learning for Safe Autonomous Driving |
Federated Deep | reinforcement | Learning-Based Spectrum Access Algorithm With Warranty Contract in Intelligent Transportation Systems |
FFNet: Video Fast-Forwarding via | reinforcement | Learning |
First-Person Activity Forecasting from Video with Online Inverse | reinforcement | Learning |
First-Person Activity Forecasting with Online Inverse | reinforcement | Learning |
Fleet Rebalancing for Expanding Shared e-Mobility Systems: A Multi-Agent Deep | reinforcement | Learning Approach |
FlexPool: A Distributed Model-Free Deep | reinforcement | Learning Algorithm for Joint Passengers and Goods Transportation |
Focus on Scene Text Using Deep | reinforcement | Learning |
Forming Adversarial Example Attacks Against Deep Neural Networks With | reinforcement | Learning |
Frame-part-activated deep | reinforcement | learning for Action Prediction |
Frequency Agile Anti-Interference Technology Based on | reinforcement | Learning Using Long Short-Term Memory and Multi-Layer Historical Information Observation |
From Rocks to Walls: a Model-free | reinforcement | Learning Approach to Dry Stacking with Irregular Rocks |
Frustratingly Easy Regularization on Representation Can Boost Deep | reinforcement | Learning |
Fuel-Efficient Switching Control for Platooning Systems With Deep | reinforcement | Learning |
Fusing Pre-Trained Language Models with Multimodal Prompts through | reinforcement | Learning |
Fusion of Multiple Behaviors Using Layered | reinforcement | Learning |
Fuzzy Inference Enabled Deep | reinforcement | Learning-Based Traffic Light Control for Intelligent Transportation System |
GaDQN-IDS: A Novel Self-Adaptive IDS for VANETs Based on Bayesian Game Theory and Deep | reinforcement | Learning |
GAIT: Generating Aesthetic Indoor Tours with Deep | reinforcement | Learning |
Galactic: Scaling End-to-End | reinforcement | Learning for Rearrangement at 100k Steps-Per-Second |
GE-DDRL: Graph Embedding and Deep Distributional | reinforcement | Learning for Reliable Shortest Path: A Universal and Scale Free Solution |
General Framework for Context-Specific Image Segmentation Using | reinforcement | Learning, A |
Generative Adversarial Network Enabled Deep Distributional | reinforcement | Learning for Transmission Scheduling in Internet of Vehicles, A |
Generic Markov Decision Process Model and | reinforcement | Learning Method for Scheduling Agile Earth Observation Satellites, A |
Giardino Delle Camelie in The Boboli Monumental Garden: Integrated Survey, Structural | reinforcement | and Restoration Project of The Architecture, The Decorations and The Hydraulic System, The |
Goal-Guided Transformer-Enabled | reinforcement | Learning for Efficient Autonomous Navigation |
Good, Better, Best: Textual Distractors Generation for Multiple-Choice Visual Question Answering via | reinforcement | Learning |
Graph based skill acquisition and transfer Learning for continuous | reinforcement | learning domains |
Graph Relational | reinforcement | Learning for Mobile Robot Navigation in Large-Scale Crowded Environments |
GraphBit: Bitwise Interaction Mining via Deep | reinforcement | Learning |
Group-Membership | reinforcement | for Straight Edges Based on Bayesian Networks |
H-MAS architecture and | reinforcement | learning method for autonomous robot path planning |
Halftoning with Multi-Agent Deep | reinforcement | Learning |
Handling Camera Movement Constraints in | reinforcement | Learning Based Active Object Recognition |
Harmonious Lane Changing via Deep | reinforcement | Learning |
Hash Bit Selection With | reinforcement | Learning for Image Retrieval |
HDRLM3D: A Deep | reinforcement | Learning-Based Model with Human-like Perceptron and Policy for Crowd Evacuation in 3D Environments |
Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep | reinforcement | Learning |
Heterogeneous | reinforcement | Learning Network for Aspect-Based Sentiment Classification With External Knowledge |
Hierarchical Deep | reinforcement | Learning for Self-Powered Monitoring and Communication Integrated System in High-Speed Railway Networks |
Hierarchical Deep | reinforcement | Learning Framework for 6-DOF UCAV Air-to-Air Combat, A |
Hierarchical Defect Detection Based On | reinforcement | Learning |
Hierarchical Framework for Passenger Inflow Control in Metro System With | reinforcement | Learning, A |
Hierarchical Motion Planning and Tracking for Autonomous Vehicles Using Global Heuristic Based Potential Field and | reinforcement | Learning Based Predictive Control |
hierarchical probabilistic underwater image enhancement model with | reinforcement | tuning, A |
Hierarchical Program-Triggered | reinforcement | Learning Agents for Automated Driving |
Hierarchical | reinforcement | Learning Algorithm Based on Attention Mechanism for UAV Autonomous Navigation, A |
Hierarchical | reinforcement | learning for chip-macro placement in integrated circuit |
Hierarchical | reinforcement | learning for self-driving decision-making without reliance on labelled driving data |
Hierarchical Tracking by | reinforcement | Learning-Based Searching and Coarse-to-Fine Verifying |
Highway Exiting Planner for Automated Vehicles Using | reinforcement | Learning |
HMDRL: Hierarchical Mixed Deep | reinforcement | Learning to Balance Vehicle Supply and Demand |
Human-Guided | reinforcement | Learning With Sim-to-Real Transfer for Autonomous Navigation |
Hybrid Autonomous Driving Guidance Strategy Combining Deep | reinforcement | Learning and Expert System |
Hybrid of Deep | reinforcement | Learning and Local Search for the Vehicle Routing Problems, A |
Hybrid | reinforcement | Learning-Based Eco-Driving Strategy for Connected and Automated Vehicles at Signalized Intersections |
Hyperspectral Feature Selection for SOM Prediction Using Deep | reinforcement | Learning and Multiple Subset Evaluation Strategies |
Identify, Estimate and Bound the Uncertainty of | reinforcement | Learning for Autonomous Driving |
Identity-Preserving Face Hallucination via Deep | reinforcement | Learning |
IG-RL: Inductive Graph | reinforcement | Learning for Massive-Scale Traffic Signal Control |
Image Captioning using Adversarial Networks and | reinforcement | Learning |
Image Captioning with | reinforcement | Learning |
Image Understanding With | reinforcement | Learning: Auto-Tuning Image Attributes and Model Parameters for Object Detection and Segmentation |
Imagination-Augmented | reinforcement | Learning Framework for Variable Speed Limit Control |
Improving Generalization in Visual | reinforcement | Learning via Conflict-aware Gradient Agreement Augmentation |
Improving Generalization of | reinforcement | Learning Using a Bilinear Policy Network |
Improving Spatiotemporal Self-supervision by Deep | reinforcement | Learning |
Increasing GPS Localization Accuracy With | reinforcement | Learning |
Information Fusion Approach to Intelligent Traffic Signal Control Using the Joint Methods of Multiagent | reinforcement | Learning and Artificial Intelligence of Things, An |
Information Optimization and Transferable State Abstractions in Deep | reinforcement | Learning |
Integrated Model for Autonomous Speed and Lane Change Decision-Making Based on Deep | reinforcement | Learning, An |
Integrated Traffic Control for Freeway Recurrent Bottleneck Based on Deep | reinforcement | Learning |
Integrating Model Predictive Control With Federated | reinforcement | Learning for Decentralized Energy Management of Fuel Cell Vehicles |
Integrating Relevance Feedback Techniques for Image Retrieval Using | reinforcement | Learning |
Intelligent Energy Management Strategy Based on an Improved | reinforcement | Learning Algorithm With Exploration Factor for a Plug-in PHEV |
Intelligent Mobile Vehicle Navigator Based on Fuzzy Logic and | reinforcement | Learning, An |
Intelligent Parameter Tuning in Optimization-Based Iterative CT Reconstruction via Deep | reinforcement | Learning |
Intelligent Train Operation Algorithms for Subway by Expert System and | reinforcement | Learning |
Intelligent video anomaly detection and classification using faster RCNN with deep | reinforcement | learning model |
Interpretable End-to-End Urban Autonomous Driving With Latent Deep | reinforcement | Learning |
Intersection-Based QoS Routing for Vehicular Ad Hoc Networks With | reinforcement | Learning, An |
Intersection-Based V2X Routing via | reinforcement | Learning in Vehicular Ad Hoc Networks |
IRLAS: Inverse | reinforcement | Learning for Architecture Search |
IRLSOT: Inverse | reinforcement | learning for scene-oriented trajectory prediction |
Iterative Shrinking for Referring Expression Grounding Using Deep | reinforcement | Learning |
Iteratively-Refined Interactive 3D Medical Image Segmentation with Multi-Agent | reinforcement | Learning |
Joint Collaborative Big Spectrum Data Sensing and | reinforcement | Learning Based Dynamic Spectrum Access for Cognitive Internet of Vehicles |
Joint Computing and Caching in 5G-Envisioned Internet of Vehicles: A Deep | reinforcement | Learning-Based Traffic Control System |
Joint | reinforcement | and Contrastive Learning for Unsupervised Video Summarization |
Joint Secure Offloading and Resource Allocation for Vehicular Edge Computing Network: A Multi-Agent Deep | reinforcement | Learning Approach |
Knowledge-Enhanced Causal | reinforcement | Learning Model for Interactive Recommendation |
Language-Driven Temporal Activity Localization: A Semantic Matching | reinforcement | Learning Model |
Large-scale and adaptive service composition based on deep | reinforcement | learning |
Large-Scale Maintenance and Rehabilitation Optimization for Multi-Lane Highway Asphalt Pavement: A | reinforcement | Learning Approach |
Latency-Energy Tradeoff in Connected Autonomous Vehicles: A Deep | reinforcement | Learning Scheme |
Learning Cooperative Visual Dialog Agents with Deep | reinforcement | Learning |
Learning from Learners: Adapting | reinforcement | Learning Agents to be Competitive in a Card Game |
Learning from Longitudinal Face Demonstration: Where Tractable Deep Modeling Meets Inverse | reinforcement | Learning |
Learning to Draw Through A Multi-Stage Environment Model Based | reinforcement | Learning |
Learning to Drive Like Human Beings: A Method Based on Deep | reinforcement | Learning |
Learning to Drive Using Sparse Imitation | reinforcement | Learning |
Learning to Identify Critical States for | reinforcement | Learning from Videos |
Learning to Paint With Model-Based Deep | reinforcement | Learning |
Learning to schedule multi-NUMA virtual machines via | reinforcement | learning |
Learning to Solve 3-D Bin Packing Problem via Deep | reinforcement | Learning and Constraint Programming |
Learning to Solve Multiple-TSP With Time Window and Rejections via Deep | reinforcement | Learning |
Learning to Transfer Learn: | reinforcement | Learning-based Selection for Adaptive Transfer Learning |
Learning to Walk on Low Friction Terrain by | reinforcement | Learning |
Learning When and Where to Zoom With Deep | reinforcement | Learning |
Leveraging Deep | reinforcement | Learning for Reaching Robotic Tasks |
Lightweight 3D hand pose estimation by cascading CNNs with | reinforcement | learning |
Local | reinforcement | Learning for Object Recognition |
Local-Guided Global: Paired Similarity Representation for Visual | reinforcement | Learning |
Look Before You Leap: Bridging Model-Free and Model-Based | reinforcement | Learning for Planned-Ahead Vision-and-Language Navigation |
MARVEL: Raster Gray-Level Manga Vectorization via Primitive-Wise Deep | reinforcement | Learning |
Masked Contrastive Representation Learning for | reinforcement | Learning |
MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse | reinforcement | Learning |
Memory-Based Deep | reinforcement | Learning for Obstacle Avoidance in UAV With Limited Environment Knowledge |
Meta- | reinforcement | Learning in Non-Stationary and Dynamic Environments |
META: A City-Wide Taxi Repositioning Framework Based on Multi-Agent | reinforcement | Learning |
MetaDrive: Composing Diverse Driving Scenarios for Generalizable | reinforcement | Learning |
Metaverse-Based Teaching Building Evacuation Training System With Deep | reinforcement | Learning, A |
Mitigating Bias in Face Recognition Using Skewness-Aware | reinforcement | Learning |
MO-MIX: Multi-Objective Multi-Agent Cooperative Decision-Making With Deep | reinforcement | Learning |
model-based | reinforcement | learning method based on conditional generative adversarial networks, A |
Model-Based | reinforcement | Learning With Isolated Imaginations |
Model-Free Dynamic Operations Management for EV Battery Swapping Stations: A Deep | reinforcement | Learning Approach |
Model-Reference | reinforcement | Learning for Collision-Free Tracking Control of Autonomous Surface Vehicles |
Modeling 3D Shapes by | reinforcement | Learning |
Modeling Crossing Behaviors of E-Bikes at Intersection With Deep Maximum Entropy Inverse | reinforcement | Learning Using Drone-Based Video Data |
Modeling the Effects of Autonomous Vehicles on Human Driver Car-Following Behaviors Using Inverse | reinforcement | Learning |
Modeling the Influences of Cyclic Top-Down and Bottom-Up Processes for | reinforcement | Learning in Eye Movements |
Modified Deep | reinforcement | Learning with Efficient Convolution Feature for Small Target Detection in VHR Remote Sensing Imagery |
Modular and Transferable | reinforcement | Learning Framework for the Fleet Rebalancing Problem, A |
Monocular Camera-Based Complex Obstacle Avoidance via Efficient Deep | reinforcement | Learning |
Motion control of unmanned underwater vehicles via deep imitation | reinforcement | learning algorithm |
MSN: Mapless Short-Range Navigation Based on Time Critical Deep | reinforcement | Learning |
Multi-Agent Deep | reinforcement | Learning for Large-Scale Traffic Signal Control |
Multi-Agent Deep | reinforcement | Learning for Online 3D Human Poses Estimation |
Multi-Agent Deep | reinforcement | Learning Framework Strategized by Unmanned Aerial Vehicles for Multi-Vessel Full Communication Connection |
Multi-Agent Mix Hierarchical Deep | reinforcement | Learning for Large-Scale Fleet Management |
Multi-Agent | reinforcement | Learning Based Frame Sampling for Effective Untrimmed Video Recognition |
Multi-Agent | reinforcement | Learning for Intelligent V2G Integration in Future Transportation Systems |
Multi-Agent | reinforcement | Learning for Slicing Resource Allocation in Vehicular Networks |
Multi-Agent | reinforcement | Learning Method With Route Recorders for Vehicle Routing in Supply Chain Management, A |
Multi-Agent | reinforcement | Learning With Policy Clipping and Average Evaluation for UAV-Assisted Communication Markov Game |
Multi-Agent Transfer | reinforcement | Learning With Multi-View Encoder for Adaptive Traffic Signal Control |
Multi-graph similarity | reinforcement | for image annotation refinement |
Multi-Kernel Online | reinforcement | Learning for Path Tracking Control of Intelligent Vehicles |
Multi-Level Policy and Reward-Based Deep | reinforcement | Learning Framework for Image Captioning |
Multi-Objective Multi-Satellite Imaging Mission Planning Algorithm for Regional Mapping Based on Deep | reinforcement | Learning |
Multi-Scale Deep | reinforcement | Learning for Real-Time 3D-Landmark Detection in CT Scans |
multi-scenario text generation method based on meta | reinforcement | learning, A |
Multi-source Transfer Learning for Deep | reinforcement | Learning |
Multi-Step | reinforcement | Learning for Single Image Super-Resolution |
Multi-User Adaptive Video Delivery Over Wireless Networks: A Physical Layer Resource-Aware Deep | reinforcement | Learning Approach |
Multiagent | reinforcement | Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto |
Multimodal Vehicular Trajectory Prediction With Inverse | reinforcement | Learning and Risk Aversion at Urban Unsignalized Intersections |
Multiple-Goal | reinforcement | Learning Method for Complex Vehicle Overtaking Maneuvers, A |
Multistep Multiagent | reinforcement | Learning for Optimal Energy Schedule Strategy of Charging Stations in Smart Grid |
Multitask | reinforcement | Learning in Nondeterministic Environments: Maze Problem Case |
MuRCL: Multi-Instance | reinforcement | Contrastive Learning for Whole Slide Image Classification |
MVSSC: Meta- | reinforcement | learning based visual indoor navigation using multi-view semantic spatial context |
Navigating Robots in Dynamic Environment with Deep | reinforcement | Learning |
Neighborhood Cooperative Multiagent | reinforcement | Learning for Adaptive Traffic Signal Control in Epidemic Regions |
Neighbourhood Context Embeddings in Deep Inverse | reinforcement | Learning for Predicting Pedestrian Motion Over Long Time Horizons |
Neural Batch Sampling with | reinforcement | Learning for Semi-Supervised Anomaly Detection |
Neural network based | reinforcement | learning for audio-visual gaze control in human-robot interaction |
Neural Network Pruning Through Constrained | reinforcement | Learning |
Novel Edge Caching Approach Based on Multi-Agent Deep | reinforcement | Learning for Internet of Vehicles |
novel fast fractal image compression based on | reinforcement | learning, A |
Object Localization Without Bounding Box Information Using Generative Adversarial | reinforcement | Learning |
Off-policy | reinforcement | Learning for Efficient and Effective GAN Architecture Search |
On the Development of an Acoustic-Driven Method to Improve Driver's Comfort Based on Deep | reinforcement | Learning |
On-Line Classification of Data Streams with Missing Values Based on | reinforcement | Learning |
Online Crowd Semantic Segmentation Method Based on | reinforcement | Learning, An |
Online Deep | reinforcement | Learning-Based Order Recommendation Framework for Rider-Centered Food Delivery System, An |
Online | reinforcement | Learning Approach for User-Optimal Parking Searching Strategy Exploiting Unique Problem Property and Network Topology, An |
Online | reinforcement | Learning Control for the Personalization of a Robotic Knee Prosthesis |
Online | reinforcement | Learning for Dynamic Multimedia Systems |
Online | reinforcement | Learning of X-Haul Content Delivery Mode in Fog Radio Access Networks |
Online Vehicle Routing With Neural Combinatorial Optimization and Deep | reinforcement | Learning |
Operating Electric Vehicle Fleet for Ride-Hailing Services With | reinforcement | Learning |
Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via | reinforcement | Learning |
Optimal EV Fast Charging Station Deployment Based on a | reinforcement | Learning Framework |
Optimal Local Basis: A | reinforcement | Learning Approach for Face Recognition |
Optimization Approach to Edge | reinforcement | , An |
Optimization Method for Collaborative Radar Antijamming Based on Multi-Agent | reinforcement | Learning, An |
Optimized Assistive Human-Robot Interaction Using | reinforcement | Learning |
Optimizing Nitrogen Management with Deep | reinforcement | Learning and Crop Simulations |
Option compatible reward inverse | reinforcement | learning |
Option-Based Multi-Agent | reinforcement | Learning for Painting With Multiple Large-Sized Robots |
Parsing Facades with Shape Grammars and | reinforcement | Learning |
Part-Activated Deep | reinforcement | Learning for Action Prediction |
Partial Policy-Based | reinforcement | Learning for Anatomical Landmark Localization in 3D Medical Images |
Partial | reinforcement | in Game Biofeedback for Relaxation Training |
Particle Filter Design Based on | reinforcement | Learning and Its Application to Mobile Robot Localization |
PassGoodPool: Joint Passengers and Goods Fleet Management With | reinforcement | Learning Aided Pricing, Matching, and Route Planning |
Patchattack: A Black-box Texture-based Attack with | reinforcement | Learning |
Path Following Optimization for an Underactuated USV Using Smoothly-Convergent Deep | reinforcement | Learning |
Path-Analysis-Based | reinforcement | Learning Algorithm for Imitation Filming |
Perception-Action Based Object Detection from Local Descriptor Combination and | reinforcement | Learning |
Personalized Car-Following Control Based on a Hybrid of | reinforcement | Learning and Supervised Learning |
PFRL: Pose-Free | reinforcement | Learning for 6D Pose Estimation |
Physics Informed Deep | reinforcement | Learning for Aircraft Conflict Resolution |
PixelRL: Fully Convolutional Network With | reinforcement | Learning for Image Processing |
Planar Pose Estimation Using Object Detection and | reinforcement | Learning |
Point-2s | reinforcement | learning biomimetic model for estimating and analyzing human 3D motion posture, A |
Policy-Based | reinforcement | Learning for Training Autonomous Driving Agents in Urban Areas With Affordance Learning |
PolicyCleanse: Backdoor Detection and Mitigation for Competitive | reinforcement | Learning |
PoseAgent: Budget-Constrained 6D Object Pose Estimation via | reinforcement | Learning |
Positive Impact of State Similarity on | reinforcement | Learning Performance |
Potential Game Based Task Offloading in the High-Speed Railway With | reinforcement | Learning |
PPO2: Location Privacy-Oriented Task Offloading to Edge Computing Using | reinforcement | Learning for Intelligent Autonomous Transport Systems |
PR-RL: Portrait Relighting Via Deep | reinforcement | Learning |
Precise detection of Chinese characters in historical documents with deep | reinforcement | learning |
Predicting Goal-Directed Human Attention Using Inverse | reinforcement | Learning |
Predicting Head Movement in Panoramic Video: A Deep | reinforcement | Learning Approach |
Preserving Location-Privacy in Vehicular Networks via | reinforcement | Learning |
Privacy-Aware Multiagent Deep | reinforcement | Learning for Task Offloading in VANET |
Privacy-Preserving Federated Deep | reinforcement | Learning for Mobility-as-a-Service |
Product Image Recommendation with Transformer Model Using Deep | reinforcement | Learning |
Progressive Modality | reinforcement | for Human Multimodal Emotion Recognition from Unaligned Multimodal Sequences |
QoE-Based Task Offloading With Deep | reinforcement | Learning in Edge-Enabled Internet of Vehicles |
Quadrotor Autonomous Navigation in Semi-Known Environments Based on Deep | reinforcement | Learning |
Qualitative Transfer for | reinforcement | Learning with Continuous State and Action Spaces |
Quality-aware dual-modal saliency detection via deep | reinforcement | learning |
R3L: Connecting Deep | reinforcement | Learning To Recurrent Neural Networks for Image Denoising Via Residual Recovery |
RAPT360: | reinforcement | Learning-Based Rate Adaptation for 360-Degree Video Streaming With Adaptive Prediction and Tiling |
Rate Control Method Based on Deep | reinforcement | Learning for Dynamic Video Sequences in HEVC |
ReAgent: Point Cloud Registration using Imitation and | reinforcement | Learning |
Real-Time Charging Scheduling of Automated Guided Vehicles in Cyber-Physical Smart Factories Using Feature-Based | reinforcement | Learning |
Real-Time Holding Control for Transfer Synchronization via Robust Multiagent | reinforcement | Learning |
Real-time Motion Planning for Robotic Teleoperation Using Dynamic-goal Deep | reinforcement | Learning |
Real-time Multi-person Pose Tracking Method Using Deep | reinforcement | Learning |
Real-Time Video Emotion Recognition Based on | reinforcement | Learning and Domain Knowledge |
Real-World | reinforcement | Learning Framework for Safe and Human-Like Tactical Decision-Making, A |
Realizing Railway Cognitive Radio: A | reinforcement | Base-Station Multi-Agent Model |
Reaper: Articulated Object 6d Pose Estimation with Deep | reinforcement | Learning |
Reassignment Algorithm of the Ride-Sourcing Market Based on | reinforcement | Learning |
Receding Horizon Cache and Extreme Learning Machine based | reinforcement | Learning |
Reconstruct and Represent Video Contents for Captioning via | reinforcement | Learning |
Region | reinforcement | Network With Topic Constraint for Image-Text Matching |
Regularising neural networks for future trajectory prediction via inverse | reinforcement | learning framework |
Reinforce Data, Multiply Impact: Improved Model Accuracy and Robustness with Dataset | reinforcement | |
Reinforcedet: Object Detection By Integrating | reinforcement | Learning With Decoupled Pipeline |
| reinforcement | Cutting-Agent Learning for Video Object Segmentation |
| reinforcement | Learning |
| reinforcement | Learning and Neuroevolution in Flappy Bird Game |
| reinforcement | Learning and Particle Swarm Optimization Supporting Real-Time Rescue Assignments for Multiple Autonomous Underwater Vehicles |
| reinforcement | Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems, A |
| reinforcement | Learning Approach for Enacting Cautious Behaviours in Autonomous Driving System: Safe Speed Choice in the Interaction With Distracted Pedestrians, A |
| reinforcement | Learning Approach for Rebalancing Electric Vehicle Sharing Systems, A |
| reinforcement | learning approach to active camera foveation, A |
| reinforcement | Learning Approach to Autonomous Decision Making of Intelligent Vehicles on Highways, A |
| reinforcement | Learning Approach to the View Planning Problem, A |
| reinforcement | Learning Based Advertising Strategy Using Crowdsensing Vehicular Data |
| reinforcement | learning based coding unit early termination algorithm for high efficiency video coding |
| reinforcement | learning based mainline dynamic speed limit adjustment of expressway off-ramp upstream under connected and autonomous vehicles environment |
| reinforcement | Learning Based Relay Selection for Underwater Acoustic Cooperative Networks |
| reinforcement | Learning Based Two-Stage Model for Emotion Cause Pair Extraction, A |
| reinforcement | learning based visual attention with application to face detection |
| reinforcement | learning combined with a fuzzy adaptive learning control network (FALCON-R) for pattern classification |
| reinforcement | learning cropping method based on comprehensive feature and aesthetics assessment |
| reinforcement | Learning Enhanced PicHunter for Interactive Search |
| reinforcement | learning for combining relevance feedback techniques |
| reinforcement | Learning for Compressed-Sensing Based Frequency Agile Radar in the Presence of Active Interference |
| reinforcement | learning for instance segmentation with high-level priors |
| reinforcement | Learning for Integrating Context with Clutter Models for Target Detection |
| reinforcement | Learning for Logic Recipe Generation: Bridging Gaps From Images to Plans |
| reinforcement | learning for neural architecture search: A review |
| reinforcement | Learning for Online Dispatching Policy in Real-Time Train Timetable Rescheduling |
| reinforcement | Learning for Robust and Efficient Real-World Tracking |
| reinforcement | learning for video encoder control in HEVC |
| reinforcement | Learning for Visual Object Detection |
| reinforcement | learning framework for parameter control in computer vision applications, A |
| reinforcement | Learning Helps SLAM: Learning to Build Maps |
| reinforcement | Learning in Reproducing Kernel Hilbert Spaces: Enabling Continuous Brain-Machine Interface Adaptation |
| reinforcement | learning Integrated Image Segmentation and Object Recognition |
| reinforcement | Learning to Drive a Car by Pattern Matching |
| reinforcement | Learning via Recurrent Convolutional Neural Networks |
| reinforcement | Learning with Dual Attention Guided Graph Convolution for Relation Extraction |
| reinforcement | Learning With Function Approximation for Traffic Signal Control |
| reinforcement | Learning With Multiple Relational Attention for Solving Vehicle Routing Problems |
| reinforcement | Learning with Raw Image Pixels as Input State |
| reinforcement | learning with space carving for plant scanning |
| reinforcement | Learning-Based Black-Box Model Inversion Attacks |
| reinforcement | learning-based feature learning for object tracking |
| reinforcement | Learning-Based Interactive Video Search |
| reinforcement | Learning-Based Layer-Wise Quantization For Lightweight Deep Neural Networks |
| reinforcement | Learning-Based Variable Speed Limit Control Strategy to Reduce Traffic Congestion at Freeway Recurrent Bottlenecks |
| reinforcement | Matching Using Region Context |
| reinforcement | online learning for emotion prediction by using physiological signals |
| reinforcement | Shrink-Mask for Text Detection |
| reinforcement | -based Display Selection for Frugal Learning |
| reinforcement | -Learning-Based Cooperative Adaptive Cruise Control of Buses in the Lincoln Tunnel Corridor with Time-Varying Topology |
| reinforcement | -Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics Pipeline, A |
| reinforcement | -Tracking: An Effective Trajectory Tracking and Navigation Method for Autonomous Urban Driving |
ReLeaPS: | reinforcement | Learning-based Illumination Planning for Generalized Photometric Stereo |
REPNP: Plug-and-Play with Deep | reinforcement | Learning Prior for Robust Image Restoration |
Research on Image Recognition based on | reinforcement | Learning |
Research on Resource Allocation Method of Space Information Networks Based on Deep | reinforcement | Learning |
review of | reinforcement | learning applications in adaptive traffic signal control, A |
Revising the Observation Satellite Scheduling Problem Based on Deep | reinforcement | Learning |
Revisiting Jump-Diffusion Process for Visual Tracking: A | reinforcement | Learning Approach |
Reward-Adaptive | reinforcement | Learning: Dynamic Policy Gradient Optimization for Bipedal Locomotion |
Risk-aware controller for autonomous vehicles using model-based collision prediction and | reinforcement | learning |
RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on | reinforcement | Learning for Ground Vehicles |
RL-CAM: Visual Explanations for Convolutional Networks using | reinforcement | Learning |
RL-CycleGAN: | reinforcement | Learning Aware Simulation-to-Real |
RL-GAN-Net: A | reinforcement | Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion |
RLID-V: | reinforcement | Learning-Based Information Dissemination Policy Generation in VANETs |
RLS-DTS: | reinforcement | -Learning Linguistic Steganalysis in Distribution-Transformed Scenario |
RLSAC: | reinforcement | Learning enhanced Sample Consensus for End-to-End Robust Estimation |
RlSnake: a Hybrid | reinforcement | Learning Approach for Road Detection |
RLSS: A Deep | reinforcement | Learning Algorithm for Sequential Scene Generation |
RLST: A | reinforcement | Learning Approach to Scene Text Detection Refinement |
RLStereo: Real-Time Stereo Matching Based on | reinforcement | Learning |
Roadside Decision-Making Methodology Based on Deep | reinforcement | Learning to Simultaneously Improve the Safety and Efficiency of Merging Zone, A |
Robot motion adaptation through user intervention and | reinforcement | learning |
Robust Decision Making for Autonomous Vehicles at Highway On-Ramps: A Constrained Adversarial | reinforcement | Learning Approach |
Robust Dynamic Bus Control: a Distributional Multi-Agent | reinforcement | Learning Approach |
Robust experience replay sampling for multi-agent | reinforcement | learning |
Robust Motion Control for UAV in Dynamic Uncertain Environments Using Deep | reinforcement | Learning |
Robust multi-agent | reinforcement | learning via Bayesian distributional value estimation |
Robust Multimodal Image Registration Using Deep Recurrent | reinforcement | Learning |
Robustness Analysis of Discrete State-Based | reinforcement | Learning Models in Traffic Signal Control |
Robustness with Query-efficient Adversarial Attack using | reinforcement | Learning |
RSSI Map-Based Trajectory Design for UGV Against Malicious Radio Source: A | reinforcement | Learning Approach |
Rule-constrained | reinforcement | learning control for autonomous vehicle left turn at unsignalized intersection |
Safe | reinforcement | Learning for Autonomous Vehicle Using Monte Carlo Tree Search |
Safe | reinforcement | Learning for Single Train Trajectory Optimization via Shield SARSA |
Safe-State Enhancement Method for Autonomous Driving via Direct Hierarchical | reinforcement | Learning |
San Carlo Dei Barnabiti: Restoration And | reinforcement | of the Roofing Of A Florentine Baroque Masterpiece |
Sarod: Efficient End-To-End Object Detection on SAR Images With | reinforcement | Learning |
Scalable lifelong | reinforcement | learning |
Scalable | reinforcement | Learning Algorithm for Scheduling Railway Lines, A |
Scheduling the Operation of a Connected Vehicular Network Using Deep | reinforcement | Learning |
SeedNet: Automatic Seed Generation with Deep | reinforcement | Learning for Robust Interactive Segmentation |
Seeing Beyond the Patch: Scale-Adaptive Semantic Segmentation of High-resolution Remote Sensing Imagery based on | reinforcement | Learning |
Seeing By Haptic Glance: | reinforcement | Learning Based 3d Object Recognition |
Seek-and-Hide: Adversarial Steganography via Deep | reinforcement | Learning |
Selecting A Diverse Set Of Aesthetically-Pleasing and Representative Video Thumbnails Using | reinforcement | Learning |
Selecting Vision Operators and Fixing Their Optimal Parameters Values Using | reinforcement | Learning |
Selective Federated | reinforcement | Learning Strategy for Autonomous Driving, A |
Selective part-based correlation filter tracking algorithm with | reinforcement | learning |
Selective Spatial Regularization by | reinforcement | Learned Decision Making for Object Tracking |
Self-imitation guided goal-conditioned | reinforcement | learning |
Self-Supervise | reinforcement | Learning Method for Vacant Parking Space Detection Based on Task Consistency and Corrupted Rewards |
self-supervised causal feature | reinforcement | learning method for non-invasive hemoglobin prediction, A |
Self-Supervised Discovering of Interpretable Features for | reinforcement | Learning |
Semantic Boundary Detection With | reinforcement | Learning for Continuous Sign Language Recognition |
Semantics | reinforcement | and fusion learning for multimedia streams |
Semi-Decentralized Network Slicing for Reliable V2V Service Provisioning: A Model-Free Deep | reinforcement | Learning Approach |
Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient | reinforcement | Learning |
Semi-supervised double duelling broad | reinforcement | learning in support of traffic service in smart cities |
Semi-Supervised Pedestrian Instance Synthesis and Detection With Mutual | reinforcement | |
Shape grammar parsing via | reinforcement | Learning |
sharing deep | reinforcement | learning method for efficient vehicle platooning control, A |
Ship Rotation Detection Model in Remote Sensing Images Based on Feature Fusion Pyramid Network and Deep | reinforcement | Learning, A |
Sim-Real Joint | reinforcement | Transfer for 3D Indoor Navigation |
Simoun: Synergizing Interactive Motion-appearance Understanding for Vision-based | reinforcement | Learning |
simple boundary | reinforcement | technique for segmentation without prior, A |
Sketch-Based Image Retrieval by Salient Contour | reinforcement | |
Skill-Based Hierarchical | reinforcement | Learning for Target Visual Navigation |
Slide deep | reinforcement | learning networks: Application for left ventricle segmentation |
Smart Underwater Pollution Detection Based on Graph-Based Multi-Agent | reinforcement | Learning Towards AUV-Based Network ITS |
SME-Net: Sparse Motion Estimation for Parametric Video Prediction Through | reinforcement | Learning |
Social-Aware Incentive Mechanism for Vehicular Crowdsensing by Deep | reinforcement | Learning |
Software-Defined Vehicular Networks With Trust Management: A Deep | reinforcement | Learning Approach |
Sparse Black-Box Video Attack with | reinforcement | Learning |
Spatial Geometric Reasoning for Room Layout Estimation via Deep | reinforcement | Learning |
Speed harmonisation and merge control using connected automated vehicles on a highway lane closure: a | reinforcement | learning approach |
SPSD: Semantics and Deep | reinforcement | Learning Based Motion Planning for Supermarket Robot |
SRL-TR2: A Safe | reinforcement | Learning Based TRajectory TRacker Framework |
Stabilizing Visual | reinforcement | Learning via Asymmetric Interactive Cooperation |
StARformer: Transformer with State-Action-Reward Representations for Visual | reinforcement | Learning |
State Representation Learning With Adjacent State Consistency Loss for Deep | reinforcement | Learning |
State-Temporal Compression in | reinforcement | Learning With the Reward-Restricted Geodesic Metric |
Straight to the Point: Fast-Forwarding Videos via | reinforcement | Learning Using Textual Data |
Structured Cooperative | reinforcement | Learning With Time-Varying Composite Action Space |
Style-Agnostic | reinforcement | Learning |
Subject-Specific Cardiac Segmentation Based on | reinforcement | Learning with Shape Instantiation |
Successor Feature-Based Transfer | reinforcement | Learning for Video Rate Adaptation With Heterogeneous QoE Preferences |
Survey of Deep | reinforcement | Learning for Motion Planning of Autonomous Vehicles |
Swarm | reinforcement | learning for traffic signal control based on cooperative multi-agent framework |
Switching to Discriminative Image Captioning by Relieving a Bottleneck of | reinforcement | Learning |
Target Tracking Control of UAV Through Deep | reinforcement | Learning |
Task-Driven Semantic Coding via | reinforcement | Learning |
Task-Risk Consistency Object Detection Framework Based on Deep | reinforcement | Learning, A |
TCLiVi: Transmission Control in Live Video Streaming Based on Deep | reinforcement | Learning |
Temporal Alignment for History Representation in | reinforcement | Learning |
Temporal Complementarity-Guided | reinforcement | Learning for Image-to-Video Person Re-Identification |
Temporal Difference-Aware Graph Convolutional | reinforcement | Learning for Multi-Intersection Traffic Signal Control |
Temporal-Spatial Causal Interpretations for Vision-Based | reinforcement | Learning |
Text-Driven Video Acceleration: A Weakly-Supervised | reinforcement | Learning Method |
Toward Attack-Resistant Route Mutation for VANETs: An Online and Adaptive Multiagent | reinforcement | Learning Approach |
Toward Optimal Real-Time Volumetric Video Streaming: A Rolling Optimization and Deep | reinforcement | Learning Based Approach |
Toward Robots' Behavioral Transparency of Temporal Difference | reinforcement | Learning With a Human Teacher |
Towards Comprehensive Testing on the Robustness of Cooperative Multi-agent | reinforcement | Learning |
Towards Continuous Control for Mobile Robot Navigation: A | reinforcement | Learning and Slam Based Approach |
Towards Interpretable Deep | reinforcement | Learning Models via Inverse Reinforcement Learning |
Towards Interpretable Deep | reinforcement | Learning Models via Inverse Reinforcement Learning |
Towards Neural Charged Particle Tracking in Digital Tracking Calorimeters With | reinforcement | Learning |
Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with | reinforcement | Learning |
Tractable large-scale deep | reinforcement | learning |
Traffic light control using deep policy-gradient and value-function-based | reinforcement | learning |
Traffic Signal Control Using End-to-End Off-Policy Deep | reinforcement | Learning |
Traffic Signal Control With | reinforcement | Learning Based on Region-Aware Cooperative Strategy |
Traffic signal priority control based on shared experience multi-agent deep | reinforcement | learning |
Training Drift Counteraction Optimal Control Policies Using | reinforcement | Learning: An Adaptive Cruise Control Example |
Training Socially Engaging Robots: Modeling Backchannel Behaviors with Batch | reinforcement | Learning |
Trajectory Jerking Suppression for Mixed Traffic Flow at a Signalized Intersection: A Trajectory Prediction Based Deep | reinforcement | Learning Method |
Transfer Learning in Deep | reinforcement | Learning: A Survey |
Transformer-Based | reinforcement | Learning for Pickup and Delivery Problems With Late Penalties |
Trustworthy Edge Storage Orchestration in Intelligent Transportation Systems Using | reinforcement | Learning |
UAV First View Landmark Localization via Deep | reinforcement | Learning |
UAV first view landmark localization with active | reinforcement | learning |
UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep | reinforcement | Learning and Meta-Learning |
UAV-Assisted Fair Communication for Mobile Networks: A Multi-Agent Deep | reinforcement | Learning Approach |
UNAS: Differentiable Architecture Search Meets | reinforcement | Learning |
Underwater Equipotential Line Tracking Based on Self-Attention Embedded Multiagent | reinforcement | Learning Toward AUV-Based ITS |
Unsupervised Learning for Maximum Consensus Robust Fitting: A | reinforcement | Learning Approach |
Unsupervised Learning for Robust Fitting: A | reinforcement | Learning Approach |
Unsupervised | reinforcement | Learning of Transferable Meta-Skills for Embodied Navigation |
Unsupervised Video Summarization via Deep | reinforcement | Learning With Shot-Level Semantics |
Unsupervised Visual Attention and Invariance for | reinforcement | Learning |
Urban Multiple Route Planning Model Using Dynamic Programming in | reinforcement | Learning |
Urban Traffic Control in Software Defined Internet of Things via a Multi-Agent Deep | reinforcement | Learning Approach |
User Association for Load Balancing in Vehicular Networks: An Online | reinforcement | Learning Approach |
User-Guided Personalized Image Aesthetic Assessment Based on Deep | reinforcement | Learning |
Using Deep | reinforcement | Learning to Automate Network Configurations for Internet of Vehicles |
Using | reinforcement | Learning to Control Traffic Signals in a Real-World Scenario: An Approach Based on Linear Function Approximation |
Using | reinforcement | Learning With Partial Vehicle Detection for Intelligent Traffic Signal Control |
Using Semantic Information to Improve Generalization of | reinforcement | Learning Policies for Autonomous Driving |
Vacant Parking Space Detection based on Task Consistency and | reinforcement | Learning |
Value-based deep | reinforcement | learning for adaptive isolated intersection signal control |
Variance Reduced Domain Randomization for | reinforcement | Learning With Policy Gradient |
VeSoNet: Traffic-Aware Content Caching for Vehicular Social Networks Using Deep | reinforcement | Learning |
Video Annotation Through Search and Graph | reinforcement | Mining |
Video Captioning via Hierarchical | reinforcement | Learning |
Video Summarization Through | reinforcement | Learning With a 3D Spatio-Temporal U-Net |
Video Summarization Using | reinforcement | Learning in Eigenspace |
Viewpoint and Scale Consistency | reinforcement | for UAV Vehicle Re-Identification |
Viewport-Aware Deep | reinforcement | Learning Approach for 360° Video Caching |
Visible Routes In 3D Dense City Using | reinforcement | Learning |
Vision Processing for Assistive Vision: A Deep | reinforcement | Learning Approach |
Visual Object Tracking in Drone Images with Deep | reinforcement | Learning |
Visual Tracking by Means of Deep | reinforcement | Learning and an Expert Demonstrator |
Wasserstein Loss With Alternative | reinforcement | Learning for Severity-Aware Semantic Segmentation |
Weakly Supervised Deep | reinforcement | Learning for Video Summarization With Semantically Meaningful Reward |
Weighing Counts: Sequential Crowd Counting by | reinforcement | Learning |
Zwei: A Self-Play | reinforcement | Learning Framework for Video Transmission Services |
704 for reinforcement