dc.contributor.advisor |
Wang, Zenghui
|
|
dc.contributor.author |
Ohifemen, Obadan Samuel
|
|
dc.date.accessioned |
2022-07-14T11:26:24Z |
|
dc.date.available |
2022-07-14T11:26:24Z |
|
dc.date.issued |
2022-06 |
|
dc.identifier.uri |
https://hdl.handle.net/10500/29107 |
|
dc.description.abstract |
The cooperate behavior that emerges from the interactions among simple multi-agent robots along with the solution possibilities these interactions provide, has formed part of the growing research areas in recent years within the confines of artificial intelligence. In this thesis, we explore these possibilities leveraging single and multi-objective optimization on a machine-learning algorithm: the artificial neural network. The rationale is to achieve goal oriented collaborative control (group behavior) with regard to localizing multiple gradient sources during search operations. With a view to solving the multisource localization problem, we develop an ingenious hybrid metaheuristics algorithm for optimizing exploration (search-oriented) and exploitation (goal-oriented) in both fully observable and partially observable domains. We compared the performance of our model with the existing state of the art metaheuristic algorithms such as Simulated Annealing (SA), Cuckoo-search (CK), Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) using 30 standard benchmark complex objective functions. Results showed significant improvement. The homing (goal-oriented) operation introduced novel concepts for accelerating off-policy reinforcement learning algorithm for Partially Observable Markov Decision Processes (POMDP) via dynamic programming on a multi-agent framework. Finally, we demonstrated an ingenious approach to the resampling phase of Monte Carlo’s particle filter (for robot localization) which showed relatively significant improvement in the belief state estimation accuracy with respect to ground truth within POMDP domains. The contribution of this research is twofold: firstly, it presents a framework for search optimization while localizing multiple emission sources. Secondly, it presents ingenious concepts for foraging, gradient source localization along with the potential for search and rescue operations within POMDP environments. |
en |
dc.format.extent |
1 online resource (201 leaves) : illustrations, graphs (chiefly color) |
|
dc.language.iso |
en |
en |
dc.subject |
Genetic algorithm |
en |
dc.subject |
Dynamic programming |
en |
dc.subject |
Evolutionary Neural networks |
en |
dc.subject |
Cooperation |
en |
dc.subject |
Search optimization |
en |
dc.subject |
Hybridization |
en |
dc.subject |
Reinforcement Learning |
en |
dc.subject |
Particle filters |
en |
dc.subject |
Markov decision processes |
en |
dc.subject |
POMDPs |
en |
dc.subject.ddc |
006.3 |
|
dc.subject.lcsh |
Robots -- Control systems |
en |
dc.subject.lcsh |
Robots -- Programming |
en |
dc.subject.lcsh |
Robotics |
en |
dc.subject.lcsh |
Markov processes |
en |
dc.subject.lcsh |
Control theory -- Computer programs |
en |
dc.subject.lcsh |
Dynamic programming |
en |
dc.subject.lcsh |
Artificial intelligence |
en |
dc.subject.lcsh |
Computer algorithms |
en |
dc.title |
Cooperative control and optimization of robotic agents |
en |
dc.type |
Thesis |
en |
dc.description.department |
School of Computing |
en |
dc.description.degree |
Ph. D. (Computer Science) |
|