No Free Lunch in Ball Catching: A Comparison of Cartesian and Angular Representations for Control
dc.contributor.author | Höfer, Sebastian | |
dc.contributor.author | Raisch, Jörg | |
dc.contributor.author | Toussaint, Marc | |
dc.contributor.author | Brock, Oliver | |
dc.date.accessioned | 2018-05-18T11:26:22Z | |
dc.date.available | 2018-05-18T11:26:22Z | |
dc.date.issued | 2018 | |
dc.description.abstract | How to run most effectively to catch a projectile, such as a baseball, that is flying in the air for a long period of time? The question about the best solution to the ball catching problem has been subject to intense scientific debate for almost 50 years. It turns out that this scientific debate is not focused on the ball catching problem alone, but revolves around the research question what constitutes the ingredients of intelligent decision making. Over time, two opposing views have emerged: the generalist view regarding intelligence as the ability to solve any task without knowing goal and environment in advance, based on optimal decision making using predictive models; and the specialist view which argues that intelligent decision making does not have to be based on predictive models and not even optimal, advocating simple and efficient rules of thumb (heuristics) as superior to enable accurate decisions. We study two types of approaches to the ball catching problem, one for each view, and investigate their properties using both a theoretical analysis and a broad set of simulation experiments. Our study shows that neither of the two types of approaches can be regarded as superior in solving all relevant variants of the ball catching problem: each approach is optimal under a different realistic environmental condition. Therefore, predictive models neither guarantee nor prevent success a priori, and we further show that the key difference between the generalist and the specialist approach to ball catching is the type of input representation used to control the agent. From this finding, we conclude that the right solution to a decision making or control problem is orthogonal to the generalist and specialist approach, and thus requires a reconciliation of the two views in favor of a representation-centric view. | en |
dc.description.sponsorship | DFG, SPP 1527, Autonomes Lernen | en |
dc.identifier.uri | https://depositonce.tu-berlin.de/handle/11303/7813 | |
dc.identifier.uri | http://dx.doi.org/10.14279/depositonce-6988 | |
dc.language.iso | en | en |
dc.relation.issupplementto | https://doi.org/10.1371/journal.pone.0197803 | en |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en |
dc.subject.ddc | 000 Informatik, Informationswissenschaft, allgemeine Werke | de |
dc.subject.other | ball catching | en |
dc.subject.other | gaze heuristic | en |
dc.subject.other | Chapman's strategy | en |
dc.subject.other | optical acceleration cancellation | en |
dc.subject.other | optimal control | en |
dc.subject.other | reinforcement learning | en |
dc.subject.other | no free lunch | en |
dc.title | No Free Lunch in Ball Catching: A Comparison of Cartesian and Angular Representations for Control | en |
dc.type | Generic Research Data | en |
tub.accessrights.dnb | free | * |
tub.affiliation | Fak. 4 Elektrotechnik und Informatik::Inst. Technische Informatik und Mikroelektronik::FG Robotics | de |
tub.affiliation.faculty | Fak. 4 Elektrotechnik und Informatik | de |
tub.affiliation.group | FG Robotics | de |
tub.affiliation.institute | Inst. Technische Informatik und Mikroelektronik | de |
Files
Original bundle
1 - 12 of 12
No Thumbnail Available
- Name:
- README.md
- Size:
- 1.14 KB
- Format:
- Unknown data format
- Description:
- README
No Thumbnail Available
- Name:
- e2e_baseline_cov-io_manual.zip
- Size:
- 2.23 MB
- Format:
- ZIP archive format.
- Description:
- Experimental evaluation of COV-IO baseline (previously trained by supervised learning)
No Thumbnail Available
- Name:
- e2e_baseline_cov-oac_manual.zip
- Size:
- 2.35 MB
- Format:
- ZIP archive format.
- Description:
- Experimental evaluation of COV-OAC baseline (previously trained by supervised learning)
No Thumbnail Available
- Name:
- e2e_cma.zip
- Size:
- 120.34 MB
- Format:
- ZIP archive format.
- Description:
- CMA-ES evaluated to different observation types
No Thumbnail Available
- Name:
- 2DExperiments_fr60.tar.gz
- Size:
- 2.21 GB
- Format:
- Unknown data format
- Description:
- Experiments in 2D environment with frame rate of 60Hz
No Thumbnail Available
- Name:
- 2DExperiments_fr10.tar.gz
- Size:
- 178.03 MB
- Format:
- Unknown data format
- Description:
- Experiments in 2D environment with frame rate of 10Hz (including MPC experiments)
No Thumbnail Available
- Name:
- AdversarialConstCOV.tar.gz
- Size:
- 1.56 MB
- Format:
- Unknown data format
- Description:
- Single experiment, demonstrating adversarial parameter choice for constant COV strategy (Figure 5 in paper)
No Thumbnail Available
- Name:
- 3DExperiments_fr10.tar.gz
- Size:
- 717.27 MB
- Format:
- Unknown data format
- Description:
- Experiments in 3D environment with frame rate of 10Hz (including MPC experiments)
No Thumbnail Available
- Name:
- 3DExperiments_fr60_part1.tar.gz
- Size:
- 1.76 GB
- Format:
- Unknown data format
- Description:
- Experiments in 3D environment with frame rate of 60Hz (1: COV-IO)
No Thumbnail Available
- Name:
- 3DExperiments_fr60_part2.tar.gz
- Size:
- 1.75 GB
- Format:
- Unknown data format
- Description:
- Experiments in 3D environment with frame rate of 60Hz (2: COV-OAC)
No Thumbnail Available
- Name:
- 3DExperiments_fr60_part3.tar.gz
- Size:
- 1.86 GB
- Format:
- Unknown data format
- Description:
- Experiments in 3D environment with frame rate of 60Hz (3: iLQG)
No Thumbnail Available
- Name:
- 3DExperiments_fr60_part4.tar.gz
- Size:
- 1.81 GB
- Format:
- Unknown data format
- Description:
- Experiments in 3D environment with frame rate of 60Hz (4: LQG)
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 2.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: