Now AI can outmaneuver you at both Stratego and Diplomacy • TechCrunch

By Emma Watson, December 2, 2022

While artificial intelligence long ago surpassed human capability in Chess, and more recently Go — and let us not forget Doom — other more complex board games still present a challenge to computer systems. Until very recently, Stratego and Diplomacy were two of those games, but now AI has become table-flipping good at the former and passably human at the latter.

On the surface, you might think these games are hard for AI simply because they require a certain level of long-term planning and strategy. But so do Go and Chess, just in a different way.

The crucial difference is actually that Stratego and Diplomacy are games of strategy based on imperfect information. In Chess and Go, you can see every piece on the board. Stratego hides the identity of pieces until they are encountered by another piece, and Diplomacy is largely about establishing agreements, alliances, and of course vendettas that are kept secret but core to the gameplay. No honest Chess game will involve a third party swooping in to protect your opponent’s bishop with a blue rook.

Both games require not raw calculation of paths to victory, but softer skills like guessing what the opponent is thinking, and what they think the computer is thinking, and making moves that accommodate and hopefully upset those assumptions. In other words, the AI has to bluff and convince another player of something, not just overpower them with the best possible moves.

The Stratego-playing model, from DeepMind, is named DeepNash, after the famous equilibrium. It is focused less on clever moves and more on play that can’t be exploited or predicted. In some cases this can be bold, like one game the team watched against a human player where the AI sacrificed several high-level pieces, leaving it at a material disadvantage — but it was all a calculated risk to bring out the other player’s big guns, so it could strategize around those. (It won.)
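To make "play that can't be exploited" a little more concrete, here is a minimal sketch using rock-paper-scissors as a stand-in for Stratego. It is a toy illustration of the equilibrium idea, not DeepNash's method; the payoff table and the `best_response_value` helper are assumptions made for this example. It measures how much an opponent who knows your mixed strategy could win against it on average.

```python
# Toy illustration only: rock-paper-scissors stands in for a far larger game.
# PAYOFF[i][j] is the row player's payoff when row plays i and column plays j.
# Index order: 0 = rock, 1 = paper, 2 = scissors.
PAYOFF = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]


def best_response_value(strategy):
    """Best expected payoff an opponent can get if they know `strategy`."""
    values = []
    for j in range(3):
        # Expected payoff to the row player when the opponent plays j.
        row_payoff = sum(strategy[i] * PAYOFF[i][j] for i in range(3))
        # The game is zero-sum, so the opponent receives the negative.
        values.append(-row_payoff)
    return max(values)


uniform = [1 / 3, 1 / 3, 1 / 3]      # the equilibrium mix for this game
rock_heavy = [0.5, 0.25, 0.25]       # a predictable habit

print(best_response_value(uniform))     # about 0: nothing to exploit
print(best_response_value(rock_heavy))  # 0.25: playing paper beats it on average
```

The uniform mix gives an informed opponent nothing to gain, while the rock-heavy habit hands them a quarter of a point per round. Unexploitable play in this sense can look odd move by move, which fits the sacrifices described above.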

DeepNash is good enough that it beat other Stratego systems almost every time, and won 84% of the time against experienced humans. Because the algorithms that work well in Go and Chess don’t work well here, the DeepMind team invented a new algorithmic method called Regularised Nash Dynamics, but you’ll have to read the paper if you want to understand it any more deeply than that.

On the Diplomacy side, we have an AI named Cicero (ah, hubris!) from Meta and CSAIL that manages to play the game at a human level. If that sounds like damning with faint praise, remember that Diplomacy is difficult for most humans to play at a human level. The level of scheming, backstabbing, false promises, and general Machiavellian antics that people get up to in the game is such that it is banned from many friendly gaming groups. Is a computer really capable of that level of shenanigans?

Seems so, and the advances that make it possible are interesting. After all, the interesting part of Diplomacy isn’t the world map and pieces, which are fairly straightforward to read and evaluate, but the potential for schemes latent in those arrangements. Is Venice being threatened on two fronts, or is it luring the western front into an envelopment through a long-contemplated volte-face?

Not only that, but in order to participate in the scheming, you must speak (or chat, online) to other players and convince them of your sincerity and intent. This takes more than CPU cycles!

Image Credits: Meta

Here’s how Cicero works:

Using the board state and current dialogue, make an initial prediction of what everyone will do.
Refine that prediction using planning, then use the refined predictions to form an intent for itself and its partner.
Generate several candidate messages based on the board state, dialogue, and its intents.
Filter the candidate messages to reduce nonsense, maximize value, and ensure consistency with its intents.
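The four steps above can be read as a simple pipeline. The sketch below shows only that shape. Every function name, the board format, and the message templates are hypothetical placeholders invented for this example, and simple rules stand in for the learned models and planning that a real system would need.

```python
# Toy sketch of the four steps; every function here is a hypothetical stand-in.

def predict_actions(board_state, dialogue):
    # Step 1: initial guess of what each power will do (placeholder: all hold).
    return {power: "hold" for power in board_state}


def form_intents(predictions, me, partner):
    # Step 2: "planning" placeholder that settles on mutual support.
    refined = dict(predictions)
    refined[me] = f"support {partner}"
    refined[partner] = f"support {me}"
    return {me: refined[me], partner: refined[partner]}


def generate_candidates(intents, me, partner):
    # Step 3: fixed templates stand in for a language model.
    return [
        "",                                          # nonsense: empty message
        f"I plan to attack {partner} this turn.",    # contradicts the intent
        f"I plan to {intents[me]} this turn. Will you {intents[partner]}?",
    ]


def filter_candidates(candidates, intents, me):
    # Step 4: drop empty messages and any that do not state our own intent.
    return [m for m in candidates if m.strip() and intents[me] in m]


def choose_message(board_state, dialogue, me, partner):
    predictions = predict_actions(board_state, dialogue)
    intents = form_intents(predictions, me, partner)
    candidates = generate_candidates(intents, me, partner)
    kept = filter_candidates(candidates, intents, me)
    return kept[0] if kept else None


# Made-up board format: power name mapped to a list of unit positions.
board = {"ENGLAND": ["F LON"], "FRANCE": ["A PAR"], "GERMANY": ["A BER"]}
message = choose_message(board, dialogue=[], me="ENGLAND", partner="FRANCE")
print(message)
# I plan to support FRANCE this turn. Will you support ENGLAND?
```

The point of the filter step shows up even in this toy: the message that contradicts the agent's own plan is discarded before anything is sent.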

Then, plead your case and hope the other player isn’t planning your demise.

When set loose on webDiplomacy.net, Cicero played quite well against its opponents, placing 2nd out of 19 in a league and generally outscoring others.

It’s still very much a work in progress — it can lose track of what it’s said to others, or make other blunders humans probably wouldn’t — but it’s pretty remarkable that it can be competitive at all.
