Original Link: http://www.wired.com/wiredenterprise/2013/04/siri-two-years/

According to Apple’s Muller, the company takes steps to ensure that the data is anonymized and only collects the Siri voice clips in order to improve Siri itself. “Our customers’ privacy is very important to us,” she says.

Here’s what happens. Whenever you speak into Apple’s voice-activated personal digital assistant, it ships your recording off to Apple’s data farm for analysis. Apple generates a random number to represent the user and associates the voice files with that number. This number (not your Apple user ID or email address) represents you as far as Siri’s back-end voice analysis system is concerned.

Once the voice recording is six months old, Apple “disassociates” your user number from the clip, deleting the number from the voice file. But it keeps these disassociated files for up to 18 more months for testing and product improvement purposes.
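
To make the timeline concrete, here is a minimal Python sketch of a retention policy like the one described above. It is not Apple's code. The names `VoiceClip`, `new_user_number`, and `apply_retention` are hypothetical, and the day counts are my own approximations of "six months" and "two years."

```python
import secrets
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional

# Assumed approximations of the periods quoted in the article.
DISASSOCIATE_AFTER = timedelta(days=183)  # roughly six months
DELETE_AFTER = timedelta(days=730)        # roughly two years in total


@dataclass
class VoiceClip:
    recorded_at: datetime
    user_number: Optional[int]  # random number, not an Apple ID or email


def new_user_number() -> int:
    """Generate a random number to stand in for a user (hypothetical helper)."""
    return secrets.randbits(64)


def apply_retention(clips: List[VoiceClip], now: datetime) -> List[VoiceClip]:
    """Drop clips past the total retention period and strip the user
    number from clips that are older than the disassociation period."""
    kept = []
    for clip in clips:
        age = now - clip.recorded_at
        if age >= DELETE_AFTER:
            continue  # too old: the clip is not kept at all
        if age >= DISASSOCIATE_AFTER:
            clip.user_number = None  # "disassociate" the clip from the user
        kept.append(clip)
    return kept
```

Under these assumptions, a 10-day-old clip keeps its user number, a 200-day-old clip is kept without one, and an 800-day-old clip is dropped entirely.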

“Apple may keep anonymized Siri data for up to two years,” Muller says. “If a user turns Siri off, both identifiers are deleted immediately along with any associated data.”

Apple keeps data collected from queries to Siri for up to two years…

My biggest question is really this: why are people surprised?

Siri works by remembering more of what you ask it. It uses people’s queries to build its database and become more useful to the user. So why are people surprised by how long it stores that information?