<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Court reporters and multitasking</title>
	<atom:link href="http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/feed/" rel="self" type="application/rss+xml" />
	<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/</link>
	<description>Human Animals at the Crossroads of Culture, Science, Religion and Media</description>
	<pubDate>Mon, 22 Mar 2010 10:30:20 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: NIklaus Pfirsig</title>
		<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/comment-page-1/#comment-55352</link>
		<dc:creator>NIklaus Pfirsig</dc:creator>
		<pubDate>Thu, 05 Nov 2009 03:51:23 +0000</pubDate>
		<guid isPermaLink="false">http://dangerousintersection.org/?p=9996#comment-55352</guid>
		<description>Erich, 
  Much recognition software employs an artificial intelligence programming technique known as a neural net simulation. 
  Neural net simulations run many parallel sub-programs, called nodes, that independently analyze the input and produce a list of possible results. Each node starts with a different list of possible results. Each node votes on which item seems more likely to be the result and percentage of nodes voting for the most popular result. The percentage is the compared to a preset percentage called the confidence threshold level. if the percentage is below the preset confidence threshold level, each node's list is adjusted to favor the more popular result and the input is rerun back through the analysis. This repeats until the confidence threshold level is attained or the voting percentage percentage no longer changes. The actual confidence level score is the ratio of results that meet or exceed the preset confidence threshold level out of all the results processed.

  So  basically the software can be 99 percent confident that you are saying "hair" when you are really saying "hare". 

Back in the 90's I worked with a commercial AI system that read handwritten applications and output test for used in a database system. The system used a 300 MHz cpu that interfaced to a high performance document scanner and a second 300 MHz PC that hosted a custom hardware neural net simulator designed around 4 intel 860 processors (the 860 cpu was a 64-bit high performance pipelined RISC processor, often touted as the "Cray on a chip"). Ten years later, any mid-range PC could do more than the $8000 co-processor board was able to do.

  The AI system achieved about 98 percent accuracy with confidence levels set at around 85. Some of the problem handwriting was difficult for our editing staff to read.</description>
		<content:encoded><![CDATA[<p>Erich,<br />
  Much recognition software employs an artificial intelligence programming technique known as a neural net simulation.<br />
  Neural net simulations run many parallel sub-programs, called nodes, that independently analyze the input and produce a list of possible results. Each node starts with a different list of possible results. Each node votes on which item seems more likely to be the result and percentage of nodes voting for the most popular result. The percentage is the compared to a preset percentage called the confidence threshold level. if the percentage is below the preset confidence threshold level, each node&#8217;s list is adjusted to favor the more popular result and the input is rerun back through the analysis. This repeats until the confidence threshold level is attained or the voting percentage percentage no longer changes. The actual confidence level score is the ratio of results that meet or exceed the preset confidence threshold level out of all the results processed.</p>
<p>  So  basically the software can be 99 percent confident that you are saying &#8220;hair&#8221; when you are really saying &#8220;hare&#8221;. </p>
<p>Back in the 90&#8217;s I worked with a commercial AI system that read handwritten applications and output test for used in a database system. The system used a 300 MHz cpu that interfaced to a high performance document scanner and a second 300 MHz PC that hosted a custom hardware neural net simulator designed around 4 intel 860 processors (the 860 cpu was a 64-bit high performance pipelined RISC processor, often touted as the &#8220;Cray on a chip&#8221;). Ten years later, any mid-range PC could do more than the $8000 co-processor board was able to do.</p>
<p>  The AI system achieved about 98 percent accuracy with confidence levels set at around 85. Some of the problem handwriting was difficult for our editing staff to read.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Erich Vieth</title>
		<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/comment-page-1/#comment-55311</link>
		<dc:creator>Erich Vieth</dc:creator>
		<pubDate>Thu, 05 Nov 2009 00:19:11 +0000</pubDate>
		<guid isPermaLink="false">http://dangerousintersection.org/?p=9996#comment-55311</guid>
		<description>Dan: I can just hear the little computer speaker barking:  Speak up, please!     Then again, that's what court reporters sometimes need to do.</description>
		<content:encoded><![CDATA[<p>Dan: I can just hear the little computer speaker barking:  Speak up, please!     Then again, that&#8217;s what court reporters sometimes need to do.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Klarmann</title>
		<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/comment-page-1/#comment-55309</link>
		<dc:creator>Dan Klarmann</dc:creator>
		<pubDate>Wed, 04 Nov 2009 22:01:26 +0000</pubDate>
		<guid isPermaLink="false">http://dangerousintersection.org/?p=9996#comment-55309</guid>
		<description>Erich, there is a basic difference between what any software does, and what it shows a user. Internally, Dragon knows its own confidence level, the sound levels, the sound distinction levels, the frequency distributions of each sound, and the frequency distribution and volume of the background noise.

For a consumer dictation program, all it displays is its best guess at written words to match the sounds with as little interruption of the free flow of utterance as possible.

I was not suggesting using off-the-shelf 2009 Dragonware in lieu of a stenographer. I was suggesting that the kernel of the program knows when to ask someone to speak more clearly. This could be used to complement other automatic recording devices such as face-centering video cameras with directional mics for each speaker.</description>
		<content:encoded><![CDATA[<p>Erich, there is a basic difference between what any software does, and what it shows a user. Internally, Dragon knows its own confidence level, the sound levels, the sound distinction levels, the frequency distributions of each sound, and the frequency distribution and volume of the background noise.</p>
<p>For a consumer dictation program, all it displays is its best guess at written words to match the sounds with as little interruption of the free flow of utterance as possible.</p>
<p>I was not suggesting using off-the-shelf 2009 Dragonware in lieu of a stenographer. I was suggesting that the kernel of the program knows when to ask someone to speak more clearly. This could be used to complement other automatic recording devices such as face-centering video cameras with directional mics for each speaker.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Erich Vieth</title>
		<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/comment-page-1/#comment-55259</link>
		<dc:creator>Erich Vieth</dc:creator>
		<pubDate>Wed, 04 Nov 2009 07:01:24 +0000</pubDate>
		<guid isPermaLink="false">http://dangerousintersection.org/?p=9996#comment-55259</guid>
		<description>Dan:  That is often not my experience.   When I use Dragon, it spits out the closest fit to the words I utter, and they can sometimes be dramatically different than what I utter.    It doesn't display any sort of confidence level--Dragon is ALWAYS confident!   The exception would be if I were to cough, at which point Dragon doesn't recognize any text.   Any time I say real words, Dragon prints real words (quite often, but not always, the same words I am speaking).  

But as I mentioned above, 99% accuracy would quickly get a court reporter fired.</description>
		<content:encoded><![CDATA[<p>Dan:  That is often not my experience.   When I use Dragon, it spits out the closest fit to the words I utter, and they can sometimes be dramatically different than what I utter.    It doesn&#8217;t display any sort of confidence level&#8211;Dragon is ALWAYS confident!   The exception would be if I were to cough, at which point Dragon doesn&#8217;t recognize any text.   Any time I say real words, Dragon prints real words (quite often, but not always, the same words I am speaking).  </p>
<p>But as I mentioned above, 99% accuracy would quickly get a court reporter fired.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Klarmann</title>
		<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/comment-page-1/#comment-55190</link>
		<dc:creator>Dan Klarmann</dc:creator>
		<pubDate>Wed, 04 Nov 2009 04:10:29 +0000</pubDate>
		<guid isPermaLink="false">http://dangerousintersection.org/?p=9996#comment-55190</guid>
		<description>Dragon may not yet be perfect in transcription, but it could easily tell when it is having trouble, as in mumbling, indistinct word separations, and overall volume (the causes of "speak up").</description>
		<content:encoded><![CDATA[<p>Dragon may not yet be perfect in transcription, but it could easily tell when it is having trouble, as in mumbling, indistinct word separations, and overall volume (the causes of &#8220;speak up&#8221;).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Erich Vieth</title>
		<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/comment-page-1/#comment-55157</link>
		<dc:creator>Erich Vieth</dc:creator>
		<pubDate>Tue, 03 Nov 2009 23:44:18 +0000</pubDate>
		<guid isPermaLink="false">http://dangerousintersection.org/?p=9996#comment-55157</guid>
		<description>Dan:  I would add that in addition to oral evidence a second major category of evidence is documentary evidence (including written statements, photographs and business records) which, in fact, drives the verdict in many cases.   Other types of evidence are also important (e.g., physical objects, scientific tests), though these things do rely upon human beings to establish a foundation for admissibility.  

My experience with voice activation (and I'm a constant user of Dragon), is that we are a long way from relying on it to get an accurate account.  As amazing as it is as a dictation tool, 99% accuracy would get a human court reporter fired in a heartbeat.    Many of my posts are written through Dragon.   I always need to make a few corrections before publishing--and you have been gracious to point out typos that I miss on a regular basis.</description>
		<content:encoded><![CDATA[<p>Dan:  I would add that in addition to oral evidence a second major category of evidence is documentary evidence (including written statements, photographs and business records) which, in fact, drives the verdict in many cases.   Other types of evidence are also important (e.g., physical objects, scientific tests), though these things do rely upon human beings to establish a foundation for admissibility.  </p>
<p>My experience with voice activation (and I&#8217;m a constant user of Dragon), is that we are a long way from relying on it to get an accurate account.  As amazing as it is as a dictation tool, 99% accuracy would get a human court reporter fired in a heartbeat.    Many of my posts are written through Dragon.   I always need to make a few corrections before publishing&#8211;and you have been gracious to point out typos that I miss on a regular basis.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Klarmann</title>
		<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/comment-page-1/#comment-55154</link>
		<dc:creator>Dan Klarmann</dc:creator>
		<pubDate>Tue, 03 Nov 2009 19:34:41 +0000</pubDate>
		<guid isPermaLink="false">http://dangerousintersection.org/?p=9996#comment-55154</guid>
		<description>Legacy is the basis of jurisprudence. One example that bugs me is that eyewitness accounts are considered the only legal form of evidence. But it has been thoroughly proven that this is the least reliable form of evidence. My experience sitting on juries is that testimony falls somewhere between 
a) An account of what the witness thinks they remember of what they believe they saw, and 
b) What they invent and hope you'll believe. As in, "Naw, I ain't told [those other witnesses] what they's all said I did"

Having a stenographer take down every word predates typing or other recording devices. It does still serve the two needs Erich ended up with, prompting for verbal clarity as needed, and providing an impartial witness to the proceedings.

But a Dragonware-like app can almost already do such prompting, and the stenographer is not actually a conscious witness (as Erich described). She can only testify that the words did flow through her to the tape.</description>
		<content:encoded><![CDATA[<p>Legacy is the basis of jurisprudence. One example that bugs me is that eyewitness accounts are considered the only legal form of evidence. But it has been thoroughly proven that this is the least reliable form of evidence. My experience sitting on juries is that testimony falls somewhere between<br />
a) An account of what the witness thinks they remember of what they believe they saw, and<br />
b) What they invent and hope you&#8217;ll believe. As in, &#8220;Naw, I ain&#8217;t told [those other witnesses] what they&#8217;s all said I did&#8221;</p>
<p>Having a stenographer take down every word predates typing or other recording devices. It does still serve the two needs Erich ended up with, prompting for verbal clarity as needed, and providing an impartial witness to the proceedings.</p>
<p>But a Dragonware-like app can almost already do such prompting, and the stenographer is not actually a conscious witness (as Erich described). She can only testify that the words did flow through her to the tape.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Klarmann</title>
		<link>http://dangerousintersection.org/2009/11/03/court-reporters-and-multitasking/comment-page-1/#comment-55152</link>
		<dc:creator>Dan Klarmann</dc:creator>
		<pubDate>Tue, 03 Nov 2009 19:08:14 +0000</pubDate>
		<guid isPermaLink="false">http://dangerousintersection.org/?p=9996#comment-55152</guid>
		<description>Back in the 1990's it was conclusively proven that our brains multitask at the symbolic level. That is, many things are busily going on while the little piece that runs your conscious awareness only monitors one. The mind can be trained much like the body, allowing reflexes to handle many complex functions without oversight of the conscious mind. 

Why is it that we are not amazed that one can walk without intentionally telling each quadricep (and each of the hundred other muscles involved) when to contract and or being aware of all the sensations involved (orientation, rotation, balance, forces, wind, etc), yet it amazes us that we can process specific types of information, as in-the-ear-out-the-fingers transcription? 
What amazes me is that my wife can converse while typing something else. At least until she empties the buffer of what she'd read or thought of to write; a couple of sentences. Amazing.
I think a word and it comes out my fingers. But I am limited to a word at a time. If I invested the time to train, I could get to her paragraphic level of typing. Probably. Old dog.</description>
		<content:encoded><![CDATA[<p>Back in the 1990&#8217;s it was conclusively proven that our brains multitask at the symbolic level. That is, many things are busily going on while the little piece that runs your conscious awareness only monitors one. The mind can be trained much like the body, allowing reflexes to handle many complex functions without oversight of the conscious mind. </p>
<p>Why is it that we are not amazed that one can walk without intentionally telling each quadricep (and each of the hundred other muscles involved) when to contract and or being aware of all the sensations involved (orientation, rotation, balance, forces, wind, etc), yet it amazes us that we can process specific types of information, as in-the-ear-out-the-fingers transcription?<br />
What amazes me is that my wife can converse while typing something else. At least until she empties the buffer of what she&#8217;d read or thought of to write; a couple of sentences. Amazing.<br />
I think a word and it comes out my fingers. But I am limited to a word at a time. If I invested the time to train, I could get to her paragraphic level of typing. Probably. Old dog.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
