<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Strata &#187; Fred Trotter</title>
	<atom:link href="http://strata.oreilly.com/fredt/feed" rel="self" type="application/rss+xml" />
	<link>http://strata.oreilly.com</link>
	<description>Making Data Work</description>
	<lastBuildDate>Sun, 16 Jun 2013 16:18:16 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Moving to the open health-care graph</title>
		<link>http://strata.oreilly.com/2013/06/moving-to-the-open-health-care-graph.html</link>
		<comments>http://strata.oreilly.com/2013/06/moving-to-the-open-health-care-graph.html#comments</comments>
		<pubDate>Tue, 04 Jun 2013 14:30:44 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[health care]]></category>
		<category><![CDATA[health care data]]></category>
		<category><![CDATA[health care disruption]]></category>
		<category><![CDATA[open health-care graph]]></category>

		<guid isPermaLink="false">http://strata.oreilly.com/?p=57480</guid>
		<description><![CDATA[To achieve the the triple aim in healthcare (better, cheaper, and safer), we are going to need intensive monitoring and measurement of specific doctors, hospitals, labs and countless other clinical professionals and clinical organizations. We need specific data and specific &#8230; ]]></description>
				<content:encoded><![CDATA[<p>To achieve the <a href="http://www.ihi.org/offerings/Initiatives/TripleAim/Pages/default.aspx">the triple aim</a> in healthcare (better, cheaper, and safer), we are going to need intensive monitoring and measurement of specific doctors, hospitals, labs and countless other clinical professionals and clinical organizations. We need specific data and specific doctors.</p>
<p>In <a href="http://www.leagle.com/xmlResult.aspx?xmldoc=19791770479FSupp1291_11604.xml&amp;docbase=CSLWAR1-1950-1985">1979</a> a Federal judge in Florida sided with the AMA to prevent these kinds of provider-specific data sets violated doctor privacy. Last Friday, a different Florida judge reversed the 1979 injunction, allowing provider identified data to be released from CMS under FOIA requests. Even without this tremendous <a href="http://online.wsj.com/article/SB10001424127887324682204578517653435889448.html">victory for the Wall Street Journal</a>, there was already a shift away from aggregation studies in healthcare towards using Big Data methods on specific doctors to improve healthcare. This critical shift will allow us to determine which doctors are doing the best job, and which are doing the worst. We can target struggling doctors to help improve care, and we can also target the best doctors, so that we can learn new best practices in healthcare.</p>
<p>Evidence-based medicine must be targeted to handle specific clinical contexts. The only really open questions to decide are &#8220;how much data should we relese&#8221; and &#8220;should this be done in secret or in the open.&#8221; I submit that the targeting should be done at the individual and team level, and that this must be an open process. We need to start tracking the performance and clinical decisions of specific doctors working with other specific doctors, in a way that allows for public scrutiny. We need to release uncomfortably personal data about specific physicians and evaluate that data in a fair manner, without sparking a witch-hunt. And whether you agree with this approach or not, it&#8217;s already underway. The overturning of this court case will only open the flood gates further.<span id="more-57480"></span></p>
<p>Last year, I <a href="http://www.youtube.com/watch?v=47qEV3WzHFw">released DocGraph at Strata RX</a>. DocGraph details how specific doctors and hospitals team together to deliver healthcare. (data on referral patterns, etc). At that conference we (<a href="http://notonlydev.com">Not Only Development</a>) promised that this data set would go Open Source eventually. This month, we will be announcing that the DocGraph data set is available for costless download.  But there are two other data sets that can now be used to make that graph data much richer.</p>
<p>&#8220;Graph&#8221; or &#8220;network&#8221; data, in this context, refers to a computing technique that represents nodes (in this case doctors, hospitals, etc.) connected by edges (in this case, representing which doctors and hospitals work together). So DocGraph explicitly states that Dr. Smith (a primary care doctor) is working with Dr. Jones (a cardiologist), and assigns a strength to that relationship.</p>
<p>In order to take a network graph approach to modeling the healthcare system, you have to name names. We have had named, specific, and incriminating <a href="http://www.medicare.gov/hospitalcompare/">data on US hospitals</a> for years now. This openness has been at the heart of a revolution in the field of <a href="http://en.wikipedia.org/wiki/Patient_safety">patient safety</a>. I believe that this openness has been a central motivator for the ongoing reduction of &#8220;never events&#8221;, perhaps even more important than corresponding payment reforms. On some cases, I even <a href="http://www.ihi.org/knowledge/Pages/Publications/EffectNonpaymentPreventableInfectionsUSHospitals.aspx">have data to back this assertion</a>.</p>
<p>I am a devotee of <a href="https://www.youtube.com/watch?v=u6XAPnuFjJc">new thinking about human motivation</a> and I believe very strongly the most doctors, even the &#8220;bad&#8221; ones, want to be the best they can at what they do, largely independent of financial incentives. But doctors need uncomfortably personal data on how they, specifically, are doing in order to start doing it better.</p>
<p>This has been the month for the open release of uncomfortably personal data about the healthcare system.</p>
<p>First, HHS announced the release of data on master charge sets for hospitals. This is likely a direct response to the problems with master charge sets in the masterful <a href="http://www.time.com/time/magazine/article/0,9171,2136864,00.html#ixzz2TruCzkWE">Bitter Pill: Why Medical Bills Are Killing Us</a> article by Steven Brill in Time Magazine. The <a href="http://www.cms.gov/research-statistics-data-and-systems/statistics-trends-and-reports/medicare-provider-charge-data/index.html">data released by CMS</a> is a complex data set about an even more complex medico-legal issue, but a useful oversimplification is this: it shows which hospitals have been the worst abusers of cash-paying, uninsured patients. You could write <a href="http://swampland.time.com/2013/05/08/an-end-to-medical-billing-secrecy/">entire articles</a> about the structure, value, and depth of this data set&#8230;and I plan to, given infinite time and resources.</p>
<p>But then, in the midst of this, <a href="http://projects.propublica.org/checkup/">Propublica released data on the prescribing patterns of almost every doctor in the US</a>. This is detailed information about the preferences of almost every doctor who participates in the Medicare Part D prescription program, which is almost every doctor. Does your doctor prefer Oxycontin to Vicodin? Which antibiotic does your doctor use most frequently? These choices, taken together, can be called a &#8220;prescribing pattern.&#8221; Propublica is allowing the public to view those patterns for specific doctors.</p>
<p>These data sets are having a combinatorial impact for those of us who are interested in researching the healthcare system.</p>
<p>Now you can see if your doctor is a conservative ( <a href="http://projects.propublica.org/checkup/providers/199132">o</a> <a href="http://projects.propublica.org/checkup/providers/632523">1</a> <a href="http://projects.propublica.org/checkup/providers/1584701">2</a> <a href="http://projects.propublica.org/checkup/providers/545197">3</a> <a href="http://projects.propublica.org/checkup/providers/545197">4</a> <a href="http://projects.propublica.org/checkup/providers/963204">5</a>) or a liberal (<a href="http://projects.propublica.org/checkup/providers/991133">1</a> <a href="http://projects.propublica.org/checkup/providers/1062967">2</a> <a href="http://projects.propublica.org/checkup/providers/753957">3</a>) prescriber, and you can see <a href="http://pastebin.com/E7Mv8RmL">if they refer</a> patients to a <a href="http://www.washingtonpost.com/blogs/wonkblog/wp/2013/05/08/one-hospital-charges-8000-another-38000/">hospital that charges more than double what the one across the street does</a>.</p>
<p>Taken together, these changes in data release policy represent two important shifts in the analysis of the healthcare system. We are moving from proprietary analysis to open analysis and from aggregate data to graph (network) data. These moves parallel past scientific process breakthroughs:</p>
<blockquote><p>Proprietary science -&gt; Open Science: illustrated by the move from Alchemy to Chemistry. Alchemists were famous for doing work in secret, hoping to learn and horde the secrets for turning lead into gold or the secret to eternal life. Chemistry began when researchers gave up secrecy and started sharing important results openly.</p>
<p>Aggregate models -&gt; Network models: illustrated by the move from chromosomal models of genetic inheritance to <a href="http://en.wikipedia.org/wiki/Omics">omic</a> (genomic/proteomic/etc.) network models of genetic inheritance. Mendel could spot patterns in the colors of his peas because the those genes operated on the chromosomal level. The chromosome acts as a natural phenotypical aggregator for a much more complex genetic processes. But that aggregation limits what can be studied. The discovery of DNA allowed researchers to start analyzing inheritance using network models.</p></blockquote>
<p>For years, there has been a proprietary market for data about how doctor behave, specifically around prescribing patterns. IMS, for instance, is a <a href="http://www.imshealth.com/">leading data vendor</a> for this information. If you had wanted to purchase prescribing or referral pattern data, IMS will happily provide it (hint: you can&#8217;t afford it). Despite the high barrier to access, IMS&#8217;s service has been very unpopular with doctors, and the AMA successfully lobbied for a mechanism that would allow for a doctor to <a href="http://www.ama-assn.org/ama/pub/about-ama/physician-data-resources/ama-database-licensing/amas-physician-data-restriction-program.page">opt-out of these prescribing databases</a>. So there is a well-established data vendor community here, with some recent Big Data entrants.</p>
<p>Two companies using Big Data graph analysis methods on doctor data have had high-profile funding events. <a href="http://www.crunchbase.com/company/mednetworks">Activate Networks was funded for 10 million in series B</a> and <a href="http://www.crunchbase.com/company/kyruus">Kyruus received 11 million in series B</a>. The list of  <a href="http://kyruus.com/about-us/management-team/">people at Kyruus</a> and <a href="http://www.activatenetworks.net/people/founders/">people at Activate Networks</a> are filled with the rock stars of this nascent industry; who have published <a href="http://www.ncbi.nlm.nih.gov/pubmed/22797644">seminal papers in the field</a>. However, like IMS, these companies are pursuing proprietary data approach to graph analysis of the healthcare system.</p>
<p>This is not necessarily by choice—most doctor data is released reluctantly by data owners. They are concerned with ensuring that doctor data does not spill into the public domain. In order to run their businesses, IMS, Activate, and Kyruus have likely made contractual promises that require them to keep of much of the data that they have access to private. In short, these companies are &#8220;behind the curtain&#8221; of healthcare informatics. They get substantial benefits by having access to this kind of private healthcare data and they must accept certain limitations in its use. My limited interactions with these companies has shown that they are as enthusiastic about open data in healthcare as I am.</p>
<p>When even proprietary players want to shift to more open accountable data models, it is fair to say that this shift is widely accepted. As a society, it is critical for us to move this graph healthcare data, as much as possible, into the open. This will allow data scientists from IMS, Activate, Kyruus and others to collaborate with journalists, academics and the open source developer community to make doctor and hospital performance into an open science. HHS has done its part by proactively releasing new critical data sets and by electing not to &#8220;fight&#8221; FOIA requests that seek even more data. This is substantive evidence that the <a href="http://www.whitehouse.gov/the-press-office/2013/05/09/obama-administration-releases-historic-open-data-rules-enhance-governmen">mission of open data</a>, inspired by President Obama and implemented by Federal CTO Todd Park, is a reality.</p>
<p>The second shift is away from aggregate models for healthcare researchers. While Propublica, Kyruus, and Activate have the Big Data chops to lead this shift, the rank and file healthcare researcher still routinely uses aggregate data to analysis the healthcare system as the method of choice. This is true of academic researchers, healthcare industry administrators, and policy makers alike. Understanding statistics is really the first step towards being a well-rounded data scientist, but it is only the first step.</p>
<p>Traditional statistical approaches, like traditional economic approaches, are powerful because they make certain simplifying assumptions about the world. Like many generalizations they are useful cognitive shortcuts until they are too frequently proven untrue. There is no &#8220;normal&#8221; prescribing pattern, for instance, to which a given provider can be judged.</p>
<p>Using averages, across zipcode, city, state or regional boundaries is a useful way to detect and describe the problems that we have in healthcare. But it is a terrible way to create feedback and control loops. An infectious disease clinic, for instance, will be unperturbed to learn that it has a higher &#8220;rate of infection&#8221; scores than neighboring clinics. In fact they are a magnet for infection cases, and hopefully should have a higher infection &#8220;case loads&#8221;, but a lower &#8220;infection&#8221; rates. Many scoring systems are unable to make these kinds of distinctions. Similarly, a center for cancer excellence would not be surprised to learn that they have a shorter life span scores then other cancer treatment centers. Any &#8220;last resort&#8221; clinical center would show those effects as the attract the most difficult cases.</p>
<p>It is difficult to use averages, scores, and other simplistic mathematical shortcuts to detect real problems in healthcare. We need a new norm where the average healthcare researcher&#8217;s initial tool of choice is <a href="https://gephi.org/">Gephi</a> rather than Excel and <a href="http://www.neo4j.org/">Neo4J</a> rather than SQL. The aggregate approach has taken the healthcare system this far, but we need to have deeper understandings of how the healthcare system works—and fails—if we want to achieve the triple aim. We need models that incorporate details about specific doctors and hospitals. We need to move from simplistic mathematical shortcuts to complex mathematical models, Big Data if you like that term.</p>
<p>We need to have both shifts at the same time. It is not enough to have the shift to open data, in aggregate, or the shift to network models trapped behind the &#8220;insiders curtain&#8221;.  That has been happening for years and this creates troubling power dynamics. When the public can see only averages, but the insiders get to see the graph of healthcare, we will enjoy only narrow optimizations and limited uses.</p>
<p>Currently, there is no simple way for data scientists at an EHR company, an insurance company, and a drug company to teach each other how to better leverage the healthcare graph. Each of these companies has a lens on the &#8220;true&#8221; graph created using only a slice of the relevant data. but the limitations of the data are really the least of the problems facing these researchers working in isolation. For each data scientist, working in isolation, there is no way to generally test hypothesis with outsiders. There is no way to &#8220;<a href="http://store-xkcd-com.myshopify.com/products/try-science">stand back and try science</a>&#8221; because science is a community process.</p>
<p>We need to create a community of healthcare graph researchers and provide that community with non-aggregate data it needs to create the algorithms that will dictate how medicine operates for the next century. This is not a project for any single company to take on; we are betting too much as a society to have that kind of pressure. No company or data scientist I know of even wants that kind of role. Instead, every company in the space is interested in leveraging and contributing open data, so that the hypothesis and methods developed &#8220;behind the curtain&#8221; can be validated in the open.</p>
<p>Before the release of these three data sets, data scientists were in the tremendously uncomfortable position of having to make critical business decisions while being only &#8220;probably&#8221; right. Given the ease with which &#8220;probably right&#8221; can turn into &#8220;<a href="http://seattletimes.com/html/opinion/2020820167_paulkrugmancolumnexcelxml.html">completely wrong</a>&#8221; with data, we should work hard to ensure that data scientists are not put in this position again.</p>
<div style="float: left;border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px;clear: both"><a href="http://strataconf.com/rx2013?intcmp=il-strata-strx13-strata-blog-banner-148x178"><img style="float: left;border: none;padding-right: 10px" src="http://cdn.oreilly.com/radar/images/promos/2013-stratarx.gif" /></a><a href="http://strataconf.com/rx2013?intcmp=il-strata-strx13-strata-blog-banner-148x178"><strong>Strata Rx Heath Data Conference</strong></a> &mdash; Strata Rx brings together the diverse communities driving innovations in big data analytics for health care. Learn about the transformation of health care through big data and how to position your company to benefit from these trends. <a href="http://strataconf.com/rx2013?intcmp=il-strata-strx13-strata-blog-banner-148x178">Learn more</a>.</div>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2013/06/moving-to-the-open-health-care-graph.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Preferred structures for cleaned-up doctor data</title>
		<link>http://strata.oreilly.com/2013/01/preferred-structures-for-cleaned-up-doctor-data.html</link>
		<comments>http://strata.oreilly.com/2013/01/preferred-structures-for-cleaned-up-doctor-data.html#comments</comments>
		<pubDate>Fri, 18 Jan 2013 14:00:23 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[data project]]></category>
		<category><![CDATA[DocGraph]]></category>
		<category><![CDATA[doctor data]]></category>
		<category><![CDATA[health data]]></category>
		<category><![CDATA[medical data]]></category>
		<category><![CDATA[StrataRX]]></category>

		<guid isPermaLink="false">http://strata.oreilly.com/?p=54026</guid>
		<description><![CDATA[The DocGraph project has an interesting issue that I think will become a common one as  the open data movement continues. For those that have not been keeping up, DocGraph was announced at Strata RX, described carefully on this blog, &#8230; ]]></description>
				<content:encoded><![CDATA[<p>The <a href="http://docgraph.notonlyfor.com/">DocGraph project</a> has an interesting issue that I think will become a common one as  the open data movement continues. For those that have not been keeping up, DocGraph was <a href="http://www.youtube.com/watch?v=47qEV3WzHFw">announced at Strata RX</a>, <a href="http://strata.oreilly.com/2012/11/docgraph-open-social-doctor-data.html">described carefully</a> on this blog, and will be <a href="http://strataconf.com/strata2013/public/schedule/detail/27662">featured again at Strata 2013</a>. For those that do not care to click links, DocGraph is a crowdfunded open data set, which merges open data sources on doctors and hospitals.</p>
<p>As I recently described on the DocGraph mailing list, work is underway to acquire the data sets that we set out to merge. The issue deals with file formats.</p>
<p>The core identifier for doctors, hospitals and other healthcare entities is the <a href="http://en.wikipedia.org/wiki/National_Provider_Identifier">National Provider Identifier</a> (NPI). This is something like a Social Security number for doctors and hospitals. In fact it was created in part so that doctors would not need to use their Social Security numbers or other identifiers in order to participate in healthcare financial transactions (i.e. paid by insurance companies for their services). The NPI is the &#8220;one number to rule them&#8221; in healthcare and we want to map data from other sources accurately to that ID.</p>
<p>Each state releases none, one or several data files that can be purchased and also contain doctor data. But these file downloads are in &#8220;random file format X.&#8221; Of course we are not yet done with our full survey of the files and their formats, but I can assure you that they are mostly <a href="http://en.wikipedia.org/wiki/Comma-separated_values">CSV files</a> and a troubling number of PDF files. It is our job to take these files and merge them against the NPI, in order to provide a cohesive picture for data scientists.</p>
<p>But the data available from each state varies greatly. Sometimes they will have addresses, sometimes not. Sometimes they will have fax numbers, sometimes not, sometimes they will include medical school information, some will not. Sometimes they will simply include the name of the medical school, sometimes they will use a code. Sometimes when they use codes they will make up their own &#8230;</p>
<p>I am not complaining here. We knew what we were getting ourselves into when we took on the DocGraph project. The community at large has paid us well to do this work! But now we have a question? What data formats should we support?<span id="more-54026"></span></p>
<p>The simple answer is that everyone can handle JSON, XML or CSV and we will probably end up supporting all of those formats to some degree. XML is famous for its capacity to handle &#8220;constraints&#8221; that solve some or all of the problems that we are considering addressing. Of course, there is some work on similar <a href="http://json-schema.org/">constraint systems for JSON</a>. <a href="http://lloyd.io/orderly-jsonschema">This guy thinks more is needed</a>, <a href="http://www.ietf.org/mail-archive/web/apps-discuss/current/msg07201.html">This guy thinks less is needed</a>. Both of them seem to have thought about it more than I have.</p>
<p>What I am concerned with is how to architect the schema(s) for these merges. As we merge the doctor data, we will be making guesses about what a particular field in a file download means? To what degree should we expose those guesses?</p>
<p>How do we merge data sources that will have overlapping and potentially redundant data sources (like addresses for doctors that come from five different files)? How do we handle metadata, when so much of the metadata involves which file/state a data point originated from? Some people will really want to know that, but most people want a slim data structure that is easy to work with.</p>
<p>I feel somewhat uneasy just asking the community of data scientists about these issues, but it ends up being a pretty big deal in the long run. We hope to maintain this as an open dataset, which means that we want people to be able to rely on these file formatting decisions for as long as possible (we will be versioning the format) . Eventually people will start to merge this dataset with DNA and phonemic data, including geo-aware population health data. That is going to be a pretty complicated process and I really want to save headaches for the people involved in this kind of work.</p>
<p>Also important is the size of this dataset. The core data file is an almost 5 GB CSV file. I expect the state files to be around a 1 GB each. Add all of that up and add JSON/XML formatting and you are talking about a lot of space. How can we chose a format that will make working with this dataset easier? So far, we have seen lots of <a href="https://github.com/rweald/docgraph-data-analysis">cool stuff</a> because we kept things simple for data scientists.</p>
<p>I would like to open the floor to comments (on this blog post), which the DocGraph project will be taking very seriously. When you comment, if you could mention other &#8220;open projects that have solved similar problems and how they did it, that would be great. It would also be great to have links to any relevant SQL or document-based layout standards that might help us. If you are going to provide an academic reference, please also let us know if you have any experience working with those standards. While we really want to solve this problem in a parsimonious way, we do have practical usability as our primary aim here. Thanks for your help in advance.</p>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://strata.oreilly.com/2012/11/docgraph-open-social-doctor-data.html">DocGraph: Open social doctor data</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2013/01/preferred-structures-for-cleaned-up-doctor-data.html/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>DocGraph: Open social doctor data</title>
		<link>http://strata.oreilly.com/2012/11/docgraph-open-social-doctor-data.html</link>
		<comments>http://strata.oreilly.com/2012/11/docgraph-open-social-doctor-data.html#comments</comments>
		<pubDate>Mon, 19 Nov 2012 14:00:30 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[crowdfund]]></category>
		<category><![CDATA[DocGraph]]></category>
		<category><![CDATA[health care]]></category>
		<category><![CDATA[health data]]></category>
		<category><![CDATA[health IT]]></category>
		<category><![CDATA[StrataRX]]></category>

		<guid isPermaLink="false">http://strata.oreilly.com/?p=52818</guid>
		<description><![CDATA[At Strata RX in October I announced the availability of DocGraph. This is the first project of NotOnly Development, which is a Not Only For Profit Health IT micro-incubator. The DocGraph dataset shows how doctors, hospitals, laboratories and other health &#8230; ]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.youtube.com/watch?v=47qEV3WzHFw">At Strata RX in October I announced the availability of DocGraph</a>. This is the first project of <a href="http://notonlyfor.com">NotOnly Development</a>, which is a Not Only For Profit Health IT micro-incubator.</p>
<p>The DocGraph dataset shows how doctors, hospitals, laboratories and other health care providers team together to treat Medicare patients. This data details how the health care system in the U.S. delivers care.</p>
<p>You can read about <a href="http://thehealthcareblog.com/blog/2012/11/05/tracking-the-social-doctor-opening-up-physician-referral-data-and-much-more/">the basics of this data release</a>, and you can read about <a href="http://thehealth careblog.com/blog/2012/10/12/the-mystery-data-set/">my motivations</a> for making the release. Most importantly, you can still participate in our efforts to <a href="http://www.medstartr.com/projects/82-next-level-doctor-social-graph">crowdfund improvements to this dataset</a>. We have already far surpassed our original $15,000 goal, but you can still get early and exclusive access to the data for a few more days. Once the crowdfunding has ended, the price will go up substantially.</p>
<p>This article will focus on this data from a technical perspective.</p>
<p>In a few days, the crowdfunding (hosted by <a href="http://www.medstartr.com/">Medstartr</a>) will be over, and I will be delivering this social graph to all of the participants. We are offering a ransom license that we are calling &#8220;Open Source Eventually,&#8221; so participants in the crowdfunding will get exclusive access to the data for a full six months before the license to this dataset automatically converts to a Creative Commons license. The same data is available under a proprietary-friendly license for more money. For all of these &#8220;releases,&#8221; this article will be the go-to source for technical details about the specific contents of the file.</p>
<p><span id="more-52818"></span></p>
<p>DocGraph is very likely the largest open and real-named <a href="http://en.wikipedia.org/wiki/Social_graph">Social Graph</a> of any kind. There are almost 1 million entities that appear in the 2011 data. Each of these entities is either a specific person or organization that provides health care services to Medicare patients. Of course, the graphs found in the Facebook, Zynga, Twitter and LinkedIn datasets are far more expansive, but they are also the closely held property of those companies. When other organizations interact with those graphs, they are given access to only slivers of the whole dataset. If you are not an employee of one of the previously named companies, this is probably the largest named graph that you will have access to. (Let me know if you know of a real-named graph dataset that is bigger &#8230;)</p>
<p>This data is keyed using the <a href="http://en.wikipedia.org/wiki/National_Provider_Identifier">National Provider Identifier</a> (NPI). This is a universal identifier for doctors and hospitals and was mandated for the purpose of medical billing by HIPAA as replacement to the UPIN system. (HIPAA did a lot of administrative things beyond patient privacy regulations). Anyone who bills Medicare, or prescribes medication, must have an NPI number. The release and adoption of the NPI was an important component of health care reform in its own right, since the NPI was intended to ensure that doctors had only one identifier rather than maintaining a separate identifier for each insurance company that they billed. This number is basically the equivalent of a social security number for doctors. It would be fairly difficult for a health care provider to provide care without one, and as a result they are fairly ubiquitous. I have been <a href="http://www.fredtrotter.com/category/npi/">working on NPI data for years</a>, at the prompting of the folks from <a href="http://www.npidentify.com/">NPIdentify</a>. It is a rich and messy dataset all on its own!<!--more--></p>
<p>The core NPI database is already an open dataset. You can use the government&#8217;s <a href="https://npiregistry.cms.hhs.gov/NPPESRegistry/NPIRegistryHome.do">NPI search tool</a>, but it sucks. So I built <a href="http://docnpi.com">a better one</a>. If you are a health care provider, you can <a href="https://nppes.cms.hhs.gov/NPPES/StaticForward.do?forward=static.npistart">update your NPI record here</a>. This is a good time to remind doctors that it is a bad idea to list your home address in the NPI data. Because the entire NPI database is public information, and you can <a href="http://nppes.viva-it.com/NPI_Files.html">download the core NPI release file here</a>. For years the NPI database was updated on a monthly basis, but now it is updated weekly.</p>
<p>The DocGraph dataset is fairly large, with exactly 49,685,586 pairs of referring parties. Of course, even with this many links, the actual dataset that I will be providing is relatively small. The 2011 file is 1.3 GB, which includes about 1 million providers participating in the graph at least once. To provide context, there are about 3.7 million entries in the core NPI file.</p>
<p>I sometimes call the DocGraph dataset the &#8220;referral&#8221; dataset, and interactions that are traditionally understood as referral relationships make up the bulk of the data. But strictly speaking the data should be considered a &#8220;teaming&#8221; dataset, which shows when providers work on the same group of patients within the same time frame. I frequently refer to these teaming relationships as &#8220;referrals&#8221; because A. they usually are referrals, and B. this is easier to say than &#8220;participating in the same teaming coupling instance.&#8221;</p>
<p>Specifically, the data represents the number of times that two providers billed Medicare for the same patient within a sliding 30-day window, where at least 11 patients were involved in the transaction. If Provider A sees a patient on January 15, and Provider B sees the same patient on February 15, then that counts as &#8220;+1.&#8221;</p>
<p>In order to ensure that this dataset did not provide any data about patients, we enforce minimum number of patients involved in a given teaming relationship. So for every entry in this dataset, at least 11 patients (which is a standard for CMS somewhere) are involved. This is intended to address the Elvis problem. Everyone knows who Elvis&#8217; doctor is and everyone knows that Elvis is that doctor&#8217;s only patient. Therefore, if Elvis&#8217; doctor is &#8220;referring&#8221; to a cardiologist in this dataset, then everyone would know that Elvis has heart problems. This problem goes away once you include a minimum number of patients  in a transaction, so 11 patients is the floor. So we know that at least 11 patients took part in any given &#8220;referral count.&#8221; Because the patient data is both de-identified and aggregated, there should be no patient privacy concerns in this dataset. (Do let me know if you find evidence that I am wrong about this.)</p>
<p>To further protect patient privacy we cannot tell anything about the number of patients involved beyond the fact that there were at least 11 involved in a given provider-provider relationship. If the same patient sees Doctor A on January 15, Doctor B on February 15 and then again on June 15 and July 15, then that counts as &#8220;2&#8243; referrals in this dataset. When a referral relationship has a score of 1,100 we cannot know if this was 11 patients with 100 referral instances, or 1,100 patients with 1 referral instance, or 10 patients with 10 referral instances and 1 patient with 1,090 referral instances. The whole point here is that we have a score that approximates the strength of the relationship between two entities in the NPI database, and for that purpose it does not really matter what kind of patient flow is being indicated.</p>
<p>Entries in the DocGraph dataset take the form</p>
<pre>1112223334,5556667778,1100</pre>
<p>Where the first number is the NPI of the entity that saw the patient first in time, and the second number is the NPI of the entity that saw the patient the second in time, and the score is the number of times this happend in a 30-day period within a given (2011) year.</p>
<p>I have uploaded the <a href="http://pastebin.com/E7Mv8RmL">entire referral graph for the Methodist Hospital in Houston, Texas</a> to Pastebin as an example of what you can find in the larger file.</p>
<p>Usually, patients go see a primary care doctor and then get referred to specialists. Usually this translates to the primary care doctor being seen first between the two doctors. But frequently a specialist will be seen and then a patient will return to a primary care provider. In fact, it might be possible to use the relative &#8220;directionality&#8221; of the graph to automatically guess which provider was the primary care provider and which was the specialist. For instance, given:</p>
<pre>1112223334,5556667778,100</pre>
<pre>5556667778,1112223334,50</pre>
<p>It might be reasonable to assume that 1112223334 was a primary care provider.</p>
<h2>Doctor and organization types</h2>
<p>Beyond just being a &#8220;wide&#8221; graph with lots of nodes with real-named entities, this dataset is incredibly deep. It is deep because the core NPI public release file contains a tremendous amount of detailed information, which is usually even right!</p>
<p>The first thing that the NPI file contains is at least one and possibly many different types of provider-type taxonomy. These provider types are coded in a provider ontology maintained by the American Medical Association&#8217;s National Uniform Claim Committee (I would like to thank the members for performing in this usually thankless task. Committee participation gives me a headache.) You can download this <a href="http://www.nucc.org/index.php?option=com_content&amp;view=article&amp;id=14&amp;Itemid=125">Health Care Provider Taxonomy</a>, or you can <a href="http://codelists.wpc-edi.com/mambo_taxonomy_2_pre_production.asp">browse it online</a>.</p>
<p>The good news here is that the NPI database uses a provider-type taxonomy. However, there is little justification that this should be a &#8220;tree&#8221; style taxonomy. The assumption in a hierarchical taxonomy is that leaves can only have one &#8220;parent.&#8221;  This means that a given &#8220;doctor type&#8221; can be either listed under the &#8220;cardiology&#8221; group or the &#8220;pediatrics group,&#8221; but not both. As a result there a lots of very arbitrary groupings for doctor types here. Doctors cannot find a sensible way to navigate this &#8220;tree&#8221; style taxonomy when they sign up. Since they have to choose something, they usually get at least one &#8220;type&#8221; correct, but often, this database does not correctly represent the breadth of a given doctor&#8217;s actual specializations.</p>
<p>Still, it is a good assumption that the provider taxonomy field in the core NPI file is usually correct, and as a result, it is possible to distinguish effectively between hospitals, primary care doctors, specialists types, and laboratories in the referral dataset. In fact, one of the most frequent &#8220;referrals&#8221; in the data is the referral to get lab work done at LabCorp, Quest or one of the local lab providers. Referrals to hospital emergency departments (which are not referrals at all, of course) and treatment facilities like DaVita are also very common. I have uploaded the top 100 organizations by the number of entries that they have in the dataset to Pastebin so that you can clearly see <a href="http://pastebin.com/mdVEc2M9">the types of relationships that will be most common in the data</a>.</p>
<h2>Geocoding</h2>
<p>The core NPI database also contains two addresses for each NPI record, the practice location address and a mailing address. I have done queries against the Open Street Map database and about 80% of the addresses are already coded to latitude or longitude. There are zip codes that can be used to detect general location for the other 20%.</p>
<p>This means that is going to be possible to run all kinds of geo-data queries against this dataset. There are all kinds of other geo databases that can be overlaid against this referral database in order to reach interesting conclusions. You could easily, for instance, study referrals to allergy doctors in relationship to geo-recorded air quality scores. Let me know if you make some pretty maps and I will try to give you a shout out on Twitter or on my blog.</p>
<h2>Hospital data</h2>
<p>Quality and performance data for individual doctors is pretty hard to come by. However, there has been an explosion in the availability of hospital data that details how hospitals perform on critical issues like readmission rates and central line infection rates. Frequently this quality data is coded natively using NPIs and it is usually pretty simple to convert this data to NPI coded when the NPI is not directly available.</p>
<p>This hospital data will be part of what we are <a href="http://www.medstartr.com/projects/82-next-level-doctor-social-graph">trying to merge together for our improved DocGraph project</a>.</p>
<p>There are lots of interesting questions that you can ask regarding this dataset. For instance, using the DocGraph, you can determine which cardiologists are referring to hospitals with poor central line infection rates.</p>
<h2>State level credentialing data</h2>
<p>Every state has a state level medical board that releases data on individual doctors. This data usually includes what medical school a doctor attended, what board certifications they maintain, and any disciplinary actions that the state board has undertaken against this doctor. Unfortunately this data is expensive between ($50 and $500 per state) and rarely coded using an NPI.</p>
<p>This is the largest single disconnected data source from the current NPI database. <a href="http://www.medstartr.com/projects/82-next-level-doctor-social-graph">Buying and normalizing this data is the first goal of our Next Level crowdfunding effort</a>.</p>
<p>Using this data it would be simple to determine if attending the same medical school was an important part of how doctors refer. It will also be possible to determine if board certification has an impact on referral patterns.</p>
<h2>Non-profit data</h2>
<p>I have just discovered <a href="http://boingboing.net/2012/11/01/tax-returns-for-6461326-tax.html">through BoingBoing</a> that resource.org will be performing <a href="https://bulk.resource.org/irs.gov/eo/doc/mef.html">data extraction</a> on its <a href="https://bulk.resource.org/irs.gov/eo/readme.html">enormous cache of non-profit tax filings</a>. Once it is possible to import this as a data source, it should be possible to figure out what the NPI for different non-profit hospital systems are. Once this is done, it would be possible to see how executive compensation works with the graph. This is not something we will be doing with our initial improvement project, but this is obviously where we would like to take this!</p>
<h2>Limitations</h2>
<p>As with all rich datasets, this data is convoluted and can be confusing. Already we have seen patterns that do not make sense unless you understand how the data was built. This is based on administrative data and not clinical data. So this shows not how patients were &#8220;treated&#8221; together, but how they were &#8220;billed&#8221; together. There are several important artifacts to consider as the result of this.</p>
<p>First, individual providers frequently have two NPIs in the database. One for them as an individual and another for an organization that exists only for that individual. Seeing an NPI for &#8220;Dr. Smith&#8221; and another for &#8220;Dr. Smith, LLC&#8221; is not uncommon. As a result of this, frequently a given provider will not even appear in the DocGraph at all. Some digging is often required to determine how a given doctor is actually interacting with Medicare. Frequently, when an individual NPI and organizational NPI share an address, this means they are working as one unit.</p>
<p>Sometimes, it is the organization that is hidden in the billing. We have already seen cases where a given primary care provider is referring to more than 20 different cardiologists. A little further digging shows that these cardiologists are all part of a cardiology service. Obviously the primary care doctor is referring to the cardiology service, and does not actually have relationships with the individual cardiologists. However, there is no way, using just the DocGraph, to tell the difference between a &#8220;service&#8221; where doctors bill as individuals and a group of unaffiliated doctors.</p>
<p>There are cases where cardiologists list themselves as cardiac surgeons in the NPI database and vice versa. Doctors frequently rely on administrative staff members to fill out the data in the NPI database, and they often get it wrong. This issue impacts the quality of addresses and countless other issues. The current NPI registration form attempts to normalize addresses, but the older forms did not. Keep in mind that this data is just as subject to user entry errors as anything else.</p>
<p>In many cities, a given hospital will win the contract for the main emergency room. This often makes it seem like every provider in the city has a &#8220;referral&#8221; relationship with that hospital. Of course, this is not strictly true. A good sign is if an organization has a strong &#8220;referral&#8221; relationship with the local fire department, then they run the local emergency room and there &#8220;team&#8221; with every doctor in the city.</p>
<p>Because Medicare typically covers people over the age of 65, there is not very much information about doctors who exclusively treat children, or women who are having children. This means that there is a lack of data about pediatrics or ob/gyn doctors. There is also no information on doctors who do not take Medicare patients. Also there is no data in this dataset for providers who do not bill Medicare in a transactional fashion. Fully capitated plans like Kaiser Permanente will not have data in this dataset. If you put this data on a map, there should be a big hole over southern California as a result of this.</p>
<p>This dataset, like any messy data, should not be considered the &#8220;truth&#8221; but it can be used to help generate and reject hypotheses about how a given community delivers health care. We have lots of ideas about how to make the data more reliable and more accessible, but in order to do that, we need your help <a href="http://www.medstartr.com/projects/82-next-level-doctor-social-graph">paying for improvements to the data</a>. Even if you do not yourself want to have access to richer data, consider supporting us as we provide data for those who wish to have access to high quality versions of open doctor data. This dataset should deliver on the overall promise of open data: transparency improves performance.</p>
<p>There is a <a href="http://groups.google.com/group/docgraph">DocGraph Google Group</a> now, that you can join if you would like to ask specific questions about this dataset.</p>
<div style="float: left;border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px;clear: both"><a href="http://strataconf.com/strata2013?_discount=STRATA20&amp;intcmp=il-strata-stsc13-docgraph"><img style="float: left;border: none;padding-right: 10px" src="http://cdn.oreilly.com/radar/images/promos/strataca13-148x178.jpg" alt="" /></a><a href="http://strataconf.com/strata2013?_discount=STRATA20&amp;intcmp=il-strata-stsc13-docgraph"><strong>Strata Conference Santa Clara</strong></a>— Strata Conference Santa Clara, being held Feb. 26-28, 2013 in California, gives you the skills, tools, and technologies you need to make data work today.<a href="http://strataconf.com/strata2013?_discount=STRATA20&amp;intcmp=il-strata-stsc13-docgraph"><strong>Save 20% on registration with the code STRATA20</strong></a></p>
</div>
<p><strong>Related:</strong></p>
<ul>
<li><a href="http://strata.oreilly.com/2012/09/data-health-care-disruption.html">When data disrupts health care</a></li>
<li><a href="http://radar.oreilly.com/2012/08/data-health-care.html">Solving the Wanamaker problem for health care</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2012/11/docgraph-open-social-doctor-data.html/feed</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Who owns patient data?</title>
		<link>http://strata.oreilly.com/2012/06/patient-data-ownership-access.html</link>
		<comments>http://strata.oreilly.com/2012/06/patient-data-ownership-access.html#comments</comments>
		<pubDate>Wed, 06 Jun 2012 14:00:00 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[@editpick]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[data access]]></category>
		<category><![CDATA[data product]]></category>
		<category><![CDATA[data provider]]></category>
		<category><![CDATA[health data]]></category>
		<category><![CDATA[health data privacy]]></category>
		<category><![CDATA[health IT]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2012/06/patient-data-ownership-access.html</guid>
		<description><![CDATA[Patients, doctors and providers have a unique set of privileges that do not line up exactly with a traditional concept of ownership. ]]></description>
				<content:encoded><![CDATA[<p>Who owns a patient&#8217;s health information?</p>
<ul>
<li> The patient to whom it refers?</li>
<li> The health provider that created it?</li>
<li> The IT specialist who has the greatest control over it?</li>
</ul>
<p>
The notion of ownership is inadequate for health information. For instance, no one has an absolute right to destroy health information. But we all understand what it means to own an automobile: You can drive the car you own into a tree or into the ocean if you want to. No one has the legal right to do things like that to a &#8220;master copy&#8221; of health information.
</p>
<p>
All of the groups above have a complex series of rights and responsibilities relating to health information that should never be trivialized into ownership.</p>
<p>
Raising the question of ownership at all is a hash argument. What is a hash argument? Here&#8217;s <a href="http://www.juliansanchez.com/2009/04/06/climate-change-and-argumentative-fallacies/">how Julian Sanchez describes it</a>:</p>
<blockquote>
<p>&#8220;Come to think of it, there&#8217;s a certain class of rhetoric I&#8217;m going to call the &#8216;<a href="http://www.cs.rit.edu/%7Eark/lectures/onewayhash/onewayhash.shtml">one-way hash</a>&#8216; argument. Most modern cryptographic systems in wide use are based on a certain mathematical asymmetry: You can multiply a couple of large prime numbers much (much, much, much, much) more quickly than you can factor the product back into primes. A one-way hash is a kind of &#8216;fingerprint&#8217; for messages based on the same mathematical idea: It&#8217;s really easy to run the algorithm in one direction, but much harder and more time consuming to undo.  Certain bad arguments work the same way &mdash; skim online debates between biologists and earnest ID (Intelligent Design) aficionados armed with talking points if you want a few examples: The talking point on one side is <em>just complex enough</em> that it&#8217;s both intelligible &mdash; even somewhat intuitive &mdash; to the layman <em>and</em> sounds as though it might qualify as some kind of insight &#8230; The rebuttal, by contrast, may require explaining a whole series of preliminary concepts before it&#8217;s really possible to explain why the talking point is wrong.&#8221;</p>
</blockquote>
<p>
The question &#8220;Who owns the data?&#8221; presumes that the notion of ownership is valid, and it jettisons those foolish enough to try to answer the question into a needless circular debate. Once you mistakenly assume that the question is answerable, you cannot help but back an unintelligible position.
</p>
<p>
Ownership is a poor starting point for health data because the concept itself doesn&#8217;t map well to the people and organizations that have relationships with that data. The following chart shows what&#8217;s possible depending on a given role.
</p>
<div align="center">
<table cellspacing="10" border="1">
<tr style="background: #ddd">
<td valign="top" width="20" style="padding: 3px">Person / Privilege</td>
<td valign="top" width="116" style="padding: 3px">Delete their copy of data </td>
<td valign="top" width="116" style="padding: 3px">Arbitrarily (without logs) edit their copy of data </td>
<td valign="top" width="116" style="padding: 3px">Correct the provider&#8217;s copy of the data </td>
<td valign="top" width="116" style="padding: 3px">Append to the provider&#8217;s copy of the data </td>
<td valign="top" width="116" style="padding: 3px">Acquire copies of <a href="http://www.hhs.gov/ocr/privacy/">HIPAA</a>-covered data</th>
</tr>
<tr>
<td valign="top" style="font-weight: bold">Sourcing Provider</td>
<td valign="top"> No. HIPAA mandates that the provider who creates HIPAA-covered data must ensure that a copy of the record is available. Mere deletion is not a privilege that providers have with their copies of patient records. Most EHR systems enforce this rule for providers.
</td>
<td valign="top">
No. While providers can change the contents of the EHR, they are not allowed to change the contents without a log of those changes being maintained. Many EHRs contain the concept of &#8220;signing&#8221; EHR data, which translates to &#8220;the patient data entering the state where it cannot be changed without logging anymore.&#8221;
</td>
<td valign="top">
Yes. Providers can correct their copy of the EHR data, providing they maintain a copy of the incorrect version of the data. Again, EHR software enforces this rule.</td>
<td valign="top">
Yes. The providers can merely add to data, without changing the &#8220;correctness&#8221; of previous instances of the data. EHR systems should seamlessly handle this case.</td>
<td valign="top">
Sometimes. Depending on the ongoing &#8220;treatment&#8221; status of the patient, providers typically have the right to acquire copies of treatment data from other treating providers. If they are &#8220;fired,&#8221; they can lose this right.
</td>
</tr>
<tr>
<td colspan="6">&nbsp;</td>
</tr>
<tr style="background: #ddd">
<td valign="top" width="20" style="padding: 3px">Person / Privilege</td>
<td valign="top" width="116" style="padding: 3px">Delete their copy of data </td>
<td valign="top" width="116" style="padding: 3px">Arbitrarily (without logs) edit their copy of data </td>
<td valign="top" width="116" style="padding: 3px">Correct the provider&#8217;s copy of the data </td>
<td valign="top" width="116" style="padding: 3px">Append to the provider&#8217;s copy of the data </td>
<td valign="top" width="116" style="padding: 3px">Acquire copies of <a href="http://www.hhs.gov/ocr/privacy/">HIPAA</a>-covered data</th>
</tr>
<tr>
<td valign="top" style="font-weight: bold">Patient rights</td>
<td valign="top">Yes, they can delete their own copies of their patient records, but requests to providers that their charts be deleted will be denied. </td>
<td valign="top">
No. Patients cannot change the &#8220;canonical&#8221; version of a patient record.
</td>
<td valign="top">
No. While patients have the right to comment on and amend the file, they can merely suggest that the &#8220;canonical&#8221; version of the patient record be updated.
</td>
<td valign="top">
Yes. The patient has the right to append to EHR records under HIPAA. HIPAA does not require that this amendment impact the &#8220;canonical&#8221; version of the patient record, but these additions must be present somewhere, and there is likely to be a substantial civil liability for providers who fail to act in a clinically responsible manner on the amended data. The relationship between &#8220;patient amendments&#8221; and the &#8220;canonical version&#8221; is a complex procedural and technical issue that will see lots of attention in the years to come.
</td>
<td valign="top">
Usually. Patients typically have the right to access the contents of an EHR system, assuming they pay a copying cost. <a href="http://radar.oreilly.com/2012/05/patient-health-data-directed-exchange.html">EHRs frequently make this copying cost unreasonable</a>, and the results are so dense that they are not useful. There are also exceptions to this &#8220;right to read,&#8221; including psychiatric notes and legal investigations.
</td>
</tr>
<tr>
<td colspan="6">&nbsp;</td>
</tr>
<tr style="background: #ddd">
<td valign="top" width="20" style="padding: 3px">Person / Privilege</td>
<td valign="top" width="116" style="padding: 3px">Delete their copy of data </td>
<td valign="top" width="116" style="padding: 3px">Arbitrarily (without logs) edit their copy of data </td>
<td valign="top" width="116" style="padding: 3px">Correct the provider&#8217;s copy of the data </td>
<td valign="top" width="116" style="padding: 3px">Append to the provider&#8217;s copy of the data </td>
<td valign="top" width="116" style="padding: 3px">Acquire copies of <a href="http://www.hhs.gov/ocr/privacy/">HIPAA</a>-covered data</th>
</tr>
<tr>
<td valign="top" style="font-weight: bold">True Copyright Ownership (i.e. the relationship you have with a paper you have written or a photo you have taken)</td>
<td valign="top"> Yes. You can destroy things you own.</td>
<td valign="top"> Yes. You can change things you own without recording what changes you made.</td>
<td valign="top"> No. If you hold copyright to material and someone has purchased a right to a copy of that material, you cannot make them change it, even if you make &#8220;corrections.&#8221; Sometimes, people use licensing rather than mere &#8220;copy sales&#8221; to enforce this right (i.e. Microsoft might have the right to change your copy of Windows, etc.).</td>
<td valign="top">
No. Again, you have no rights to change another person&#8217;s copy of something you own the copyright to. Again, some people use licensing as a means to gain this power rather than just &#8220;sale of a copy.&#8221;</td>
<td valign="top">
No. You do not have an automatic right to copies of other people&#8217;s copyrighted works, even if they depict you somehow. (This is why your family photographer can gouge you on reprints.)</td>
</tr>
<tr>
<td colspan="6">&nbsp;</td>
</tr>
<tr style="background: #ddd">
<td valign="top" width="20" style="padding: 3px">Person / Privilege</td>
<td valign="top" width="116" style="padding: 3px">Delete their copy of data </td>
<td valign="top" width="116" style="padding: 3px">Arbitrarily (without logs) edit their copy of data </td>
<td valign="top" width="116" style="padding: 3px">Correct the provider&#8217;s copy of the data </td>
<td valign="top" width="116" style="padding: 3px">Append to the provider&#8217;s copy of the data </td>
<td valign="top" width="116" style="padding: 3px">Acquire copies of <a href="http://www.hhs.gov/ocr/privacy/">HIPAA</a>-covered data</th>
</tr>
<tr>
<td valign="top" style="font-weight: bold">
IT Specialist
</td>
<td valign="top">
Kind of. Regulations dictate that IT specialists and vendors should not have the right to delete patient records. But root (or admin) access to the underlying EHR databases ensure that only people with backend access can truly delete patient records. Only people with direct access to source code or direct access to the database can completely circumvent EHR logging systems. The &#8220;delete privilege&#8221; is somewhat difficult to accomplish entirely without detection, however, since it is likely that someone (i.e. the patient) will know that the record should be present.
</td>
<td valign="top">
Yes. Source code or database-level access ensures that patient records can be modified without logging.
</td>
<td valign="top">
Yes. Source code or database-level access ensures that patient records can be modified without logging.
</td>
<td valign="top">
Yes. Source code or database-level access ensures that patient records can be modified without logging.
</td>
<td valign="top">
No. Typically, database administrators and programmers do not have the standing to request medical records from other sources.
</td>
</tr>
<tr>
<td colspan="6">&nbsp;</td>
</tr>
</table>
</div>
<p>Ergo, neither a patient nor a doctor nor the programmer has an &#8220;ownership&#8221; relationship with patient data. All of them have a unique set of privileges that do not line up exactly with any traditional notion of &#8220;ownership.&#8221; Ironically, it is neither the patient nor the provider (when I say &quot;provider,&quot; this usually means a doctor) who is closest to &#8220;owning&#8221; the data. The programmer has the most complete access and the only role with the ability to avoid rules that are enforced automatically by electronic health record (EHR) software. </p>
<p>
So, asking &#8220;who owns the data?&#8221; is a meaningless, time-wasting, and shallow conceptualization of the issue at hand.
</p>
<p>
The real issue is: &#8220;What rights do patients have regarding healthcare data that refers to them?&#8221; This is a deep question because patient rights to data vary depending on how the data was acquired. For instance, a standalone personal health record (PHR) is primarily governed by the end-user license agreement (EULA) between the patient and the PHR provider (which usually gives the patient wildly varying rights), while right to a doctor&#8217;s EHR data is dictated by both HIPAA and <a href="http://shop.oreilly.com/product/0636920020110.do">Meaningful Use</a> standards.
</p>
<p>
Usually, what people really mean when they say &#8220;The patient owns the data&#8221; is &#8220;The patient&#8217;s needs and desires regarding data should be respected.&#8221; That is a wonderful instinct, but unless we are going to talk about specific privileges enabled by regulation or law, it really means &#8220;whatever the provider/programmer holding the data thinks it means.&#8221;
</p>
<p>
For instance, while current Meaningful Use does require providers to give patients digital access to summary documents, there is no requirement for &#8220;complete&#8221; and &#8220;instant&#8221; access to the full contents of the EHR. While HIPAA mandates &#8220;complete&#8221; access, the EHR serves to make <a href="http://radar.oreilly.com/2012/05/patient-health-data-directed-exchange.html">printed copies of digitized patient data completely useless</a>. The devil is in the details here, and when people start going on about &#8220;the patient owning the data,&#8221; what they are really doing is encouraging a mental shortcut that cannot readily be undone.
</p>
<p>
<em>Note: This is a refresh of an article originally published <a href='http://www.fredtrotter.com/2009/10/27/who-owns-the-data/'>here</a>. Photo on home and category pages: <a href="http://www.flickr.com/photos/rvoegtli/4927466850/" title="stethoscope by rosmary, on Flickr">Stethoscope by rosmary, on Flickr</a></em></p>
<div style="float: left;border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px;clear: both"><a href="http://shop.oreilly.com/product/0636920020110.do?intcmp=il-npa-books-who-owns-patient-data"><img style="float: left;border: none;padding-right: 10px" src="http://radar.oreilly.com/2011/10/19/meaingful-use-cover-148.png" /></a><a href="http://shop.oreilly.com/product/0636920020110.do?intcmp=il-npa-books-who-owns-patient-data"><strong>Meaningful Use and Beyond: A Guide for IT Staff in Health Care</strong></a> &mdash;  Meaningful Use underlies a major federal incentives program for medical offices and hospitals that pays doctors and clinicians to move to electronic health records (EHR). This book is a Rosetta Stone for the IT implementer who wants to help organizations harness EHR systems.</div>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://radar.oreilly.com/2012/05/patient-health-data-directed-exchange.html">Principles of patient access in Directed Exchange</a></li>
<li> <a href="http://radar.oreilly.com/2011/04/jeff-jonas-data-privacy-control.html">The truth about data: Once it&#8217;s out there, it&#8217;s hard to control</a></li>
<li> <a href="http://radar.oreilly.com/2011/10/geeks-meaningful-use-aco.html">Why geeks should care about meaningful use and ACOs</a></li>
<li> <a href="http://radar.oreilly.com/tag/health-it">See more of Radar&#8217;s health IT coverage</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2012/06/patient-data-ownership-access.html/feed</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Principles of patient access in Directed Exchange</title>
		<link>http://strata.oreilly.com/2012/05/patient-health-data-directed-exchange.html</link>
		<comments>http://strata.oreilly.com/2012/05/patient-health-data-directed-exchange.html#comments</comments>
		<pubDate>Sun, 06 May 2012 22:00:00 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[directed exchange]]></category>
		<category><![CDATA[health data]]></category>
		<category><![CDATA[health internet]]></category>
		<category><![CDATA[health IT]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2012/05/patient-health-data-directed-exchange.html</guid>
		<description><![CDATA[In this digital world, health data that&apos;s 36-hours old can only be analyzed as a post-mortem. Health data that&apos;s 30-days old is already rotting. ]]></description>
				<content:encoded><![CDATA[<p>
The <a href="http://en.wikipedia.org/wiki/Health_Insurance_Portability_and_Accountability_Act">Health Insurance Portability and Accountability Act  (HIPAA)</a> is good law. HIPAA formalized principles of patient privacy that should have been codified industry norms for more than 50 years (better late than never). HIPAA provided the right to patients in the U.S. to get access to their own healthcare records. The law struck reasonable balances on hundreds of complicated issues in order to achieve these goals. The law  solved more problems, by far, than it created. Which is as close to the definition of good government as I can imagine. Patients are better off after HIPAA than before.</p>
<p>
Sadly, the &#8220;letter of the law&#8221; in HIPAA is frequently either ignored or worse, fully embraced, in order to make patient access to their own healthcare data more cumbersome. This is evidenced nowhere better than Regina Holiday&#8217;s experience with access to her husband&#8217;s medical records. To make a long story short, she was able to <a href="http://reginaholliday.blogspot.com/2009/11/thank-you-stphen-king-for-under-dome.html">acquire an unpublished manuscript of a Stephen King novel, sooner and for less money than she was able to get her husband&#8217;s medical records</a>.</p>
<h3>Principle zero: Some clinicians will do anything they can to make patient access to their health records impossible or cumbersome. </h3>
<p>
Regina&#8217;s work, detailing her experience with her husband is titled <a href="http://73centsfilm.com/">73 cents</a>, because that&#8217;s how much it cost to get one page of her husband&#8217;s medical record. <a href="http://www.hhs.gov/ocr/privacy/hipaa/faq/right_to_access_medical_records/353.html">HIPAA allows hospitals and clinicians to charge a &#8220;reasonable&#8221; copying fee for access to patient records. </a> The problem with that is that in the digital age, a single healthcare record print out looks like this:</p>
<p class="image-box-580"><a href="http://www.flickr.com/photos/23879201@N00/5839811314/in/set-72157626851515399/"><img src="http://s.radar.oreilly.com/wp-files/2/2012/08/0512-single-ehr-record.jpg" width="580" border="0" alt="A single EHR record, printed out" style="margin-bottom: 15px" /></a><br /><em>A partial printout of a patient&#8217;s medical record.</em></p>
<p>
<a href="http://www.flickr.com/photos/23879201@N00/5839811314/in/set-72157626851515399/">This is what happens when you print out a digital health record.</a> Having patients pay the copying costs for access to medical records makes a simple presumption: there are only a few pages there. Obviously no patient will be able to afford copying costs in the age of all-digital records.  </p>
<h3>Principle one: Patient access to their own healthcare records must be digital once the record is digital.</h3>
<p>
Once you concede that access to the patient&#8217;s medical record must be digital, we can discuss the <a href="http://www.hermans.org/agents2/ch3_1_2.htm">push vs. pull question</a>. When someone else on the Internet has data that is important to you, you can generally find ways to have it &#8220;pushed&#8221; to you or you can choose to &#8220;pull&#8221; it. The simplest example is the weather. You can always check the weather easily online by visiting a website (by pulling). But you can also have software <a href="http://www.metafilter.com/103112/If-This-Then-That#3673465">text you when it is going to rain</a> (by pushing).</p>
<p>
There are advantages of both push and pull approaches for patient access to data. People who are excited about the pull model tend to focus on the benefits of the &#8220;portal&#8221; requirements in Meaningful Use, and those that favor the push model are excited about <a href="http://directproject.org/">directed exchange</a>. Without getting <a href="http://directtrust.wikispaces.com/message/view/CPP+Co-Workgroup+on+Citizens+and+Patients/52564050#52619702">into the debate</a>, I can posit that <a href="http://www.jopm.org/opinion/commentary/2011/07/05/sharks-bees-and-health-privacy-paranoia/">there are some cases where push access to patient data is critical</a>. Without supporting patient participation in directed exchange we regulate patients to second-class citizens with regard to healthcare exchange.<br />
That is unacceptable. <a href="http://directtrust.wikispaces.com/message/view/CPP+Co-Workgroup+on+Citizens+and+Patients/52564050#52569842">Patients should be first-class citizens in healthcare exchange</a>.
</p>
<h3>Principle two: Patients should be able to participate in health information exchange as first-class citizens.</h3>
<p>
The Office of the National Coordinator for Health Information Technology (ONC) should be <a href="http://radar.oreilly.com/2012/02/direct-project-required-in-nex.html">applauded for requiring directed exchange with patients</a> in the current proposed rule. I hope that ONC does not back off of this new requirement.
</p>
<p>
The current proposed rule making, however, is silent on a critical issue for directed health information exchange. How do we ensure that providers will not refuse to communicate with patients over directed exchange because of bogus &#8220;security concerns&#8221;? As we see with the copying costs under HIPAA, every potential barrier to a patient&#8217;s access to data will be used against patients. </p>
<p>
There are already rumors of cases in the pilots of directed exchange where organizations are using the trust architecture of the Direct Project to refuse to communicate with certain parties. While that might be reasonable between institutions (do you really think Planned Parenthood will ever automate communication with Catholic charity clinics or vice-versa?), it is absolutely critical that this not hamper patient-clinician communication.</p>
<p>
When we first designed the Direct Project Trust model, we presumed that patient-clinicians communication would take place based on <a href="http://wiki.directproject.org/message/view/Security+and+Trust+Workgroup/22949841">&#8220;business-card&#8221; identity verification</a>. That meant that when a patient provided a clinician with a public key (no matter how they did that) the <strong>clinician would trust it because the patient provided the public key</strong>. We did this because we knew that if clinicians could reject a patient&#8217;s public key based on &#8220;security concerns,&#8221; they would do so. Either the clinicians (or more likely the vendors that they hired) would choose directed exchange &#8220;partners&#8221; that were &#8220;approved&#8221; and &#8220;secure,&#8221; ensuring that the patient&#8217;s experience of directed exchange was merely a more extensive menu of patient portal options. Patient data is very valuable and controlling the flow of patient data is central to more business plans than I care to count. </p>
<p>
In order for patients to be first-class citizens in health information exchange, they should have the right to send their records, in an automated fashion, anywhere they want. Even if it meant sending it to a service that the patient was enthusiastic about, but the clinician disapproved of (i.e. <a href="http://qpid.me">qpid.me</a>).  In the world of secure email enabled by <a href="http://en.wikipedia.org/wiki/Public-key_infrastructure">public-key infrastructure</a> (PKI), that translates to clinicians must accept any public key/direct address presented by a patient in a reasonable manner. This acceptance must be unconditional, but should probably mean limiting the acceptance of that key to communication with just that patient. Anything less than this means that the patient is a second-class citizen with regards to the information exchange of their own data. </p>
<h3>Conclusion: ONC should require that clinicians communicate with a patient&#8217;s chosen directed exchange provider, which means accepting any public key presented by a patient in a reasonable manner.</h3>
<p>
The community at <a href='http://directtrust.org'>Direct Trust</a> is working hard to agree on what &#8220;reasonable manner&#8221; should mean, exactly. Here is my <a href="http://directtrust.wikispaces.com/message/view/CPP+Co-Workgroup+on+Citizens+and+Patients/52640236">latest proposal on the subject</a>, and here are <a href="http://directtrust.wikispaces.com/message/view/CPP+Co-Workgroup+on+Citizens+and+Patients/52642798">similar ideas from Dr. David Kibbe</a>. Eventually the Direct Trust community will knock out a firm understanding on the specific ways that might be &#8220;reasonable&#8221; for a patient to provide a certificate. But we are certainly agreed that without firm requirements on certificate acceptance, this issue will be used by clinicians to limit where patients can send their own data. </p>
<p>
As the U.S. federal government is preparing to pay healthcare providers to adopt electronic health records (EHR) they will insist that those doctors/hospitals/etc. show that they are using the new software in clinically meaningful ways. On Monday (May 7, 2012) they will be accepting comments on the second stage of the requirements that clinicians must meet in order to receive compensation. These requirements are usually short-handed as &#8220;<a href="http://healthit.hhs.gov/portal/server.pt?open=512&amp;objID=2996&amp;mode=2">meaningful use</a>.&#8221;
</p>
<p> I will be submitting this blog post as my comments to that process. Others will be submitting comments that directly contradict the principles and conclusions I write here. Most notably the <a href="http://www.aha.org/advocacy-issues/letter/2012/120430-cl-cms0044p.pdf">American Hospital Association (AHA) has argued that the requirements for patient portals and for providing patients with access to their digital record should be entirely removed from the meaningful use standards</a> (PDF). Specifically:</p>
<blockquote><p>&#8220;Our members are particularly concerned with the proposed objective to provide patients with the ability to view, download and transmit large volumes of protected health information via the Internet (a &#8220;patient portal&#8221;). The AHA believes that this objective is not feasible as proposed, raises significant security issues, and goes well beyond current technical capacity. We also believe that CMS should not include this objective because the Office of Civil Rights, and not CMS, regulates how health care providers and other covered entities fulfill their obligations under the Health Insurance Portability and Accountability Act (HIPAA), including the obligation to give patients access to their health records.&#8221;</p>
</blockquote>
<p>
This is fairly ironic, since the report also says:
</p>
<blockquote><p>&#8220;To date, OCR has received comments on its own significantly<br />
flawed original proposal to implement this section of HITECH, but has yet to finalize the<br />
standard.&#8221;</p>
</blockquote>
<p>
Apparently, AHA is not satisfied with any government agency&#8217;s interpretation of giving electronic access to patient data. The AHA would prefer that patients continue to wait the same amount of time for access to their digital records that they do for their paper records. Specifically:
</p>
<blockquote>
<p>&#8220;Further, 30 days are necessary to make determinations about how to respond to a request no matter the format of the protected health information. While providing an electronic copy of protected health information maintained in an EHR eventually may be facilitated more easily by technology, the process of determining which records are relevant and appropriate takes the same amount of time as it does for evaluating paper records.&#8221;</p>
</blockquote>
<p>
Of course, this is entirely false. Indeed, HIPAA does maintain that certain parts of healthcare records (i.e. a psychiatrist&#8217;s notes) and disclosures (i.e. when the FBI asks for records) are not subject to patient access. An EHR should be capable of understanding which parts of an EHR record are subject to HIPAA and which are not. If the EHR system can understand this distinction, then responses to HIPAA requests can be made in near-real-time. If the EHR system cannot make the distinction between which portions of the record to automatically provide to honor a HIPAA patient access request, then having 30 days is not going to be enough. Can you imagine a nurse reading through the entire stack of papers above to ensure that a certain mental health diagnosis is redacted?</p>
<p>
One of the most critical features of patient participation in directed exchange is the patient&#8217;s capacity to prevent the spread of bad information as it is happening. Apparently, the AHA believes that patients should tolerate the spread of mis-information in their health records to other institutions for a month before correcting it. This of course works in every situation where patients can wait a whole month to get correct information to other hospitals and clinicians.
</p>
<p>
I would like to be the first to welcome the American Hospital Association to the digital age. (Okay, <a href="http://blog.nationalpartnership.org/index.php/2012/05/patient-protections-hit/">maybe the second</a>.) From a technology perspective, there is nothing at all that would prevent patients from receiving copies of their updated digital health records seconds after it is &#8220;signed&#8221; by their clinicians. Inside those seconds is plenty of time to digitally determine whether sharing with the patient is appropriate, legal and safe. Seconds after a patient like me receives data, I intend to process it in an automated fashion. It is not unreasonable, in this new digital world, for me to get a text message that a doctor has ordered a medication that I am allergic to. I wish to get that message after the doctor has ordered the medication, but before I receive it in my IV. </p>
<p>
In this new digital world, 36 hours is unreasonable. It means that humans continue to be involved in tasks that can be performed perfectly by a computer without errors. Even 36 hours means that doctors, nurses and hospital administrators are still &#8220;thinking in paper.&#8221; Thirty-six hours means that you still do not view me, the patient, as an equal data partner. It means that I am blind to the data in your hospital at the only time it really matters, which is right now. Health data that is 36-hours old can only be analyzed as a post-mortem and data that is 30-days old is already rotting. As a patient, 36 hours is a short-term solution. It is an opportunity for you to rethink how information flows in your hospitals. It is an opportunity for you to rethink the notions of &#8220;inside&#8221; the hospital and &#8220;outside&#8221; the hospital.
</p>
<p>
This is not that I do not take your point regarding the reconciliation of the policies from the perspective of HIPAA and meaningful use. Two time-lines for compliance is difficult. But the reconciliation is to speed HIPAA up, not slow meaningful use down. The notion that you will give patients a stack of paper like the one above 30 days after it is useful is a bad joke. It was a bad joke 20 years ago, when the technologies already existed to fix the problem, but you decided that the patient&#8217;s experience was not worth that investment.
</p>
<p>
There is <a href="https://secure2.convio.net/npwf/site/Advocacy?cmd=display&amp;page=UserAction&amp;id=741">always something you can do</a>, if you feel as strongly about this as I do.
</p>
<div style="float: left;border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px;clear: both"><a href="http://shop.oreilly.com/product/0636920020110.do?intcmp=il-npa-ebooks-directed-exchange-trotter"><img style="float: left;border: none;padding-right: 10px" src="http://radar.oreilly.com/2011/10/19/meaingful-use-cover-148.png" /></a><a href="http://shop.oreilly.com/product/0636920020110.do?intcmp=il-npa-ebooks-directed-exchange-trotter"><strong>Meaningful Use and Beyond: A Guide for IT Staff in Health Care</strong></a> &mdash;  Meaningful Use underlies a major federal incentives program for medical offices and hospitals that pays doctors and clinicians to move to electronic health records (EHR). This book is a rosetta stone for the IT implementer who wants to help organizations harness EHR systems.</div>
<p><em>Photo: <a href="http://www.flickr.com/photos/23879201@N00/5839811314/" title="Medical record printout by jodi0327, on Flickr">Medical record printout by jodi0327, on Flickr</a></em></p>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://radar.oreilly.com/2012/05/the-state-of-health-it-accordi.html">The state of health IT according to the American Hospital Association</a></li>
<li> <a href="http://radar.oreilly.com/2010/11/rebranding-the-health-internet.html">The Direct Project: Healthcare communication gets an upgrade</a></li>
<li> <a href="http://radar.oreilly.com/2012/01/epatients-empowered-patients.html">Epatients: The hackers of the healthcare world</a></li>
<li> <a href="http://radar.oreilly.com/2012/02/farzad-mostashari-health-it-epatients.html">Building the health information infrastructure for the modern epatient</a></li>
<li> <a href="http://radar.oreilly.com/2011/10/geeks-meaningful-use-aco.html">Why geeks should care about meaningful use and ACOs</a></li>
<li> <a href="http://radar.oreilly.com/tag/health-it">See more of Radar&#8217;s health IT coverage</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2012/05/patient-health-data-directed-exchange.html/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The Direct Project in action</title>
		<link>http://strata.oreilly.com/2012/02/direct-project-healthcare-houston.html</link>
		<comments>http://strata.oreilly.com/2012/02/direct-project-healthcare-houston.html#comments</comments>
		<pubDate>Fri, 24 Feb 2012 14:00:00 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[Direct Project]]></category>
		<category><![CDATA[ehr]]></category>
		<category><![CDATA[health information exchange]]></category>
		<category><![CDATA[health IT]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2012/02/direct-project-healthcare-houston.html</guid>
		<description><![CDATA[Jim Langabeer, CEO of Greater Houston Healthconnect, discusses the implementation goals and hurdles related to a Direct Project pilot program. ]]></description>
				<content:encoded><![CDATA[<p><a href="http://directproject.org"><img src="http://radar.oreilly.com/2012/02/22/0212-direct-project-logo.png" border="0" alt="The Direct Project" width="300" style="float: right;margin: 3px 0 10px 10px" /></a>The <a href="http://directproject.org">Direct Project</a> is <a href="http://wiki.siframework.org/file/view/HIMSS%20Events%20for%20SI%20and%20Direct%202012-02-16_v4.pdf">all over HIMSS12</a>, and really all over the country now. But it  still carries controversy. When I found out that one of the Houston Health Information Exchange efforts had <a href="http://www.healthcareitnews.com/news/greater-houston-hie-puts-direct-project-work">successfully launched a Direct Pilot</a>, I simply had to do an interview. After all, here was software that I had contributed to as an open source project that was being deployed in my own backyard.</p>
<p>
Jim Langabeer is the CEO of the newly renamed <a href="http://www.ghhconnect.org/">Greater Houston Healthconnect</a>. I caught up with Jim at Starbucks and peppered him with questions about where Health Information Exchange (HIE) is going and what HIE looks like in Houston. </p>
<h2>What&#8217;s your background?</h2>
<p><strong>Jim Langabeer:</strong> I have been in healthcare for a long time in the <a href="http://www.texasmedicalcenter.org/">Texas Medical Center</a>.  I started as a hospital administrator at <a href="http://www.utmb.edu/">UTMB</a>, where my first project was to work on an IT project team developing a Human Resources Management System. It was a collaborative effort between three hospitals. </p>
<p>
I recently led a software company in the business intelligence space, which was later acquired by Oracle. After that, I decided I wanted to come back to Houston and continue to work in healthcare, so I returned to work for <a href="http://www.mdanderson.org/">MD Anderson</a> leading project and performance management. I eventually worked with the CIO <a href="http://faculty.mdanderson.org/Lynn_Vogel/Default.asp?SNID=1486718363">Lynn Vogel</a> to assess the business value of information systems. I most recently taught healthcare administration at the <a href="https://sph.uth.tmc.edu/">UT School of Public Health</a>.</p>
<p>
Throughout my healthcare career, I have been using data to drive healthcare decisions. My PhD is in decision sciences &mdash; quantitative modeling of data for decision-making &mdash; and my research grants have all involved analyzing large datasets to make healthcare decisions better. I have also worked between organizations in a collaborative manner. Health Information Exchange was an obvious next step for me.</p>
<h2>Where are you in the process of creating a health information exchange?</h2>
<p>
<strong>Jim Langabeer:</strong> We are in the middle stage of operations. We are finalizing our architectural vision and choosing vendors. Most importantly, we have strong community support: 41% of the doctors in the region have committed to the exchange with letters of support as well as 61 of the 117 local hospitals.</p>
<p>
We are meeting with all of the doctors we can. We are calling them and faxing them and visiting them, with one simple message: Health IT is coming and we want you to participate.</p>
<h2>You mentioned an &#8220;architectural vision.&#8221; Can you expand on that?</em></h2>
<p>
<strong>Jim Langabeer:</strong> We really cannot have just one architecture, so our architectural vision really means choosing several protocols and architectures to support the various needs of our stakeholders in parallel. We need to accommodate the entire range of transactions that our physicians and hospitals perform. The numbers say that 50% of Houston docs work in small practices with only one or two doctors, and they typically do not have electronic health records (EHR). Hooking these doctors into a central hub model does not make sense, so a different model where they can use browser/view capabilities and direct connections between providers must be part of our architectural vision.</p>
<p>
Houston also has several large hospitals using EPIC or other mature EHR systems.  That means we need a range of solutions. Some docs just want to be able to share records. Some are more sophisticated and want to do full EHR linking. Some doctors just want to be able to view data on the exchange using a web portal.</p>
<p>
We want to accommodate all of these requests. That means we want a portfolio of products and a flexible overall architectural vision. Practically, that means we will be supporting Direct, <a href="http://ihe.net">IHE</a> and also older <a href="http://en.wikipedia.org/wiki/Health_Level_7">Hl7 v2</a>.</p>
<p>
Some people are saying Direct is all we want. We do not want a solution that is way over what small providers can handle and then it never gets used. We are architecture- and vendor-neutral, which can be difficult because <a href="http://www.epic.com/">EPIC</a> is so prevalent in Houston. </p>
<p>
We have practices that are still on paper on one hand and very sophisticated hospitals on the other, and that is just in the central Houston area. Immediately outside of Houston, lots of rural hospitals that we plan to support have older EHR systems or home-grown systems. That means we have to work with just about every potential health IT situation and still provide value.</p>
<h2>Recently, a <a href="http://jamia.bmj.com/content/early/2012/01/21/amiajnl-2011-000442.full">JAMIA perspectives article</a> criticized Direct as a threat to non-profit HIE efforts like yours. Do you feel that Direct is a threat?</h2>
<p><strong>Jim Langabeer:</strong> I do not see Direct as a threat. I hear that from lots of sources, that Direct is a distraction for health information exchanges. I disagree.</p>
<p>I see it as another offering. The market is obviously responding to Direct. The price point on the software for Direct is definitely a benefit to smaller docs. We see it as a parallel path.</p>
<p>
We do not see <a href="http://www.surescripts.com/">Surescripts</a> (which is offering <a href="http://www.aafp.org/online/en/home/practicemgt/physiciansdirect.html">Direct email addresses to doctors with the AAFP</a>) as a threat because we see them as a collaborator. We want them, and similar companies, to be part of our network. We are also having conversations with insurance companies and others who are not typically involved in health information exchanges because we are looking for partners.</p>
<p>
The problem in healthcare is that it has always been very fragmented; no single solution gets much penetration. So, as we consider different protocols, we have to go with what people are asking for and what is already being adopted. We have to get to a point were these technologies have a very high penetration rate. </p>
<h2>How are you narrowing your health IT vendors?</h2>
<p><strong>Jim Langabeer:</strong> What we want is a vendor that is going to be with us long term, sharing our risks and making sure we are successful. The sustainability of the vendor is connected to the sustainability of our exchange, so that is really important. Our 20-county region represents 6.4 million people, and that population is larger than most states that  are pursing exchanges. Not many vendors have experience on that scale. </p>
<h2>How important is the software licensing? Do open source vendors have an advantage?</h2>
<p><strong>Jim Langabeer:</strong> I am not sure they have an advantage. Of course, open source is ideal, but often proprietary vendors are ahead in terms of features. A mix in the long-term solution would be really cool.</p>
<h2>How will you work with outside EHR vendors?</h2>
<p><strong>Jim Langabeer:</strong> We are trying to engage at the CIO level. We&#8217;re trying to understand what solutions standards and data they want to share. There is a core set of things everyone needs to do. Beyond that core, some people want to go with <a href="http://en.wikipedia.org/wiki/Service-oriented_architecture">SOA</a>; other people really want <a href="http://www.ihe.net/">IHE</a> or Direct. There is not much data sharing between hospitals. That is why industry standards are so important to us. It helps us shorten those discussions and make a more narrow offering. So, we are focusing on protocols as a means to work with the various EHR vendors.</p>
<p>
One CIO told us, &#8220;We do not want to exchange data at all; we just want our doctors to be able to open a browser and see your data.&#8221; We may not like to hear that, but that is the reality for many organizations in Houston. </p>
<p>
The other thing that is unique about Houston is that you are not going to see the state of Texas taking a dictatorial role. In other large exchanges, you often have a state-level government dictating HIE. In that environment, it is easier to insist on specific standards. That is not our situation in Houston, so we have to meet our constituents where they are. </p>
<h2>I have been frustrated that the <a href="http://wiki.directproject.org/Reference+Implementation+Workgroup">Direct Project reference implementations</a> only come in Java and .NET at this point. I would like to see implementations in PHP, Python, Ruby, etc. &mdash; languages that are more popular with entrepreneurs. Are you concerned with issues like that?</h2>
<p><strong>Jim Langabeer:</strong> We&#8217;re definitely thinking about things like that. We do not want to be merely business-to-business &mdash; we want to offer services to consumers. So, we care about the technology becoming accessible to consumers, which means getting to iPhones. We want to be able to offer consumers tools that will bring them value, so we certainly care about issues like implementation language because we see those issues as connected.</p>
<h2>If I let you dictate which Houston clinic or hospital I go to, when can I go see a doctor and get my patient data sent to my HealthVault or other PHR Direct account?</h2>
<p><strong>Jim Langabeer:</strong>  I would hope that the technology would be ready by the end of year. What I envision is a core group of early adopters. We already have several hospitals and some physician groups that are interested in taking that role. </p>
<p><em>This interview was edited and condensed.</em></p>
<div style="float: left;border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px;clear: both"><a href="http://shop.oreilly.com/product/0636920020110.do?cmp=il-radar-books-direct-project-langabeer-houston-interview"><img style="float: left;border: none;padding-right: 10px" src="http://radar.oreilly.com/2011/10/19/meaingful-use-cover-148.png" /></a><a href="http://shop.oreilly.com/product/0636920020110.do?cmp=il-radar-books-direct-project-langabeer-houston-interview"><strong>Meaningful Use and Beyond: A Guide for IT Staff in Health Care</strong></a> &mdash;  Meaningful Use underlies a major federal incentives program for medical offices and hospitals that pays doctors and clinicians to move to electronic health records (EHR). This book is a rosetta stone for the IT implementer who wants to help organizations harness EHR systems.</div>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://radar.oreilly.com/2010/11/rebranding-the-health-internet.html">The Direct Project: Healthcare communication gets an upgrade</a></li>
<li> <a href="http://radar.oreilly.com/2012/01/epatients-empowered-patients.html">Epatients: The hackers of the healthcare world</a></li>
<li> <a href="http://radar.oreilly.com/2012/02/farzad-mostashari-health-it-epatients.html">Building the health information infrastructure for the modern epatient</a></li>
<li> <a href="http://radar.oreilly.com/2011/10/geeks-meaningful-use-aco.html">Why geeks should care about meaningful use and ACOs</a></li>
<li> <a href="http://radar.oreilly.com/tag/health-it">See more of Radar&#8217;s health IT coverage</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2012/02/direct-project-healthcare-houston.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>AI will eventually drive healthcare, but not anytime soon</title>
		<link>http://strata.oreilly.com/2012/01/artificial-intelligence-healthcare.html</link>
		<comments>http://strata.oreilly.com/2012/01/artificial-intelligence-healthcare.html#comments</comments>
		<pubDate>Wed, 25 Jan 2012 14:00:00 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[algorithm]]></category>
		<category><![CDATA[doctors]]></category>
		<category><![CDATA[health IT]]></category>
		<category><![CDATA[healthcare]]></category>
		<category><![CDATA[medicine]]></category>
		<category><![CDATA[patients]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2012/01/artificial-intelligence-healthcare.html</guid>
		<description><![CDATA[People will eventually get better care from artificial intelligence, but for now, we should keep the algorithms focused on the data that we know is good and keep the doctors focused on the patients.  ]]></description>
				<content:encoded><![CDATA[<p>TechCrunch recently published a guest post from Vinod Khosla with the headline &#8220;<a href="http://techcrunch.com/2012/01/10/doctors-or-algorithms/">Do We Need Doctors or Algorithms?</a>&#8220;. Khosla is an investor and engineer, but he is a little outside his depth on some of his conclusions about health IT. </p>
<p>Let me concede and endorse his main point that doctors will become bionic clinicians by teaming with smart algorithms. He is also right that eventually the best doctors will be artificial intelligence (AI) systems &mdash; software minds rather than human minds.</p>
<p>That said, I disagree with Khosla on almost all of the details. Khosla has accidentally embraced a perspective that too many engineers and software guys bring to health IT.</p>
<p>Bear with me &mdash; I am the guy trying to write the &#8220;House M.D.&#8221; AI algorithms that Khosla wants. It&#8217;s harder than he thinks because of two main problems that he&#8217;s not considering: The search space problem and the good data problem.</p>
<h2>The search space problem</h2>
<p>Any person even reasonably informed about AI knows about <a href="http://en.wikipedia.org/wiki/Go_%28game%29">Go</a>, an ancient game with simple rules. Those simple rules hide the fact that Go is a very complex game indeed. For a computer, it is much harder to play than chess.</p>
<p>Almost since the dawn of computing, chess was regarded as something that required intelligence and was therefore a good test of AI. <a href="http://en.wikipedia.org/wiki/Deep_Blue_%28chess_computer%29">In 1997, the world chess champion was beaten by a computer</a>. In the year after, <a href="http://mechner.com/david/compgo/sciences/">a professional Go player beat the best Go software in the world with a 25 stone handicap</a>. Artificial intelligence experts <a href="http://en.wikipedia.org/wiki/Computer_Go">study Go carefully</a> precisely because it is so hard for computers. The approach that computers take toward being smart &mdash; thinking of lots of options really fast &mdash; stops working when the number of options skyrockets, and the number of potentially right answers also becomes enormous. Most significantly, Go can always be made more computationally difficult by simply expanding the board. </p>
<p>Make no mistake, the diagnosis and treatment of human illness is like Go. It&#8217;s not like chess. Khosla is making a classic AI mistake, presuming that because he can discern the rules easily, it means the game is simple. Chess has far more complex rules than Go, but it ends up being a simpler game for computers to play.</p>
<p>To be great at Go, software must learn to ignore possibilities, rather than searching through them. In short, it must develop &#8220;Go instincts.&#8221; The same is true for any software that could claim to be a diagnostician.</p>
<p>How can you tell when software diagnosticians are having search problems? When they cannot tell the difference between all of the &#8220;right&#8221; answers to a particular problem. The average doctor does not need to be told &#8220;could it be Zebra Fever?&#8221; by a computer that cannot tell that it should have ignored any zebra-related possibilities because it is not physically located in Africa. (No zebras were harmed in the writing of this article, and I do not believe there is a real disease called Zebra Fever.)</p>
<h2>The good data problem</h2>
<p>The second problem is the good data problem, which is what I spend most of my time working on. </p>
<p>Almost every time I get over-excited about the <a href="http://directproject.org/">Direct Project</a> or other health data exchange progress, my co-author <a href="http://www.oreillynet.com/pub/au/4766">David Uhlman</a> brings me back to earth:</p>
<blockquote>
<p>What good is it to have your lab results transferred from hospital A to hospital B using secure SMTP and XML? They are going to re-do the labs anyway because they don&#8217;t trust the other lab.</p>
</blockquote>
<p>While I still have hope for health information exchange in the long term, David is right in the short term. Healthcare data is not remotely solid or trustworthy. A good majority of the time, it is total crap. The reason that doctors insist on having labs done locally is not because they don&#8217;t trust the competitor&#8217;s lab; it&#8217;s more of a &#8220;devil that you know&#8221; effect. They do not trust their own labs either, but they have a better understanding of how and when their own labs screw up. That is not a good environment for medical AI to blossom.</p>
<p>The simple reality is that doctors <a href="http://thoughtbroadcast.com/2012/01/19/the-unfortunate-therapeutic-myopia-of-the-emr/">have good reason to be dubious about the contents of an EHR record</a>. For lots of reasons, not the least of which is that the codes they are potentially entering there are not diagnostically helpful or valid.</p>
<p>Non-healthcare geeks presume that the dictionaries and ontologies used to encode healthcare data are automatically valid. But in fact, the best assumption is that ontologies consistently lead to dangerous diagnostic practices, as they shepherd clinicians into choosing a label for a condition rather than a true diagnosis.  Once a patient&#8217;s chart has a given label, either for diagnosis or for treatment, it can be very difficult to reassess that patient effectively. There is even a name for this problem: <a href="http://www.ncbi.nlm.nih.gov/pubmed/11694107">clinical inertia</a>.  Clinical inertia is an issue with or without computer software involved, but it is very easy for an ontology of diseases and treatments to make clinical inertia worse. The fact is, <a href="http://www.psychologytoday.com/blog/dsm5-in-distress/201201/two-fallacies-invalidate-the-dsm-5-field-trials">medical ontologies must be constantly policed to ensure that they do not make things worse, rather then better</a>.</p>
<p>It simply does not matter how good the AI algorithm is if your healthcare data is both incorrect and described with a faulty healthcare ontology. My personal experiences with health data on a wide scale? It&#8217;s like having a conversation with a habitual liar who has a speech impediment. </p>
<p>So Khosla is not &#8220;wrong&#8221; per-se; he&#8217;s just focused on solving the wrong parts of the problem. As a result, his estimations of when certain things will happen are pretty far off.</p>
<p>I believe that we will not have really good diagnostic software until after the <a href="http://en.wikipedia.org/wiki/Technological_singularity">singularity</a> <em>and</em> until after we can ensure that healthcare data is reliable. I actually spend most of my time on the second problem, which is really a sociological problem rather then a technology problem. </p>
<p>Imagine if we had a &#8220;House AI&#8221; before we were able to feed it reliable data? Ironically it would be very much like the character on TV: constantly annoyed that everyone around him keeps screwing up and getting in his way. </p>
<p>Anyone who has seen the show knows that the House character is constantly trying to convince the other characters that the patients are lying. The reality is that the best diagnosticians typically assume that the chart is lying before they assume that the patient is lying. With notable exceptions, the typical patient is highly motivated to get a good diagnosis and is, therefore, honest. The chart, on the other hand, be it paper or digital, has no motivation whatsoever, and it will happily mix in false lab reports and record inane diagnoses from previous visits.  </p>
<p>The average doctor doubts the patient chart but trusts the patient story. For the foreseeable future, that is going to work much better than an algorithmically focused approach. </p>
<p>Eventually, Khosla&#8217;s version of the future (which is typical of forward-thinking geeks in health IT) will certainly happen, but I think it is still 30 years away. The technology will be ready far earlier. Our screwed up incentive systems and backward corporate politics will be holding us back. I hardly have to make this argument, however, since <a href="http://www.youtube.com/watch?v=oro19-l5M8k">Hugo Campos recently made it so well</a>.</p>
<p>Eventually, people will get better care from AI. For now, we should keep the algorithms focused on the data that we know is good and keep the doctors focused on the patients. We should be worried about making patient data accurate and reliable.</p>
<p>I promise you we will have the AI problem finished long before we have healthcare data that is reliable enough to train it.</p>
<p>Until that happens, imagine how Watson would have performed on &#8220;Jeopardy&#8221; if it had been trained on &#8220;<a href="http://en.wikipedia.org/wiki/The_Lord_of_the_Rings">Lord of the Rings</a>&#8221; and &#8220;<a href="http://en.wikipedia.org/wiki/The_Cat_in_the_Hat">The Cat in the Hat</a>&#8221; instead of encyclopedias. Until we have healthcare data that is more reliable than &#8220;The Cat in the Hat,&#8221; I will keep my doctor, and you can keep your algorithms, thank you very much. </p>
<div style="float: left;border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px;clear: both"><a href="http://shop.oreilly.com/product/0636920020110.do?cmp=il-radar-books-ai-and-healthcare"><img style="float: left;border: none;padding-right: 10px" src="http://radar.oreilly.com/2011/10/19/meaingful-use-cover-148.png" /></a><a href="http://shop.oreilly.com/product/0636920020110.do?cmp=il-radar-books-ai-and-healthcare"><strong>Meaningful Use and Beyond: A Guide for IT Staff in Health Care</strong></a> &mdash;  Meaningful Use underlies a major federal incentives program for medical offices and hospitals that pays doctors and clinicians to move to electronic health records (EHR). This book is a rosetta stone for the IT implementer who wants to help organizations harness EHR systems.</div>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://radar.oreilly.com/2012/01/software-crumple-zones-healthcare.html">Software crumple zones</a></li>
<li> <a href="http://radar.oreilly.com/2011/10/geeks-meaningful-use-aco.html">Why geeks should care about meaningful use and ACOs</a></li>
<li> <a href="http://radar.oreilly.com/2011/11/health-it-developer-contests.html">Why developers should enter health IT contests</a></li>
<li> <a href="http://radar.oreilly.com/2012/01/epatients-empowered-patients.html">Epatients: The hackers of the healthcare world</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2012/01/artificial-intelligence-healthcare.html/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>The rise of programmable self</title>
		<link>http://strata.oreilly.com/2012/01/programmable-self-motivation-hacks-digital-data.html</link>
		<comments>http://strata.oreilly.com/2012/01/programmable-self-motivation-hacks-digital-data.html#comments</comments>
		<pubDate>Wed, 11 Jan 2012 16:00:00 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[@editpick]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[health IT]]></category>
		<category><![CDATA[personal data]]></category>
		<category><![CDATA[programmable self]]></category>
		<category><![CDATA[quantified self]]></category>
		<category><![CDATA[tracking]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2012/01/programmable-self-motivation-hacks-digital-data.html</guid>
		<description><![CDATA[Taking a cue from the Quantified Self movement, the programmable self is the combination of a digital motivation hack with a digital system that tracks behavior. Here&apos;s a look at companies and projects relevant to the programmable self space. ]]></description>
				<content:encoded><![CDATA[<p><a href="http://programmableself.com">Programmable self</a> is a riff on the <a href="http://quantifiedself.com">Quantified Self</a> (QS). It is a simple concept:</p>
<p>Quantify what you want to change about yourself + motivational hacks = personal change success.</p>
<p>There are several potential &#8220;motivation hacks&#8221; that people regularly employ. The simplest of these is peer pressure. You could tell all of your co-workers every morning whether you kept your diet last night, for instance. Lots of research has shown that sort of thing is an effective motivator for change. Of course, you can make peer pressure digital by doing the same thing on Facebook/Twitter/Google+/whatever. Peer pressure has two components: shame and praise. It&#8217;s motivating to avoid shame and to get praise. Do it because of a tweet and <em>viola</em>, you have digital peer pressure motivation.</p>
<p><a href="http://www.fredtrotter.com/2011/05/26/programmable-self-reading-list/">Several books</a> have recently popularized using money, in one form or another, as a motivational tool. There is <a href="http://en.wikipedia.org/wiki/Loss_aversion">some evidence</a>, for instance, that people feel worse about losing $10 than they feel good about earning $10. This is called loss aversion, and it can easily be turned into a motivational hack. Having trouble finishing that book? Give 10 envelopes with $100 each to your best friend. Instruct them to mail the envelopes to your favorite (or most hated) charity for each month that you do not finish a chapter. Essentially, you&#8217;ve made your friend a &#8220;referee&#8221; of your motivational hack.</p>
<p>So, is there any potential to automate this process? To use software to hack your own motivation? One of the coolest applications that does just that is <a href="http://stickk.com">StickK.com</a>, which is designed to electronically manage contracts you make with yourself.</p>
<p>But that, by itself, is not programmable self. </p>
<p>Programmable self is the combination of a digital motivation hack, like StickK, with a digital system that tracks behavior, like <a href="http://www.fitbit.com/">Fitbit</a> (that&#8217;s the Quantified Self part). You have to have both. Recently, for example, Stickk started supporting the use of the <a href="http://www.withings.com/">Withings Scale</a> to support weight entries. Withings is a Wi-Fi-enabled scale that broadcasts your weight automagically to the Withings servers. From there, Withings will send your weight generally wherever you want: <a href="http://www.microsoft.com/en-us/healthvault/">HealthVault</a>, other personal health record (PHR) systems, or over to Stickk.com.  With that feature, Stickk became a programmable-self platform. </p>
<p>Stickk is pretty old, and <a href="https://loseitorloseit.com/">Lose it or Lose It</a>, which is focused specifically on losing weight, is also ancient in Internet time. It <a href="http://forge38.com/blog/2009/12/lose-it-or-lose-it-why-how-and-how-much">launched in 2009</a>. The site requires you to take a picture of a weekly weigh in (you actually photograph the scale) and send it in. That counts as digital tracking, but I wonder if it supports Withings (or if it will). </p>
<p>In October 2011, <a href="https://www.beeminder.com/">Beeminder</a> launched, billing itself as a direct Stickk competitor, but &#8220;for data geeks.&#8221; <a href="http://quantifiedself.com/2011/12/toolmaker-talk-bethany-soule-daniel-reeves-beeminder/">Indeed,  it is a little geeky</a>: Beeminder is focused on weight change and other goals that are numerically similar to weight change. The notion is that there is a proper path for the improvement of certain numbers &mdash; as well as a little &#8220;data jitter&#8221; to eliminate &mdash; in order to improve. Beeminder also <a href="https://www.beeminder.com/testimonials">refers</a> to the classical term for the lack of self discipline: <a href="http://en.wikipedia.org/wiki/Akrasia">akrasia</a> &mdash; so bonus points for that.</p>
<p>Last November, <a href="https://eatery.massivehealth.com/">The Eatery</a> launched from <a href="http://massivehealth.com">Massive Health</a>. Massive Health is a massively funded dream team, and their first app is a classic programmable-self experiment. You simply take pictures of your food with your camera (digital tracking = photos) and let others rate your food choices (motivation hack = praise/shame). It&#8217;s a good idea, and you can expect lots more from Massive Health that qualifies as programmable self.</p>
<p>Recently, <a href="http://www.gym-pact.com/">GymPact</a> made a big splash, even ending up in a <a href="http://bucks.blogs.nytimes.com/2012/01/02/gym-pact-fines-you-for-not-exercising/">New York Times blog post</a>. Gympact is an iOS (soon Android) app that lets you check in at the gym. If you fail to check in, you get charged a fee. If you do keep your commitment to go to the gym, then you also earn some of the money from all of the people who failed to go to the gym. </p>
<p>Finally, <a href="http://twitter.com/busterbenson">Buster Benson</a> and <a href="http://twitter.com/jensmccabe">Jen S. McCabe</a>  are working on <a href="http://bud.ge">Bud.ge</a>, which might be the first of the <a href="http://quantifiedself.com/2012/01/toolmaker-talk-buster-benson-budge/">programmable-self platform plays</a>.</p>
<p>All of these count as programmable self. I seriously doubt that any of these companies were aware of my <a href="http://radar.oreilly.com/2011/07/programmable-self-quantified-self.html">original interview about programmable self</a> or would even be comfortable with the term, which sounds pretty geeky and devious. (Which is, of course, why I love it.)</p>
<p>Other friends of mine in the <a href="http://shop.oreilly.com/product/0636920014614.do">serious games/games for health/gamification</a> movement would probably count as programmable self, too. But some of them seem convinced that &#8220;fun&#8221; can have a deeper component in motivation than some of the more aggressive techniques that I, and other programmable self people, seem to favor. I should also mention that I am hardly the only one in the QS movement <a href="http://quantifiedself.com/2011/11/panel-discussion-on-health-incentives/">stumbling in this direction</a>. </p>
<p>I will be writing about programmable self on Radar occasionally, but there is a lot more going on than I can track here . That&#8217;s why I&#8217;ve also made a Tumblr about the subject and filled it with all of the &#8220;<a href="http://programmableself.com">software for behavior change</a>&#8221; goodness that anyone can take. My <a href="https://twitter.com/fredtrotter">@fredtrotter</a> Twitter account is mostly focused on programmable self as well. </p>
<p>Most importantly, I want to hear about what you have tried to do with your own personal change hacks, especially those that impact your health in one way or another. For that, I have set up a <a href="http://groups.google.com/group/programmable-self/">Programmable Self Google Group</a>. Please join us. Some of the top minds in behavior change are already subscribers.</p>
<p>The Quantified Self movement is not primarily about the &#8220;tool creators&#8221; who make stuff for people to use, but a movement of users who defy the boundaries of tools and manage to create innovative quantification tools on their own. Many of these efforts also count as programmable-self approaches. No discussion of programmable self can ignore the work of individuals, so here is a decidedly non-exhaustive list of people innovating in this space:</p>
<ul>
<li> <a href="http://quantifiedself.com/2012/01/amelia-greenhall-weigh-everyday-understanding/">Amelia Greenhall: Weigh Everyday = Understanding</a></li>
<li> <a href="http://quantifiedself.com/2011/12/nick-crocker-floss-the-teeth-you-want-to-keep/">Nick Crocker: Floss the Teeth You Want To Keep </a></li>
<li><a href="http://quantifiedself.com/2011/08/nancy-dougherty-on-mindfulness-pills/">Nancy Dougherty on Mindfulness Pills  </a></li>
</ul>
<div style="float: left;border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px;clear: both"><a href="https://en.oreilly.com/strata2012/public/regwith/radar20?cmp=il-radar-st12-programmable-self"><img style="float: left;border: none;padding-right: 10px" src="http://radar.oreilly.com/2011-strata-ca-promo.png" /></a><a href="https://en.oreilly.com/strata2012/public/regwith/radar20?cmp=il-radar-st12-programmable-self"><strong>Strata 2012</strong></a> &mdash;  The 2012 Strata Conference, being held Feb. 28-March 1 in Santa Clara, Calif., will offer three full days of hands-on data training and information-rich sessions. Strata brings together the people, tools, and technologies you need to make data work.</p>
<p><a href="https://en.oreilly.com/strata2012/public/regwith/radar20?cmp=il-radar-st12-programmable-self"><strong>Save 20% on registration with the code RADAR20</strong></a></div>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2012/01/programmable-self-motivation-hacks-digital-data.html/feed</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Epatients: The hackers of the healthcare world</title>
		<link>http://strata.oreilly.com/2012/01/epatients-empowered-patients.html</link>
		<comments>http://strata.oreilly.com/2012/01/epatients-empowered-patients.html#comments</comments>
		<pubDate>Thu, 05 Jan 2012 15:00:00 +0000</pubDate>
		<dc:creator>Fred Trotter</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[@editpick]]></category>
		<category><![CDATA[@home]]></category>
		<category><![CDATA[empowered patient]]></category>
		<category><![CDATA[epatient]]></category>
		<category><![CDATA[Health 2.0]]></category>
		<category><![CDATA[health IT]]></category>
		<category><![CDATA[quantified self]]></category>

		<guid isPermaLink="false">http://blogs.oreilly.com/radar/2012/01/epatients-empowered-patients.html</guid>
		<description><![CDATA[The epatient community uses digital tools and the connective power of the Internet to empower patients. Here, Fred Trotter offers epatient resources and first steps. ]]></description>
				<content:encoded><![CDATA[<p>I help build open source software tools that patients can use to have greater control and influence over their own healthcare (like the <a href="http://directproject.org/">Direct Project</a> and <a href="http://yourdoctorsadvice.org">Your Doctors Advice</a>). As as result, I&#8217;ve become quite familiar with other tools that do the same sorts of things. There is a community of patients who are deeply interested in the ways in which they can become more engaged and how they can specifically use technology to achieve this. This community calls themselves <a href="http://e-patients.net/">epatients</a>. The epatient community asked me to write a short collection of resources for &#8220;becoming an epatient.&#8221; </p>
<p>The &#8220;e&#8221; in epatient is intentionally obscure. The initial assumption is that the &#8220;e&#8221; stands for &#8220;electronic,&#8221; as it does in &#8220;email.&#8221; But in fact, the &#8220;e&#8221; stands for &#8220;engaged&#8221; or &#8220;empowered.&#8221; Nonetheless, reference to email is intentional: The epatient community recognizes that leveraging data is a critical part of empowering a person who happens to be sick. Patients must be &#8220;electronic&#8221; to become fully &#8220;engaged.&#8221; I think of epatients as the healthcare equivalent of makers and hackers. More importantly, they are the people I have in mind when I write software.</p>
<p>Engaged patients get better healthcare. Not just a little bit better. Much better. This is not a thesis I am prepared to defend here, except to drop a link to the <a href="http://www.jopm.org/">Journal of Participatory Medicine</a>, which is a good resource for those seeking a full chronicle of this engaged effect (the benefits of engagement have been documented in the healthcare literature for years).</p>
<p>Let&#8217;s go over some simple concepts that the rest of my epatient advice and resources are based on:</p>
<ul>
<li> Your doctors are probably not managing your healthcare information. You should assume it is your responsibility. Otherwise, the people involved in your treatment process will often presume that this information simply does not exist. </li>
<li> Other patients, with similar conditions to yours, can have better information about your health problem than your doctors have.</li>
<li> Like the highway system, the healthcare system is generally beneficial, but there are dangers, and you need to learn to avoid them. </li>
</ul>
<p>And here are two things you should keep in mind:</p>
<ul>
<li> Whenever I say &#8220;patient,&#8221; I actually mean &#8220;the patient/caregiver team.&#8221; Often, the patient will not have the energy to do what is necessary to be fully engaged. It is critical that when the patient cannot be an epatient, that at least the patient&#8217;s team be an epatient proxy.</li>
<li> In regard to the &#8220;better information&#8221; I mentioned above, I don&#8217;t mean that your doctor is typically wrong and another patient is typically right about your diagnosis or treatment options. While that does happen on occasion, it is not the norm. Another patient&#8217;s information is &#8220;better&#8221; because your doctor typically does not have the visceral experience of being a patient. Your doctor doesn&#8217;t understand which lotion can make all of the difference or what position might let you get some sleep after a rough procedure. There is simply no substitute for experience. Sometimes that experience can make your life a little easier,  and sometimes it can help you get the right treatments that literally will save your life. (This is the <a href="http://www.amazon.com/dp/B0041T4FKW">gist of epatient Dave deBronkart&#8217;s story</a>.)</li>
</ul>
<p>The first thing an epatient needs to do is find the right epatient community. For any common illness or group of symptoms, there is a community of people who are already connecting with each other over the Internet. For  people with a recent cancer diagnosis <a href="http://www.acor.org/">ACOR</a> is probably the right place to start. This is one of the oldest, largest and most active epatient communities. Sometimes, the tools that a particular community of epatients choose might be a little old school &mdash; things like traditional forums, or even list-serves. Do not judge a community by its adoption of technology, judge it by its activity level.</p>
<p>There are efforts to create technology solutions that are specifically designed to enable patient communities. For a good example, check out <a href="http://www.patientslikeme.com/">PatientsLikeMe</a>, which is representative of patient communities drifting more toward &#8220;patient social media.&#8221; I am happy to say that epatient tool makers like me are coming up with cool stuff constantly. But the mature epatient is not impressed by new and shiny. Remember, it is the relationships and insights that matter here, not the technology. Whenever possible, you want the right information delivered over the wrong technology medium, rather than the other way around.</p>
<p>For any serious common healthcare condition, there are probably several different communities of patients online that you can meet with. For common conditions, you might also be able to find a specific meetup in your area, or at least some kind of <a href="http://www.meetup.com/Houston-Wellness-Group/">epatient meetup</a>. If you have a rare disease, you might have trouble finding local resources, and you might only have one or two online communities to choose from. But for rare diseases, the online patient community as a whole is typically better informed than the average primary care physician on a particular condition. Sometimes a physician might not understand or recommend treatment options that are offered outside their local healthcare community, an issue that can be compounded when they are unfamiliar with a particular type of diagnosis. It is even more critical for rare disease patients or caregivers to find a patient community and listen to them. For rare diseases and conditions, it is critical to understand what treatment options are offered in other areas and to be fully informed regarding sham clinics that often operate in under-regulated areas. </p>
<p> How do you know you&#8217;ve found the right community? Here&#8217;s a good rule of thumb: If you can&#8217;t get a real response to a simple health question from the community within 24 hours, you should probably move on if there are other options. Healthy epatient communities are vibrant and alive. </p>
<p>You should also consider joining the <a href="http://participatorymedicine.org/">Society for Participatory Medicine</a> (SPM). The <a href="http://lists.participatorymedicine.org/scripts/wa-SPM.exe?INDEX">mailing list for the SPM</a> is one of the most important cross-condition meeting places for epatients.</p>
<p>If you are interested in the technology side, also pay attention to <a href="http://www.health2con.com/">Health 2.0</a> (#health20), which has a <a href="http://www.health2con.com/conferences/san-francisco-2011/patients-2-0/">Patients 2.0</a> (#patients20) community that is concerned with the &#8220;electronic&#8221; part of epatient. Generally, I find that Health 2.0 is the right place to find out-of-the-box thinking on healthcare information systems, especially those offered directly to consumers. I go there to get the down-low on my competitors and to find collaborators. Matthew Holt from Health 2.0 cross pollinates <a href="http://thehealthcareblog.com/">the healthcare blog</a>, which is also worth adding to your RSS reader.</p>
<p>The cross-disease epatient community uses Twitter more than Facebook or Google+ (for the time being), and the <a href="https://twitter.com/#!/search/%23epatient">#epatient</a> hashtag is your friend. I also recommend following <a href="http://twitter.com/epatientdave">@epatientdave</a>,  <a href="http://twitter.com/reginaholliday">@reginaholliday</a>, and <a href="http://twitter.com/hhask">@hhask</a> as good starter accounts. If you care about the tech aspects of epatients, then check out <a href="http://twitter.com/nateosit">@NateOsit</a> or myself (<a href="http://twitter.com/fredtrotter">@fredtrotter</a>). And if you&#8217;re interested in the activist portion of the epatient community, then look at the <a href="http://occupyhealthcare.net/">Occupy Healthcare</a> roster.</p>
<p>Once your healthcare information gets past a certain basic level of complexity, it becomes important that you have a personal health record (PHR). Your doctors should, eventually, be able to update your PHR. That means you need a PHR that supports the <a href="http://directproject.org/">Direct Project</a>. Right now, that is a short list, with Microsoft&#8217;s <a href="http://www.microsoft.com/en-us/healthvault/">HealthVault</a> at the top. As a FOSS advocate, that is not an endorsement I make lightly, but Microsoft&#8217;s health IT team has been surprisingly supportive of open source (i.e. patient empowering) technologies and they have relatively mature Direct support. <a href="http://indivohealth.org/">Indivo</a> is pretty much the only one to consider if you want to run your own PHR (it&#8217;s open source), but its Direct support is lagging. Something you should keep in mind: If you have only one doctor at a time, you do not need to obsess about the coordination of your own healthcare information; if you have more than one doctor, you do. You can manage this information with a PHR, or a notebook, or even an <a href="http://e-patients.net/u/2011/11/kss-life-on-drugs.png">infographic</a>. What matters is that you need to do it, and do it carefully. </p>
<p>Some of the people in the Quantified Self movement are patients who are trying to use data about themselves to improve their health. If that interests you (if you have a chronic condition, it should), then take a look at <a href="http://quantifiedself.com/">the Quantified Self website</a>.</p>
<p>There are two types of patients: those who are in crisis mode and those who are in maintenance mode. If you are in crisis mode, and you feel like your whole world has been turned upside down, then I recommend you read Dr. Carolyn Oliver&#8217;s book &#8220;<a href="http://www.amazon.com/Cautious-Care-Carolyn-Oliver-M-D/dp/1440418675">Cautious Care: A Guide for patients</a>.&#8221; The newest edition of this book is also available as a series of free PDF downloads from the <a href="http://www.cautiouspatient.org/">Cautious Patient Foundation website</a>. Here are direct links to the <a href="http://www.cautiouspatient.org/downloads/Balancing-HospitalAdvice-12-1-11c.pdf">hospital part</a> and the <a href="http://www.cautiouspatient.org/downloads/Balancing-OfficeCare-12-1-11c.pdf">outpatient part</a>. Dr. Oliver&#8217;s book is like a &#8220;defensive driving crash course&#8221; for the healthcare system. If you are going to be staying up all night at the hospital (as a caregiver or a patient), and you have no idea what you should be paying attention to, this is the book that you should be reading.  <em>(Disclosure: I work for Dr. Oliver at the Cautious Patient Foundation.)</em></p>
<p>If you are in maintenance mode, you should still start with Dr. Oliver&#8217;s books, but you can also extend your reading to the classics of the epatient movement. That process should always begin with the  <a href="http://e-patients.net/e-Patients_White_Paper.pdf">e-patient white paper</a>, and I also recommend the book &#8220;<a href="http://www.amazon.com/dp/B0041T4FKW/">Laugh, Sing, and Eat Like a Pig</a>&#8221; for ethos purposes. Go to <a href="http://e-patients.net">e-patients.net</a> and click &#8220;categories&#8221; to find the content on that blog that seems relevant to you. </p>
<p>If you are well, consider spending some time with these resources. When you become sick you do not want to be learning how this stuff works.  You will probably ignore this advice, and that&#8217;s fine. Just try to remember that this advice is here when you need it. Also, remember that these are not instructions for how to handle your illness; they&#8217;re instructions for <em>finding</em> the instructions on how to handle your illness.</p>
<p>If you are newly sick, then what I have written here will sound overwhelming. You are tired, scared and confused (and you don&#8217;t need some arrogant geek telling you that you need to use technology to look over your doctor&#8217;s shoulder). Recognize that you will not have the capacity to bring your intellect to bear on this the way you could when you were healthy. It is very tempting to just check out and trust that the healthcare system is going to take good care of you. But please trust me on this: That&#8217;s a bad idea. The default settings on the healthcare system really suck. Your doctors will provide you with options, but they do not have the right perspective to help you make the right choices. Patients often ask their doctors, &#8220;What would you do if you were me?&#8221; The very, very best doctors reply, &#8220;But I am not you.&#8221;  What you need to find are other people who share your values, who are living with the consequences of having made the choices that you are facing. I say this with both a deep sense of pride and a great deal of humility: My spiritual predecessors (health geeks) have made sure that the Internet can provide you with connections to these people.</p>
<p> The Internet will make it easier to connect, which is awesome. If you cannot summon the strength to do this, fine &mdash; entrust someone you love with the task of connecting for you. I truly believe that it is impossible to be an epatient alone. Moreover, I believe that it is impossible not to be an epatient if you have shared your burden with even one other patient.  </p>
<p> For those who need encouragement with their ongoing efforts to connect to others with the same suffering, I leave you with some video happiness. Be sure to watch the whole thing:</p>
</p>
<div style="float: left;border-top: thin gray solid;border-bottom: thin gray solid;padding: 20px;margin: 20px 2px;clear: both"><a href="http://shop.oreilly.com/product/0636920020110.do?cmp=il-radar-books-epatient-resources"><img style="float: left;border: none;padding-right: 10px" src="http://radar.oreilly.com/2011/10/19/meaingful-use-cover-148.png" /></a><a href="http://shop.oreilly.com/product/0636920020110.do?cmp=il-radar-books-epatient-resources"><strong>Meaningful Use and Beyond: A Guide for IT Staff in Health Care</strong></a> &mdash;  Meaningful Use underlies a major federal incentives program for medical offices and hospitals that pays doctors and clinicians to move to electronic health records (EHR). This book is a rosetta stone for the IT implementer who wants to help organizations harness EHR systems.</div>
<p><strong>Related:</strong></p>
<ul>
<li> <a href="http://radar.oreilly.com/2011/10/geeks-meaningful-use-aco.html">Why geeks should care about meaningful use and ACOs</a></li>
<li> <a href="http://radar.oreilly.com/2011/11/health-it-developer-contests.html">Why developers should enter health IT contests</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://strata.oreilly.com/2012/01/epatients-empowered-patients.html/feed</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
	</channel>
</rss>
