
Commit 8f25713

marycrawford authored
Edit README by adding table of contents and update overview, notices and disclaimers (#498)
* update readme with table of contents and revised notices
* add section for CDC Managed Repo
* resolve formatting issue
* organize docs, update overview and add video and considerations
* update verbiage and saved video
* remove extra space and typo
* remove extra dash and replace with comma

---------

Co-authored-by: marycrawford <[email protected]>
1 parent 626cce4 commit 8f25713

12 files changed: +110 -51 lines changed

README.md

+110-45
@@ -1,26 +1,31 @@
1-
# CDCgov GitHub Organization Open Source Project Template
1+
# Table of Contents
2+
[1. Overview](#1-overview)
3+
- [The Problem](#the-problem)
4+
- [The Solution](#the-solution)
5+
- [Future Considerations](#future-considerations)
26

3-
**Template for clearance: This project serves as a template to aid projects in starting up and moving through clearance procedures. To start, create a new repository and implement the required [open practices](open_practices.md), train on and agree to adhere to the organization's [rules of behavior](rules_of_behavior.md), and [send a request through the create repo form](https://forms.office.com/Pages/ResponsePage.aspx?id=aQjnnNtg_USr6NJ2cHf8j44WSiOI6uNOvdWse4I-C2NUNk43NzMwODJTRzA4NFpCUk1RRU83RTFNVi4u) using language from this template as a Guide.**
7+
[2. Notices](#2-notices)
8+
- [2.1 Privacy Standard Notice](#21-privacy-standard-notice)
9+
- [2.2 Records Management Standard Notice](#22-records-management-standard-notice)
10+
- [2.3 Domestic Copyright Protection Notice](#23-domestic-copyright-protection-notice)
11+
- [2.4 Open Source Notice](#24-open-source-notice)
12+
- [2.5 License Standard Notice](#25-license-standard-notice)
13+
- [2.6 GitHub Notice](#26-github-notice)
14+
- [2.7 Contributing Standard Notice](#27-contributing-standard-notice)
415

5-
**General disclaimer** This repository was created for use by CDC programs to collaborate on public health related projects in support of the [CDC mission](https://www.cdc.gov/about/organization/mission.htm). GitHub is not hosted by the CDC, but is a third party website used by CDC and its partners to share information and collaborate on software. CDC use of GitHub does not imply an endorsement of any one particular service, product, or enterprise.
16+
[3. General Disclaimer](#3-general-disclaimer)
617

7-
## Access Request, Repo Creation Request
18+
[4. Other Related Documents](#4-other-related-documents)
819

9-
* [CDC GitHub Open Project Request Form](https://forms.office.com/Pages/ResponsePage.aspx?id=aQjnnNtg_USr6NJ2cHf8j44WSiOI6uNOvdWse4I-C2NUNk43NzMwODJTRzA4NFpCUk1RRU83RTFNVi4u) _[Requires a CDC Office365 login, if you do not have a CDC Office365 please ask a friend who does to submit the request on your behalf. If you're looking for access to the CDCEnt private organization, please use the [GitHub Enterprise Cloud Access Request form](https://forms.office.com/Pages/ResponsePage.aspx?id=aQjnnNtg_USr6NJ2cHf8j44WSiOI6uNOvdWse4I-C2NUQjVJVDlKS1c0SlhQSUxLNVBaOEZCNUczVS4u).]_
1020

11-
## Related documents
21+
# 1. Overview
22+
The Intelligent Data Workflow Automation (IDWA) ReportVision Project aims to support the Office of Public Health Data, Surveillance, and Technology (OPHDST) in enhancing the ability of state, local, territorial, and tribal public health departments to manage, search, and secure critical data. As a key division of the CDC, OPHDST plays a vital role in public health infrastructure.
1223

13-
* [Open Practices](open_practices.md)
14-
* [Rules of Behavior](rules_of_behavior.md)
15-
* [Thanks and Acknowledgements](thanks.md)
16-
* [Disclaimer](DISCLAIMER.md)
17-
* [Contribution Notice](CONTRIBUTING.md)
18-
* [Code of Conduct](code-of-conduct.md)
24+
Please see the [UserGuide](/docs/user_guide.md) to get a technical overview of this project.
1925

20-
## Overview
21-
22-
Please see the [UserGuide](./user_guide.md) to get a overview of this project.
26+
## The Problem
2327

28+
The exchange of public health data is hindered by outdated, manual processes. Some state, local, tribal, and territorial health departments still rely on fax, email, and physical mail to receive case data, requiring staff to manually review and re-enter information from lab reports. This labor-intensive process can take up to 20 minutes per report, and electronic data extraction remains cumbersome and error-prone, particularly when handling multiple documents. As a result, low accuracy in data ingestion impedes the ability of public health departments to efficiently process and utilize critical health data.
2429

2530
## Public Domain Standard Notice
2631
This repository constitutes a work of the United States Government and is not
@@ -31,46 +36,106 @@ All contributions to this repository will be released under the CC0 dedication.
3136
submitting a pull request you are agreeing to comply with this waiver of
3237
copyright interest.
3338

34-
## License Standard Notice
35-
The repository utilizes code licensed under the terms of the Apache Software
36-
License and therefore is licensed under ASL v2 or later.
39+
## The Solution
40+
41+
ReportVision is a powerful tool designed to automate the reading and extracting of data from lab reports, helping public health departments streamline their workflows. Leveraging the power of the Tesseract engine and Microsoft Azure Cloud Platform, ReportVision allows teams to create customizable, data-driven templates for automatic extraction and annotation of multiple datasets—delivering notable accuracy and speed.
42+
43+
The goal is simple yet powerful: to provide jurisdictions with a "starter kit" that empowers them to rapidly build their own resources, provision scalable Azure infrastructure, or seamlessly replicate similar configurations in Amazon Web Services (AWS) or Google Cloud Platform (GCP).
44+
45+
With ReportVision, public health departments can move from cumbersome, error-prone processes to a highly efficient, automated workflow that supports critical decision-making with fast, reliable data.
46+
47+
This application offers a robust framework for public health departments and personnel to efficiently extract relevant data from lab reports utilizing an advanced Optical Character Recognition (OCR) model. This OCR technology significantly enhances both the speed and accuracy of data extraction, taking your data processing capabilities to the next level.
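
As a rough illustration of the template-driven extraction described above, here is a minimal Python sketch. It is not ReportVision's actual code: the `Field` class, the `extract` function, the character-grid "page", and the `fake_ocr` stand-in are all assumptions made for this example. A real pipeline would crop image regions and pass them to an OCR engine such as Tesseract.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Assumption: a "page" is a grid of characters standing in for a scanned image.
Page = List[str]
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), end-exclusive


@dataclass(frozen=True)
class Field:
    """Hypothetical template entry: a field name plus the region to read."""
    name: str
    box: Box


def crop(page: Page, box: Box) -> Page:
    """Return only the rows and columns that fall inside the box."""
    top, left, bottom, right = box
    return [row[left:right] for row in page[top:bottom]]


def extract(page: Page, template: List[Field],
            ocr: Callable[[Page], str]) -> Dict[str, str]:
    """Run the supplied OCR callable on each templated region."""
    return {f.name: ocr(crop(page, f.box)).strip() for f in template}


def fake_ocr(region: Page) -> str:
    """Stand-in for a real OCR engine: joins the non-empty rows of text."""
    return " ".join(row.strip() for row in region if row.strip())


# Made-up sample report; each row is 20 characters wide.
page = [
    "Test:    Lead level ",
    "Result:  3.1 ug/dL  ",
]
template = [
    Field("test", (0, 9, 1, 20)),
    Field("result", (1, 9, 2, 20)),
]
fields = extract(page, template, fake_ocr)
print(fields)
```

Passing the OCR step in as a callable keeps the template logic independent of any particular engine, which is one way a starter kit could stay portable across cloud providers.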
3748

38-
This source code in this repository is free: you can redistribute it and/or modify it under
39-
the terms of the Apache Software License version 2, or (at your option) any
40-
later version.
49+
Check out the following video to see the updated OCR model in action and how ReportVision improves both the speed and accuracy of data extraction.
4150

42-
This source code in this repository is distributed in the hope that it will be useful, but WITHOUT ANY
43-
WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A
44-
PARTICULAR PURPOSE. See the Apache Software License for more details.
51+
<div align="center">
52+
<video width="500" height="280" controls>
53+
<source src="images-and-media/reportvision-demo.mp4" type="video/mp4">
54+
Video Extracting Data From Lab Reports.
55+
</video>
56+
</div>
4557

46-
You should have received a copy of the Apache Software License along with this
47-
program. If not, see http://www.apache.org/licenses/LICENSE-2.0.html
58+
## Future Considerations
4859

49-
The source code forked from other open source projects will inherit its license.
60+
The current version of the application is optimized only for PDF-based lab reports. However, as demand from public health departments and personnel continues to grow, we see significant potential to expand support for additional file formats in future updates.
61+
62+
+ [Return to Table of Contents](#table-of-contents).
63+
64+
# 2. Notices
65+
66+
## 2.1 Privacy Standard Notice
67+
This repository contains only non-sensitive, publicly available data and information. All material and community participation is covered by the [Disclaimer](DISCLAIMER.md) and [Code of Conduct](code-of-conduct.md).
5068

51-
## Privacy Standard Notice
52-
This repository contains only non-sensitive, publicly available data and
53-
information. All material and community participation is covered by the
54-
[Disclaimer](DISCLAIMER.md)
55-
and [Code of Conduct](code-of-conduct.md).
5669
For more information about CDC's privacy policy, please visit [http://www.cdc.gov/other/privacy.html](https://www.cdc.gov/other/privacy.html).
5770

58-
## Contributing Standard Notice
59-
Anyone is encouraged to contribute to the repository by [forking](https://help.github.com/articles/fork-a-repo)
60-
and submitting a pull request. (If you are new to GitHub, you might start with a
61-
[basic tutorial](https://help.github.com/articles/set-up-git).) By contributing
62-
to this project, you grant a world-wide, royalty-free, perpetual, irrevocable,
63-
non-exclusive, transferable license to all users under the terms of the
64-
[Apache Software License v2](http://www.apache.org/licenses/LICENSE-2.0.html) or
65-
later.
71+
+ [Return to Table of Contents](#table-of-contents).
6672

67-
All comments, messages, pull requests, and other submissions received through
68-
CDC including this GitHub page may be subject to applicable federal law, including but not limited to the Federal Records Act, and may be archived. Learn more at [http://www.cdc.gov/other/privacy.html](http://www.cdc.gov/other/privacy.html).
73+
## 2.2 Records Management Standard Notice
6974

70-
## Records Management Standard Notice
7175
This repository is not a source of government records, but is a copy to increase
7276
collaboration and collaborative potential. All government records will be
7377
published through the [CDC web site](http://www.cdc.gov).
7478

75-
## Additional Standard Notices
76-
Please refer to [CDC's Template Repository](https://github.com/CDCgov/template) for more information about [contributing to this repository](https://github.com/CDCgov/template/blob/main/CONTRIBUTING.md), [public domain notices and disclaimers](https://github.com/CDCgov/template/blob/main/DISCLAIMER.md), and [code of conduct](https://github.com/CDCgov/template/blob/main/code-of-conduct.md).
79+
+ [Return to Table of Contents](#table-of-contents).
80+
81+
82+
## 2.3 Domestic Copyright Protection Notice
83+
84+
This repository is a work of the United States Government and is not subject to domestic copyright protection under 17 U.S.C. § 105. If published in the public domain within the United States, copyright and related rights worldwide will be waived through the [CC0 1.0 Universal public domain dedication](https://creativecommons.org/publicdomain/zero/1.0/).
85+
86+
+ [Return to Table of Contents](#table-of-contents).
87+
88+
## 2.4 Open Source Notice
89+
90+
This repository is open source and follows [open practices](docs/open_practices.md). Contributors are expected to adhere to the organization's [rules of behavior](docs/rules_of_behavior.md).
91+
92+
+ [Return to Table of Contents](#table-of-contents).
93+
94+
## 2.5 License Standard Notice
95+
96+
The code in this repository is licensed under the Apache License 2.0 (ASL v2), or any later version at your discretion.
97+
98+
You are free to use, redistribute, and modify the source code under the terms of the Apache License 2.0. However, this software is distributed "as is", without any warranties of any kind, either express or implied, including but not limited to the warranties of merchantability, fitness for a particular purpose, or non-infringement. In no event shall the authors or copyright holders be liable for any claim, damages, or other liability, whether in an action of contract, tort, or otherwise, arising from, out of, or in connection with the software or the use or other dealings in the software.
99+
100+
For full licensing details, refer to the [Apache License 2.0](http://www.apache.org/licenses/LICENSE-2.0.html).
101+
102+
Additionally, any code forked from this open-source project will retain its original license.
103+
104+
+ [Return to Table of Contents](#table-of-contents).
105+
106+
## 2.6 GitHub Notice
107+
108+
GitHub is not hosted by the CDC, but is a third party website used by CDC and its partners to share information and collaborate on software. CDC use of GitHub does not imply an endorsement of any one particular service, product, or enterprise. If you are new to GitHub, we recommend starting with this
109+
[basic tutorial](https://help.github.com/articles/set-up-git) to familiarize yourself with version control and collaboration.
110+
111+
+ [Return to Table of Contents](#table-of-contents).
112+
113+
## 2.7 Contributing Standard Notice
114+
115+
While we encourage continuous development of this repository's codebase, there is currently no designated department overseeing its management. If you'd like to contribute, you have two options:
116+
117+
1. Clone the repository and create a new repository in your organization's codebase with the changes you wish to implement.
118+
- This option allows you to manage the changes independently within your own organization's environment.
119+
2. Submit a pull request and contact the CDC to inquire whether a department has been assigned to manage the repository.
120+
- If a CDC department is designated, you can coordinate with them for further changes.
121+
- _Note_: All comments, messages, pull requests, and other submissions received through
122+
CDC including this GitHub page may be subject to applicable federal law, including but not limited to the Federal Records Act, and may be archived. Learn more at [http://www.cdc.gov/other/privacy.html](http://www.cdc.gov/other/privacy.html).
123+
- Also see [CONTRIBUTING.md](docs/CONTRIBUTING.md) and [CDC Managed Repository Guidance](#4-cdc-managed-repository-guidance).
124+
125+
+ [Return to Table of Contents](#table-of-contents).
126+
127+
# 3. General Disclaimer
128+
129+
This repository was created for use by CDC programs to collaborate on public health related projects in support of the [CDC mission](https://www.cdc.gov/about/cdc/?CDC_AAref_Val=https://www.cdc.gov/about/organization/mission.htm).
130+
131+
+ [Return to Table of Contents](#table-of-contents).
132+
133+
# 4. Other Related Documents
134+
135+
* [Open Practices](docs/open_practices.md)
136+
* [Rules of Behavior](docs/rules_of_behavior.md)
137+
* [Disclaimer](docs/DISCLAIMER.md)
138+
* [Contribution Notice](docs/CONTRIBUTING.md)
139+
* [Code of Conduct](docs/code-of-conduct.md)
140+
* [Review Guidelines](docs/REVIEW_GUIDELINES.md)
141+
* [Review SLAS](docs/REVIEW_SLAS.md)
File renamed without changes.

DISCLAIMER.md docs/DISCLAIMER.md

File renamed without changes.

LICENSE docs/LICENSE

File renamed without changes.
File renamed without changes.

REVIEW_SLAS.md docs/REVIEW_SLAS.md

File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.

user_guide.md docs/user_guide.md

File renamed without changes.
63 MB
Binary file not shown.

thanks.md

-6
This file was deleted.
