GTamilSelvan07
diff --git a/‎.gitignore
+1-2 b/‎.gitignore
+1-2
diff --git a/‎LICENSE
-1 b/‎LICENSE
-1
diff --git a/‎LICENSE.html
+136 b/‎LICENSE.html
+136
diff --git a/‎README.md
100644100755
+172-32 b/‎README.md
100644100755
+172-32
diff --git a/‎codes/dataloader/README.MD
+8 b/‎codes/dataloader/README.MD
+8
@@ -1,5 +1,4 @@
 __pycache__/
 .spyproject/
 temp*
-zip_unzip_files.py
-codes/
+zip_unzip_files.py
@@ -0,0 +1,136 @@
+<!DOCTYPE html>
+<html>
+<body>
+
+<h3>Database License Agreement</h3>
+<br>
+By accepting the terms of this document, the User (defined as the person named in the request for access and who has accepted the terms of this Database License Agreement), who will make use of the database or the database interface created by Pritam Sarkar and Ali Etemad (“Licensors”), agrees to the following:
+<br>
+
+<ol>
+<li> 
+<strong>Licensed Database</strong>
+<br>
+The Licensed Database means the database (including both the actual data as well as the interface to the database), or other material to which the User applied this Database License Agreement (DLA) entitled “A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work” and made available through the Queen’s University Dataverse portal.
+</li>
+<br>
+
+<li> 
+<strong>Scope of the License</strong>
+<br>
+  <ol type=a>
+  <li> <strong>Academic research </strong>
+  <br>
+  The User is granted a worldwide, royalty-free, non-exclusive, non-transferable, and non-sublicensable license to use the Licensed Database only for academic research.
+  </li>
+  <li> <strong>Moral Rights</strong>
+  <br>
+  Moral rights, such as the right of integrity, are not licensed under this Database License Agreement, nor are publicity, privacy, and/or other similar personality rights. The Licensors, to the extent possible, waive and/or agree not to assert any such rights held by the Licensors to the limited extent necessary to allow the User solely to exercise the rights licensed hereunder.
+  </li>
+  <li> <strong>Prohibited Use</strong>
+  <br>
+  The User acknowledges that the Licensed Database contains personal information of individuals that could be used to identify those individuals, either alone or in conjunction with other data. The User shall not use the contents of the Licensed Database in any research where the objective is to directly or indirectly identify the persons whose personal information may be contained therein. Any such use will be considered a breach of this Database License Agreement and will result in its termination with immediate effect in accordance with section 8(b) below.
+  </li>
+  <li><strong>Commercial use</strong>
+  <br>
+  The User may not use the Licensed Database for any non-academic purpose. The Licensed Database may not be used for any illegal or unlawful purposes whether commercial or not. Non-academic purposes include, but are not limited to:
+  <br>
+  <ul>
+  <li> proving the efficiency of commercial systems
+  <li> training or testing of commercial systems
+  <li> selling data from the dataset <li> creating military applications
+  <li> developing governmental systems used in public spaces
+  </ul>
+  Where the User wishes to use the Licensed Database for non-academic purposes, such use must be under a separate agreement negotiated in good faith between the parties.
+  </li>
+  </ol>
+
+<br>
+<li> <strong>Responsibility</strong>
+<br>
+This Database License Agreement must be accepted by a person with a permanent position at an academic institute. 
+</li>
+
+<br>
+<li> <strong>Distribution</strong>
+<br>
+The User may not grant anyone access to the Licensed Database by giving out their username and password. The User may not distribute the Licensed Database or portions thereof in any way, with the exception of using small portions of data for the exclusive purpose of clarifying academic publications or presentations. Note that publications will have to comply with the terms stated in Section 6. 
+</li>
+
+
+<br>
+<li> <strong>Access</strong>
+<br>
+The User may only use the Licensed Database after the User has accepted the terms of this Database License Agreement (DLA). 
+</li>
+
+<br>
+<li> <strong>Publications</strong>
+<br>
+Publications include not only journal articles, but also presentations for conferences or educational purposes or other written disseminations, including pre-print repositories (e.g. arXiv), arising from research using the Licensed Database.
+<br>
+All publications that report on research that uses any of the Licensed Database must cite the following paper as the source of the Licensed Database.
+<br><br>
+Sarkar, P., Posen, A. and Etemad, A., 2022. <i>AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work</i>. <i>arXiv preprint arXiv:2205.06887</i>.
+<br>
+<p>
+BibTeX entry:<br>
+<code>
+@misc{sarkar2022avcaffe,
+title={AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work},
+author={Pritam Sarkar and Aaron Posen and Ali Etemad},
+year={2022},
+eprint={2205.06887},
+archivePrefix={arXiv},
+primaryClass={cs.HC}} 
+</code></p>
+</li>
+	
+
+<br>
+<li> <strong>No Warranty</strong>
+<br>
+<strong>
+The Licensed Database is provided as-is and as-available, and without any representations or warranties of any kind, whether express, implied, statutory, or other. This includes, without limitation, warranties of title, merchantability, fitness for a particular purpose, non-infringement, absence of latent or other defects, accuracy, or the presence or absence of errors, whether or not known or discoverable.  
+<br>
+In no event will the Licensors be liable to the User on any legal theory (including, without limitation, negligence) or otherwise for any direct, special, indirect, incidental, consequential, punitive, exemplary, or other losses, costs, expenses, or damages arising out of this Database License Agreement or use of the Licensed Database, even if the Licensors have been advised of the possibility of such losses, costs, expenses, or damages. User assumes all risk and any liability arising from its use of the Licensed Database.
+</strong>
+</li>
+
+
+
+
+<br>
+<li> <strong>Term and Termination</strong>
+<br>
+<br>
+  <ol type=a>
+  <li> Unless otherwise terminated by operation of law or by acts of the parties in accordance with the terms of this DLA, this DLA is in force from the date the terms are accepted by the User and remains in effect until the Licensed Database is no longer subject to a prohibition on its use without the approval of the Licensors. </li>
+  <li> If the User fails to comply with this DLA, then the rights granted hereunder will terminate automatically. </li>
+  <li> Where the right to use the Licensed Database terminated under section 8(a), it reinstates: 
+    <ol type=i>
+    <li> automatically as of the date the violation is cured, provided it is cured within 30 days of the User’s discovery of the violation; or </li>
+    <li> upon express reinstatement by the Licensors. </li>
+    </ol>
+   <li>Section 8(c) does not affect any rights the Licensors may have to seek remedies for the User’s violation of this DLA.</li>
+   <li>Sections 1, 6, 7, 8, and 9 survive termination of this DLA. </li>
+   
+   </ol>
+</li>
+
+
+<br>
+<li> <strong>Interpretation</strong>
+<br>
+This DLA does not, and shall not be interpreted to, reduce, limit, restrict, or impose conditions on any use of the Licensed Database that could lawfully be made without permission under this DLA.
+<br>
+If any provision of this DLA is deemed unenforceable, it shall be automatically reformed to the minimum extent necessary to make it enforceable. If the provision cannot be reformed, it shall be severed from this DLA without affecting the enforceability of the remaining terms and conditions.
+</li>
+
+</ol>
+
+
+
+
+</body>
+</html>
@@ -1,32 +1,172 @@
-<h1 align="center"> 
-AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
-</h1>
-
-<h3 align="center">
-Under review.
-</h3>
-
-<h3 align="center">
-<a href="https://www.pritamsarkar.com">Pritam Sarkar</a>
-&nbsp; Aaron Posen
-&nbsp; Ali Etemad
-</h3>
-
-<h3 align="center"> 
-<a href="https://arxiv.org/pdf/2205.06887.pdf">[Paper]</a>  <a href="https://github.com/pritamqu/AVCAffe"> [Repository]</a>  <a href="https://pritamqu.github.io/AVCAffe/"> [Project Page]</a>
-</h3>
-
-
-This is the official code repository of AVCAffe. Please check the [project website](https://pritamqu.github.io/AVCAffe/) for additional details.
-
-
-<!-- ### Items available -->
-### TODO List
-- [x] [Paper](https://arxiv.org/pdf/2205.06887.pdf)
-- [ ] Database release
-- [ ] Database license agreement
-- [ ] Instruction to download
-- [ ] Dataloader code
-
-
-Coming soon...
+<h1 align="center"> 
+AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
+</h1>
+
+<h3 align="center">
+AAAI 2023.
+</h3>
+
+<h3 align="center">
+<a href="https://www.pritamsarkar.com">Pritam Sarkar</a>
+&nbsp; <a href="">Aaron Posen</a>
+&nbsp; <a href="">Ali Etemad</a>
+</h3>
+
+<h3 align="center"> 
+<a href="https://arxiv.org/pdf/2205.06887.pdf">[Paper]</a>   <!-- change with aaai link -->
+<a href="./docs/assets/files/avcaffe_supp.pdf"> [Appendix]</a> 
+<a href="https://arxiv.org/pdf/2205.06887.pdf"> [ArXiv]</a> 
+<a href="https://github.com/pritamqu/AVCAffe"> [Code]</a>  <a href="https://pritamqu.github.io/AVCAffe/"> [Website]</a>
+</h3>
+
+
+This is the official code repository of AVCAffe. Please check the project website: https://pritamqu.github.io/AVCAffe/ for additional details. Before you download and/or use the AVCAffe Dataset, please make sure you have read the Database License Agreement available here: [DLA](./LICENSE.html).
+
+<!-- ### Items available -->
+### Updates
+- [x] Paper
+- [x] Database license agreement
+- [x] Instruction to download
+- [x] Dataloader code
+- [ ] Database release
+    - [x] Audio-visual recordings, ground truths, and additional meta data
+    - [ ] Face-crops (coming soon, pending due to some technical issues)
+
+
+
+### Overview
+
+AVCAffe is hosted in [borealisdata.ca](https://borealisdata.ca/) under [Queen's University Dataverse](https://borealisdata.ca/dataverse/queens). Please follow the instructions below to download the dataset.
+
+The directory structure of the dataset is as follows. 
+
+```    
+    ├── ..                              # total size = 34.7 GB
+    ├── ground_truths                   # size = 192 KB
+    ├── info                            # size = 552 KB
+    ├── videos
+    │   ├── per_participant_per_task    # size = 13.9 GB
+    │   │   ├── aiim001
+    │   │   ├── aiim002
+    │   │   ├── ...
+    │   │   ├── ...
+    │   │   └── aiim108
+    │   └── shorter_segments            # size = 20.3 GB
+    │       ├── aiim001 
+    │       ├── aiim002
+    │       ├── ...
+    │       ├── ...
+    │       └── aiim108
+    └── images                         
+        └── shorter_segments_face    # size = 41.7 GB
+           ├── aiim001
+           ├── aiim002
+           ├── ...
+           ├── ...
+           └── aiim108
+
+```
+
+- `ground_truths` contains the self-reported ground-truths for affect and cognitive load.
+- `info` contains additional meta data, e.g., train-val split, pre-study responses, etc. Please find details below. 
+- `videos/per_participant_per_task` contains the full-length videos of each participant for each task. Videos are 2.5-10 minutes long, with a resolution of 640x360 pixels, in `.mp4` format.
+- `videos/shorter_segments` contains segmented clips of the same videos in `per_participant_per_task`. Each clip is approximately 6 seconds long, with the shorter side resized to 256 pixels, in `.avi` format. Note that the shorter clips are prepared for easy and efficient training of deep learning models.
+- `images/shorter_segments_face` contains the face crops of the participants. To train the baseline models we use just the face crops instead of the full frames, which works better (at least for simple models).
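After downloading, it can help to confirm that the local copy matches the layout described above. The following is a minimal sketch, not part of the repository: the function name `count_clips_per_participant` is hypothetical, and it assumes clips sit somewhere below one folder per participant (`aiim001` ... `aiim108`) as the tree suggests.

```python
from collections import Counter
from pathlib import Path


def count_clips_per_participant(root, subdir="videos/shorter_segments", suffix=".avi"):
    """Count files ending in `suffix` below each participant folder.

    Hypothetical helper: it assumes the layout shown in the README tree,
    i.e. <root>/<subdir>/<participant_id>/... with one folder per participant.
    """
    base = Path(root) / subdir
    counts = Counter()
    if not base.is_dir():
        # Nothing downloaded yet (or a wrong root): return an empty result.
        return counts
    for participant_dir in sorted(p for p in base.iterdir() if p.is_dir()):
        # rglob also finds clips in nested folders, in case clips are
        # grouped per task inside a participant folder (an assumption).
        counts[participant_dir.name] = sum(
            1 for f in participant_dir.rglob(f"*{suffix}") if f.is_file()
        )
    return counts
```

Calling `count_clips_per_participant("path/to/dataset")` returns a mapping from participant id to clip count. Passing `subdir="videos/per_participant_per_task"` and `suffix=".mp4"` checks the full-length videos in the same way.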
+
+### How to Request Access?
+
+
+**Step 1:**
+To access this dataset you must have an account in the Dataverse using an institutional email address. 
+Please go to this link to create an account: https://borealisdata.ca/loginpage.xhtml. 
+
+**Step 2:**
+Once you have an account and are signed in, open this link in the same browser: https://borealisdata.ca/dataverseuser.xhtml?selectTab=apiTokenTab. It will show your `API Token`. Alternatively, you can click on the drop-down list shown under your name (top right corner) and select `API Token`. Please note down the `API Token`, which will be required in a later step.
+
+**Step 3:** 
+Once the account is created, please go to this link: https://borealisdata.ca/dataset.xhtml?persistentId=doi:10.5683/SP3/PSWY62. **Select all 1,918 files in this dataset** and click **Request Access**. Please read the **Terms of Use** and **Terms of Access**, then click **Accept** to submit the request. Please see the screenshot below.
+
+![step 3](./docs/assets/images/request_access.png)
+
+**Step 4:**
+Your request to access this dataset has now been submitted. Approval may take 5-7 days. If your access has not been granted by that time, please write us an email at [email protected] and/or [email protected]. Once your request is approved, please proceed to the next step.
+
+### How to Download?
+
+**Step 1:** 
+We provide a script to download the dataset. Create a directory where you want to download the dataset and go to that directory. For example, run `mkdir avcaffe` and `cd avcaffe` from a terminal.
+
+**Step 2:**
+Please run the following commands to clone the GitHub repository into your current location.
+```
+git clone https://github.com/pritamqu/AVCAffe.git
+cd AVCAffe
+```
+
+**Step 3:** 
+Please open the `codes/downloader/downloader.py` using a text editor. Please update the API_TOKEN variable with your `API Token` noted earlier.
+
+**Step 4:** 
+Next, please ensure you have the required packages installed. You can install them by running:
+```
+pip install -r codes/downloader/requirements.txt
+```
+
+**Step 5:** 
+You can download the entire dataset by simply running the following command:
+```
+python codes/downloader/downloader.py
+```
+
+**Step 6:**
+Congratulations! The dataset is downloaded.
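The contents of `codes/downloader/downloader.py` are not shown in this diff, so the following is only a sketch of how an `API Token` is typically passed to a Dataverse installation, not the script's actual implementation. It assumes the standard Dataverse endpoints (`/api/datasets/:persistentId/` for the file listing and `/api/access/datafile/{id}` for a single file) and the `X-Dataverse-key` header. The function names are hypothetical, and the requests are only built, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "https://borealisdata.ca"
PERSISTENT_ID = "doi:10.5683/SP3/PSWY62"  # DOI taken from the access-request link


def build_dataset_request(api_token, base_url=BASE_URL, persistent_id=PERSISTENT_ID):
    """Build (but do not send) a request for the dataset metadata and file list."""
    query = urlencode({"persistentId": persistent_id})
    url = f"{base_url}/api/datasets/:persistentId/?{query}"
    # The token travels in a header, so it does not end up in the URL.
    return Request(url, headers={"X-Dataverse-key": api_token})


def build_file_request(file_id, api_token, base_url=BASE_URL):
    """Build (but do not send) a request for one data file by its numeric id."""
    url = f"{base_url}/api/access/datafile/{file_id}"
    return Request(url, headers={"X-Dataverse-key": api_token})
```

A request built this way can be passed to `urllib.request.urlopen` once access has been approved. Without an approved request, restricted files are expected to be refused by the server regardless of the token.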
+
+
+### Dataloader
+
+We provide supporting code for easy access to the dataset. A dataloader written in PyTorch is available in `avcaffe/codes/dataloader/`. A minimal usage example is shown below, and detailed usage is presented [here](./codes/dataloader/README.MD):
+
+```
+db = AVCAffe(ROOT,
+             subset='train',
+             return_video=True,
+             video_clip_duration=2,
+             video_fps=16.,
+             return_audio=True,
+             audio_clip_duration=2,
+             audio_fps=16000,
+             return_labels=True,
+             class_name='mental_demand',
+             mode='clip',
+             clips_per_video=1,
+             )
+
+sample = db.__getitem__(1)
+
+```
+
+### Additional Details
+
+We list some of the additional details here:
+- the participants used for the training and validation splits are listed in `info/train.txt` and `info/val.txt`, respectively.
+- the outcomes of the pre-study questionnaire are available in `info/prestudy_response.csv`.
+- the list of participants who agreed to the use of their faces/images/videos in articles or accompanying media content is available in `info/public_face_ids.txt`.
+- some of the clips available in `shorter_segments` have no speech, because the participant was listening to the other participant, thinking, or trying to solve the task. The file ids of such clips are available in `info/no_audio_files.txt`.
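The files listed above can be read with a few lines of standard-library Python. This is a minimal sketch under two assumptions that the README does not confirm: that the `.txt` files hold one id per line, and that the ids in `info/no_audio_files.txt` use the same form as the clip ids you compare them with. The helper names are hypothetical.

```python
from pathlib import Path


def read_ids(path):
    """Return the non-empty, whitespace-stripped lines of a text file.

    Assumes one id per line (not confirmed by the README).
    """
    with open(path, encoding="utf-8") as fh:
        return [line.strip() for line in fh if line.strip()]


def load_splits(info_dir):
    """Read the participant ids for the train and validation splits."""
    info_dir = Path(info_dir)
    return {
        "train": read_ids(info_dir / "train.txt"),
        "val": read_ids(info_dir / "val.txt"),
    }


def drop_silent_clips(clip_ids, no_audio_ids):
    """Keep only the clip ids that are not listed as having no speech."""
    silent = set(no_audio_ids)  # set lookup keeps the filter linear
    return [clip_id for clip_id in clip_ids if clip_id not in silent]
```

For example, `drop_silent_clips(my_clip_ids, read_ids("info/no_audio_files.txt"))` would remove the no-speech clips before audio-based training, provided the id formats match.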
+
+
+### Citation
+If you find this repository useful, please consider giving it a star :star: and citing it using the BibTeX entry below:
+```
+@misc{sarkar2022avcaffe,
+title={AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work},
+author={Pritam Sarkar and Aaron Posen and Ali Etemad},
+year={2022},
+eprint={2205.06887},
+archivePrefix={arXiv},
+primaryClass={cs.HC}} 
+```
+
+### Question
+You may directly contact me at <[email protected]> or connect with me on [LinkedIn](https://www.linkedin.com/in/sarkarpritam/).
+
+
@@ -0,0 +1,8 @@
+# AVCAffe Dataloader
+
+We provide two dataloaders, available in `avcaffe_vid.py` and `avcaffe_img.py`.
+- Use the dataloader in `avcaffe_vid.py` if you want to load the raw video and audio. 
+- Use the dataloader in `avcaffe_img.py` if you want to load the facial crops and audio. 
+
+## TODO
+- [ ] Detailed usage examples of the dataloaders.