This is a project for processing TronLab data using PySpark for large-scale data processing and feature extraction. The project includes multiple Python scripts, each performing specific data processing tasks.
checkCreateContractAnd0.py
convertLabeledAddress.py
convertTronAddressToEvmAddress.py
deleteNoBlackInBlackFeatures.py
findSameAdressBetweenTwoCsv.py
get_easy_features.py
get_edge.py
get_features.py
get_other_features.py
get_white_future.py
getWhiteFromAllEasyfeature.py
graph_features.py
groupbyTransType.py
simpleWeb3ApiCall.py
- checkCreateContractAnd0.py: Calculate the number of createContract.
- convertLabeledAddress.py: Convert Tron addresses to EVM addresses.
- convertTronAddressToEvmAddress.py: Provide a function to convert Tron addresses to EVM addresses.
- deleteNoBlackInBlackFeatures.py: Delete features not in the blacklist.
- findSameAdressBetweenTwoCsv.py: Find the same addresses between two CSV files.
- get_easy_features.py: Extract simple features.
- get_edge.py: Extract edge data.
- get_features.py: Extract various feature data.
- get_other_features.py: Extract time-related feature data.
- get_white_future.py: Deprecated script.
- getWhiteFromAllEasyfeature.py: Extract whitelist features from all simple features.
- graph_features.py: Calculate graph features.
- groupbyTransType.py: Group and count transactions by type.
- simpleWeb3ApiCall.py: Simple Web3 API call test.
- Python 3.11
- PySpark
- Web3.py
-
Clone the repository:
git clone https://github.com/yourusername/TronLabDataProcessing.git
-
Install dependencies:
pip install -r requirements.txt
-
Run the desired script:
python checkCreateContractAnd0.py
Feel free to submit issues and contribute code! Please ensure all tests pass before submitting a PR.
This project is licensed under the MIT License. See the LICENSE file for details.
Thank you for using the TronLab Data Processing project! If you have any questions or suggestions, please feel free to contact me.