Skip to content

Commit be31feb

Browse files
committed
added Simple Word Frequency in Excel and renamed from Data_integration_tips.md
1 parent f4b4a20 commit be31feb

File tree

1 file changed

+26
-0
lines changed

1 file changed

+26
-0
lines changed

Data_integration_tips.md renamed to Data Analysis.md

Lines changed: 26 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,6 @@
11

2+
# Integration
3+
24
## Peak email output periods
35

46
Sending emails is not always a sign of productivity,
@@ -18,6 +20,7 @@ using Power Query, and filter the emails from the Sent Items folder.
1820
* Filters: Folder Path
1921
* Rows: DateTimeSent
2022

23+
## Chunking
2124

2225
### Group data into time blocks
2326

@@ -40,3 +43,26 @@ are outlined clearly in the the article https://www.excelcampus.com/charts/group
4043
* Add Column / Custom Column
4144

4245

46+
## Anonymisation
47+
48+
See [Lubuild / data-extraction](https://github.com/artmg/lubuild/blob/master/help/manipulate/data-extraction.md#anonymisation)
49+
50+
## Frequency
51+
52+
### Simple Word Frequency in Excel 
53+
54+
Takes a column of cells each containing a short line of text and returns a frequency count in descending order
55+
56+
- Row 1 column headings 
57+
- Text Words Frequency Range 
58+
- Range formula in D2:  
59+
- a2:a1000 
60+
- Words formula in B2: 
61+
- =UNIQUE(TOCOL(IFERROR(REDUCE(,INDIRECT($D$2),LAMBDA(x,y,VSTACK(TEXTSPLIT(x, " "),TEXTSPLIT(y, " ")))),"-"))) 
62+
- Frequency formula in C2: 
63+
- =SUM((LEN(INDIRECT($D$2))-LEN(SUBSTITUTE(INDIRECT($D$2),B2,"")))/LEN(B2)) 
64+
- Copy the C cells down 
65+
- Credit for technique to: [https://www.get-digital-help.com/excel-udf-word-frequency/](https://www.get-digital-help.com/excel-udf-word-frequency/)  
66+
- Copy Paste Special Text Only into separate sheet and Sort 
67+
68+
This is a ‘clever’ technique, but as with many Spreadsheet ‘coding via formula’ solutions, very inefficient and it begins to fail once you get over several thousand cells in your range.

0 commit comments

Comments
 (0)