Compatibility with string data types #74
Open
AndrewWilkins84 wants to merge 3 commits into
|
This pull request introduces 1 alert when merging 60dde27 into 5ef2382 - view on LGTM.com new alerts:
|
Contributor
|
@AndrewWilkins84 thanks for the PR! I'll review shortly. In the meantime, can you take a look at why the tests are failing? The Travis CI logs indicate that it may be a relative importing issue. |
Contributor
|
Also, would you have any interest in creating a Jupyter notebook that highlights how to use Matrix Profile with string data types? I think the community would be very interested, and we could easily convert it into a blog post for the MPF website. |
Author
|
Hi Andrew,
I'm pretty sure the unused error is numba in mpx_char.py. I wanted to perform parallel processing on that function but never got around to it since the solution I developed works for my immediate application.
As for creating a Jupyter Notebook, I'll have to decline for now. I just don't have the bandwidth to take on any more tasks on top of my job workload. To run what I have, use the following in a Jupyter Notebook:
import numpy as np
import matrixprofile as mp

a = np.array(list('abcdababcdddabcd'))
p = mp.compute(a)
p
That should do it.
Very respectfully,
Drew Wilkins
On Tuesday, January 26, 2021, 06:43:44 PM PST, Andrew Van Benschoten <[email protected]> wrote:
Also, would you have any interest in creating a Jupyter notebook that highlights how to use Matrix Profile with string data types? I think the community would be very interested, and we could easily convert it into a blog post for the MPF website.
|
Contributor
|
No worries - thanks for your contribution! Hopefully I can get this in
shortly.
On Wed, Jan 27, 2021, 11:05 AM AndrewWilkins84 <[email protected]>
wrote:
|
The proposed changes allow mp.compute() to work with string data types. mp.analyze() still needs work, however, since calls to mass2() are used when discovering motifs and/or discords; a new function could easily bypass this behavior and produce results comparable to https://www.cs.ucr.edu/~eamonn/PAN_SKIMP%20%28Matrix%20Profile%20XX%29.pdf (page 3).

Additionally, the documentation suggests that the minimum window size should be 4. I have changed several of the functions I came across to reduce the minimum window size from 8 to 4, consistent with the documentation.
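
For readers who want a feel for what a matrix profile over string data could look like, here is a minimal NumPy-only sketch. It is not the implementation in this PR (the internals of mpx_char.py are not shown in this conversation). The function name hamming_matrix_profile, the use of Hamming distance between windows, and the exclusion zone of window // 2 are all assumptions made for illustration. The minimum window size of 4 follows the description above.

```python
import numpy as np


def hamming_matrix_profile(seq, window):
    """Brute-force nearest-neighbour profile over a character sequence.

    Hypothetical helper for illustration only; it does not mirror the PR's code.
    Distance between two windows is the number of positions where they differ
    (Hamming distance), which is an assumption, not the library's metric.
    """
    if window < 4:
        raise ValueError("window must be at least 4")
    chars = np.asarray(list(seq))
    n = len(chars) - window + 1
    if n < 2:
        raise ValueError("sequence too short for the given window")

    # Stack every length-`window` subsequence into an (n, window) array.
    subs = np.array([chars[i:i + window] for i in range(n)])

    # Pairwise Hamming distances via broadcasting: shape (n, n).
    dist = (subs[:, None, :] != subs[None, :, :]).sum(axis=2).astype(float)

    # Mask trivial matches: a window compared with itself or close neighbours.
    # The zone width (window // 2) is an assumed value.
    idx = np.arange(n)
    excl = window // 2
    dist[np.abs(idx[:, None] - idx[None, :]) <= excl] = np.inf

    # Nearest-neighbour distance and its position for every window.
    return dist.min(axis=1), dist.argmin(axis=1)


profile, index = hamming_matrix_profile('abcdababcdddabcd', 4)
# The window 'abcd' at position 0 recurs exactly at position 6.
print(profile[0], index[0])
```

This brute-force version costs O(n² · window) time and O(n² · window) memory, so it only suits short sequences; it is meant to show the idea, not to replace the optimized routine.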