GitHub’s popularity in the open source world goes without saying. During the pandemic I stayed at home all day and got a little bored. A few days ago, while browsing GitHub, I came across a rather good NLP community (the link is github.com/intersystem…). It seems to have opened only recently and it really stands out, so I’d like to share it with you…

You can click through to browse the relevant open source code directly.

This new community hosts the NLP technology provided by InterSystems iKnow, which can be easily embedded into applications built on the InterSystems IRIS data platform and used to search and explore use cases. Applications and business processes that are not based on the IRIS data platform can use it too, so it is open to ordinary developers: both existing InterSystems customers and everyone else, which is great. Users can also customize the technology, and tune and extend the language models themselves. With companies continuing to cut budgets while demanding ever more automation, I believe this has considerable appeal.

In fact, InterSystems iKnow identifies the phrase boundaries that define entities based entirely on the syntactic structure of the sentence. Because it does not rely on preset dictionaries or pre-trained models, it is well suited to the initial exploration of a new corpus. iKnow entities are not named entities in the NER sense; they are phrases that should be considered as a whole and that represent the concepts or relationships expressed by the author of the text. The iKnow engine has been offered as an embedded service on the InterSystems IRIS data platform for many years. Packaging it with a database has the distinct advantage that the indexing results from many documents are stored in a single repository, which supports analysis across the entire repository through the API (I won’t go into details here). You can study the IRIS iKnow documentation when you have time.
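To make the idea of concept and relation entities more concrete, here is a minimal sketch in Python. It assumes the iknowpy package (the Python interface to the engine, which this post does not cover) is installed, and that the indexing result exposes a list of sentences, each with an "entities" list whose items carry "type" and "index" keys. That layout is my recollection of the project README, so please check it against the current documentation. The helper names index_text and render_sentence are made up for illustration.

```python
def index_text(text, language="en"):
    """Index text with the iKnow engine and return its list of sentence dicts.

    Assumption: iknowpy is installed (for example via `pip install iknowpy`)
    and exposes iKnowEngine, index() and m_index as used here.
    """
    import iknowpy  # imported lazily so the helper below works without it

    engine = iknowpy.iKnowEngine()
    engine.index(text, language)
    return engine.m_index["sentences"]


def render_sentence(sentence):
    """Wrap each entity of one sentence in a tag named after its type.

    `sentence` is assumed to be a dict with an "entities" list, where each
    entity has a "type" (such as "Concept" or "Relation") and an "index"
    (the normalized text of the phrase).
    """
    return " ".join(
        f"<{entity['type']}>{entity['index']}</{entity['type']}>"
        for entity in sentence["entities"]
    )
```

With iknowpy installed, something like `for s in index_text("The iKnow engine is written in C++."): print(render_sentence(s))` should print every phrase with its type, which shows that the engine returns whole phrases rather than dictionary-matched named entities.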

As for building iKnow, the source code of the iKnow engine is written in C++, and it includes .sln files for building with Microsoft Visual Studio 2019 Community Edition and Makefiles for building on Linux/Unix. If you have a good idea, you can also edit the engine code and language models and publish your changes there. Let’s learn more together!

More information about InterSystems iKnow can be found in the open source documentation below.

The NLP community is also a great source of tools and resources for those who need them.

Here are some tips on how to use GitHub, which I’ve researched with friends, figured out on my own, and tested again and again. When you search for code on GitHub, what do you do? Do you type what you’re looking for into the search box and then scroll through the results page after page, like in the picture below?

No, no, no! A more detailed and specific search gets you to the right result much faster. Method 1: search precisely within the repository name, description, and README. Method 2: search for repositories whose star and fork counts are above a given value. Method 3: search for repositories by size. More methods are still being worked out. Trust me, this is a lot easier than simply searching for a keyword and paging through the results.
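The three methods above map onto GitHub’s search qualifiers: in:name, in:description and in:readme for Method 1, stars:>N and forks:>N for Method 2, and size:>N (in kilobytes) for Method 3. Here is a small Python sketch that assembles such a query and turns it into a search URL. The function names build_query and search_url and the example thresholds are my own inventions for illustration; only the qualifiers themselves come from GitHub.

```python
from urllib.parse import urlencode


def build_query(keyword, in_fields=(), min_stars=None, min_forks=None, min_size_kb=None):
    """Combine a keyword with GitHub repository search qualifiers."""
    parts = [keyword]
    if in_fields:
        # Method 1: restrict the match to name, description and/or readme
        parts.append("in:" + ",".join(in_fields))
    if min_stars is not None:
        # Method 2: only repositories with more than this many stars
        parts.append(f"stars:>{min_stars}")
    if min_forks is not None:
        # Method 2: only repositories with more than this many forks
        parts.append(f"forks:>{min_forks}")
    if min_size_kb is not None:
        # Method 3: only repositories larger than this size in kilobytes
        parts.append(f"size:>{min_size_kb}")
    return " ".join(parts)


def search_url(query):
    """Return a github.com repository search URL for the given query."""
    return "https://github.com/search?" + urlencode({"q": query, "type": "repositories"})


query = build_query("nlp", in_fields=("name", "readme"), min_stars=100)
print(query)
print(search_url(query))
```

You can also type the printed query straight into the GitHub search box; the URL is only a convenience for sharing a search with friends.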

OK, let’s get back to InterSystems iKnow. If you’re interested, you can find out more through the link at the top of this post, and don’t forget to share what you find with us.