fbpx

Exploring ChatGPT’s Capabilities & Limitations in Prior Art Search

In a world where ChatGPT, the cutting-edge AI language model by OpenAI, has rapidly transformed industries, the intellectual property (IP) sector is no exception. Its remarkable ability to extract profound insights from vast data pools raises intriguing questions about its impact on the IP industry, particularly in prior-art searches.

As research analysts, our curiosity led us to compare ChatGPT to our manual prior-art search method. Our hypothesis hangs in the balance: ChatGPT may offer speed but could encounter limitations, most notably restricted access to patent data. 

Are we about to reveal an accurate prediction or debunk our own theory? 

Let’s dive in and find out.

Direct Challenge to ChatGPT: Can It Dig Up Prior Art when given the patent number?

In the first part of exploring chatGPT, we tried being direct with it. We asked if it could find prior art for a given patent number.

Now that we were sure that ChatGPT could not give up as its been programmed to not get into legal conversations, we tweaked our methodology.

Using ChatGPT when performing various steps of prior art search

In the second part of using ChatGPT for patent search work, we wanted to leverage it for specific parts of prior art searches. So, we tried dividing the steps performed by the patent researcher into the following:

  • understanding the patent, 
  • extracting the important keywords/terms/CPC classes based on the understanding, 
  • and finally, preparing the search formula or logic to search on the patent/non-patent databases.

Can Chatgpt Summarize A Patent If Given The Patent Number?

We picked a fairly simple patent related to parental control on a mobile phone, and as the first task, we asked ChatGPT to summarize the patent.

But before that, we briefly summarised the patent manually, focusing on the problem statement and not only the claim features. 

This patent can be summarised as Implementing a guardianship function at a mobile terminal to enable the guardian to learn the use of the mobile terminal in real-time. The function includes a module for providing guardianship information and a module for processing short messages.”

Based on our summary, our aim was to see if ChatGPT would also extract the problem statement and key solution of the patent while writing the summary. On giving the prompt to ChatGPT, it did produce the key features from the patent. 

However, what we received was a broad summary of the patent. It failed to highlight the novel or the most important key feature of parental control from the patent. I think it was considering it as a text document and hence summarizing it based on its training history.

https://lh3.googleusercontent.com/-ZCJco6fcw-TUzZN8xgJqJCLwLgLIDBIj6w5fKPETAYRCDL5YMsQjzlLatFRP1eql5RaVsx5uEFjFnDKr9cJTXsCZ1vgNSt41KOzZWWCBzapg4Sm5KugO1hUZr3dcD8jDOarqh2HdZROQ8xd4-c_TNs
https://lh4.googleusercontent.com/yex9gKJ1GcXtc85T3WSJWloLPkFNvSzZz-p7D1_dmJPCkC8k6WEis08Fimo7Q_TRqjAPtInDr8_29oQxshs-0t-x_p88oY-fCugj2tC7Nx-AujpZL1Y4NUXhsRWN1ihJDDGCPyt1W-sjPQwjsOxzkD4
https://lh3.googleusercontent.com/ktDwyC1gf7R4Pw8VLH2PVzeLHzAH3jzTOLoEx55pCZkpdIgOpakVci_xLDScF6fkuHuq7LoiW4iNudGhl26TZzAURQz4neygJx_eQRpyAlSDD_J0hglecj06RKSoALHHR31n3hB2u8u_h7PNIDrxWN4

To take it a step further, we improved our prompt and provided ChatGPT with a detailed description of the claims from the same patent and asked for an explanation of the key terms. 

ChatGPT delivered a module-by-module summary and an overall summary that mentioned the invention focus, which closely aligns with the solution described in the patent. Overall, ChatGPT’s performance was impressive after the tweaked prompt. Basically you have to train it to see the things where you want him to focus.

Can Chatgpt Search For Relevant Patent Keywords And Classes?

Just like the previous task, asking a direct question wasn’t helpful in this case either. ChatGPT provided domain keywords such as data mining, content recommendations, and recommendation systems, among others, without mentioning parental control at all.

During the patent search, finding relevant keywords depends on how well we understand the patent, and eventually, this leads to better search strings. Hence, as the second prompt, we shared the Claim of the invention with ChatGPT and then asked it to churn out some relevant keywords.

In this case, it gave us essential keywords like threshold and timer. However, some important terms, like parental control and guardian control, were still not captured. 

https://lh5.googleusercontent.com/CptN8R-vWEgV36OrWO4dL9R64MxzDEl_RGvKsAD-xWaHlWlrPvIBlBQjTw7t_N4jPPbJldlzhdpbYoVdcBZaLeY5ZbUzVkM_9_nLn83hQl4hQliTBqpHGXDreDMzl1ixMkigtuTJYEqlBR71jwzBFuw
https://lh5.googleusercontent.com/tD64AizLC5vFTuWXnEpdCNw_2nWVvkrzJ2KmpqPgwGRoVJMoQRXNnSa2Nl0tJ5yjcZqElyO0KA4Znxqppq9MuZHFfIWoQlf_ncZ9E2QN3n88jfy9RsnDyFGP7_1EunLdqcUxIsU-KDh9q_XB3YcP7CI

Looks like ChatGPT was not able to deduce the purpose of the invention and focussed more on what was directly stated in the shared claim.

As we saw earlier, ChatGPT could not fetch the patent details when we provided it with the patent number. However, when we explicitly provided the background of the technology, ChatGPT was able to explain the problems discussed in the patent.

Therefore, we deduced that to understand the purpose of the invention disclosed in the patent, we must provide the background information from the patent to ChatGPT.

We further tested if ChatGPT could provide synonyms for these shared keywords. But if you see, most of the synonyms, just like the keywords, are generalized terms. Putting them in any search string would definitely increase a researcher’s work tenfold, as we might get too many irrelevant and repetitive results.

https://lh3.googleusercontent.com/_z1ykb7-7E_FN3Ddbh4bPlgzt0mzq3lzwT8gg6ADSsxkAWVm2Ejlv8fz_j5R5H8m14Zt_kcXrt1DpnW_uV-3IW1V95RWH6_5eym8S2ncvx0VjT790aabriBR3mf90FkbwCg8UWhEDrizNx-c0sCZH70
https://lh5.googleusercontent.com/kzEpSziu91bLDFl1klBupGEjHzRtI3agaHQzjU6meoQPtpSjjL-4T6JXRWPkS4rLFFzF6FeASu8px6Z6w11daBEU7w7VuC5AGN7I4Qf7DlgcXz5rWjqTy2ZpMxWXtjh6hKPoJyfdUi_jD0Z1iyssjrg

After keywords, we moved on to the classes. They are important in narrowing the search results to the target patent’s domain and when we tested ChatGPT to find relevant classes for the prior art search, we had a similar experience to that of keywords. 

Through our first prompt, ChatGPT only provided us with broad classes. We then requested for narrower classes to improve the search results. This time, ChatGPT presented some classes that appeared to be useful to us.

However, upon examining their definitions, we discovered that most of them were inaccurate. This is a concerning issue as it may discourage researchers from depending on the information provided by ChatGPT without verifying it from a credible source. Therefore, we recommend that individuals double-check any output given by ChatGPT, mainly when it involves specific numerical or value-based data.

Can ChatGPT make strings for Google Patent Search?

Our product development team raved about ChatGPT’s programming and coding skills, so we had high expectations in this task. We asked ChatGPT to create a search formula using keywords and their synonyms from the previous task.  Despite our high hopes, the output was underwhelming. We tried again with a different prompt, but the result was the same. 

https://lh3.googleusercontent.com/NRx7IhX1qa1pL68cRlEeKelJpNIuVpXmn3-2dhQtIOGofab-21u-US_JjkuLaKlH3xs2Wmp6OPjBBx-JLtPNLFx-lKhsT0SshgTlcLuQ7rM6i1jAnF7qma0ynTdgAK33m41C6KOomE26VeCxSyFuN_4
https://lh5.googleusercontent.com/tNd4dav56s7J7sVEP8Iq1NvxmY2Pds4_viEAMT9U_3oi9hGf19w0bO0U98sagDUxTQG95MHlPre5Vp6iEMGQP4m0M63GARUpYKbRJc2p7DfQz7AhYDas0iCb56LnxWAJaSMyW34J9z1i492eeQZrn64
https://lh6.googleusercontent.com/hWXiy2m6FsZx2NE8uRYnKONZ9JeWOCkWKJz1HcVgkkFxEnRGqnPvadH2KI4qvpCkbowQZsComt7HSdkRF-y0mlcj_DTL41O01bpnysqdtWabyKQBiW-PKHwHK8JGJSBikq-l3on4qI2YFsalHSJnUo0
https://lh3.googleusercontent.com/pz-oy5SwqSW1QIeWIbEPOdyxBdJfPoI3H_u_WpHM619zAOznaN1D80eiZ-1HTSP7sGpu3vTWHavOctxsdIiZgtH3TS39T1yCuFS5rpAsm2KLyXrmZnZTsrZRSVD5KAPA44cruTjpjQRhgEYvyKFSN9c

ChatGPT simply used a basic AND logic for all the search terms, which doesn’t make sense since the keywords are too general. For example, “mobile terminal,” “running time,” “program,” and “threshold” are all broad terms that, when combined, result in irrelevant search results. 

Additionally, ChatGPT didn’t use any specialized truncations specific to narrow down the results during Google Patents Search. To establish a comparison between our manual search strings, here are a few strings that we would prepare to target the prior-art search of the respective patent.

(Mobile Terminal* OR smartphone* OR UE*1 OR terminal* OR user device*1 OR handset*1) NEAR/4 (parental control* OR guardianship function* OR child lock* OR child caring OR child care* OR guardian* control*)

(mobile application* OR app*1 OR software* OR android application* OR IOS application*) NEAR/4 (parental control* OR guardianship function* OR child lock* OR child caring OR child care* OR guardian* control*)

If you see ChatGPT’s output, it misses major keywords that are based on the problem statement and solution aspect. It also misses truncations such as” terminal*” or “device*,” that help capture various documents. Additionally, we have used the NEAR/4 operator to narrow down the result sets, which ChatGPT won’t be able to prepare without an elaborate prompt.

Basically again its training data does not include much examples of building search strings which is why we can expect him to not perform that well.

Conclusion

Does this mean that ChatGPT can’t be used for patent searches? The answer is not straightforward. 

In theory, ChatGPT can be helpful for certain aspects of prior art searching. For instance, it can quickly explain patent claims and related keywords. It can also be used for surface-level searches, giving you a good starting point. However, the results you get from ChatGPT depend on the prompt you give it. To ensure accurate results, it’s best to provide your interpretation of the claims and some parts of the description. 

But there’s a catch. ChatGPT is not suitable for highly technical patents, such as those related to telecommunication or audio/video coding. It fails to understand the essential concepts of the invention and generalizes the patent, which is unacceptable for narrow levels of technology. In such cases, ChatGPT cannot replace a manual search team. 

ChatGPT has limitations because it doesn’t have access to databases, and its data is not in real-time. Therefore, AI lags way behind when finding prior art in non-patent literature. However, we are closely examining how AI will evolve in the coming years. 

But until then, manual prior art searches still have the upper hand. See GreyB’s prior art search team in action.

Click the button below to get in touch with us.

Get in touch

Authored By: Rutwik and Avantika, Prior-Art Team

Edited By: Annie Sharma, Editorial Team

Leave a Comment

Become a part of GreyB’s insider list

Get our distilled learning delivered to you.

Get the Sample Report

Fill out the form and get the report.