Chapter 473 Running into the arena(1/3)
"Gather resources from all over the world to share the happiness of you, me and him..."
Listening to Zheng Qiu slowly reading out this product introduction, Yu Hua couldn't help but chew it.
"Hmm! It does sound like some kind of resource sharing platform?"
Zheng Qiu was speechless, rolled his eyes and pointed at a line of small words below.
"How can such an awesome company be so superficial! Look below!
Select high-quality knowledge resources to promote the reform of educational methods, improve the quality of talents, and promote the sharing of wisdom.
Carry forward Chinese culture and absorb world civilizations.
Accelerate the pace of entering a knowledge-based society - good news for the majority of students!
This pattern... sounds so big when you hear it!"
"A certain dating website even advertises that it wants to promote human reproduction and continue the civilization of the earth. Who wouldn't do it if it's big enough?"
"Hmm... a dating website?"
Zheng Qiu stared at the older bachelor in front of him suspiciously.
"Ahem! These are all small details. So, let's go in quickly and see if there are any surprises. This is produced by Bai Yeji!
Hmm... It says that you can use Tianshu ERP or Bajie's account to complete the registration simultaneously, but we don't have either.
If you are a new user, then personal account registration requires real-name authentication, wait a moment..."
So, register...enter your ID number...bind your mobile phone number...
After some operations, Yu Hua obtained an initial account.
After logging in, you immediately switch to a new interface that looks very simple.
On the left is an information bar. In addition to your personal name and information to be edited, there is also something called "Smart Coin", the current amount is 5.
There is nothing on the right side, just a solitary search box, with Bai Yeji's robot logo in the background.
"How to use this thing?"
"Since it is a knowledge sharing platform, it should be something similar to a search engine. Let's try entering a search item first."
So Yu Hua casually entered "journal papers related to artificial neural network (ANN) random forest algorithm"...
This is also his current research topic.
"Brush!" The interface changed.
Ten relevant papers and their introductions appeared in front of the two people one by one.
"Huh? It's not bad. The first few articles above are authoritative papers with more citations and higher weight in this field...
Look! The 10th article is still your paper from last year!" Yu Hua said in surprise.
Zheng Qiu grabbed the mouse and clicked on his paper. Sure enough, he had written it correctly. The author and publication time were clearly marked on it.
Looking at his work, Zheng Qiu nodded awkwardly.
"Well! It seems that this search still has some discernment!"
Yu Hua was too lazy to pay attention to this narcissist, so he clicked on the next page and continued to check other papers in the search order 10 to 20. Sure enough, he found several more familiar industry authorities.
Until the 100th article, almost no article is invalid "hydrology".
In terms of retrieval efficiency and effect, it is by no means worse than that of a spider web.
And what is surprising is that when other websites retrieve journal articles, as long as the year is slightly older, they are usually graphic versions, which are scanned with a camera.
Because computers were not widely popular in the past, most papers were only archived on paper.
Even if computers became popular later, retyping each article would still be an extremely huge and vast project, so it could only be scanned and electronically archived.
However, the clarity is like reading an old newspaper on a computer, which greatly affects the reading experience.
For example, there was an article "Random Vector Learning Model". He clearly remembered that it was a scanned document in the spider web search database.
However, what is displayed here is a clear and complete electronic file.
Even the tables, pictures and texts in the article have been electronically reproduced to a high degree of restoration of the original work, making it clear at a glance.
Even if the original author reads it, I'm afraid he can only say "impeccable"...
"These guys must have optimized all the old database documents..."
An extremely incredible idea suddenly popped into Zheng Qiu's mind.
This kind of project cannot be completed by a small amount of manpower. The only possibility is to rely on automated programs for batch image-text conversion and recognition...
If this is really the case, then the image-to-text conversion effect is simply explosive!
"This thing is said to be able to detect plagiarism, but I don't know how effective it is... I'll give it a try."
Yu Hua did not forget his original purpose. He immediately found an already reviewed master's thesis from the computer and directly dragged it into the dialog box according to the prompts...
[Would you like to spend 1 smart coin for plagiarism check service? Yes/No]
"It turns out that it costs site currency... 1 coin per time, so it seems that every new user has 5 free opportunities to check plagiarism? Not bad."
Yu Hua was still somewhat satisfied.
Although it's not completely free, it's still better than those who just pop up the payment code directly.
Select "Yes", and the next moment, a progress bar will pop up on the screen.
After about 3 minutes, the progress bar slowly stretched to the end, and finally a large number popped up - the repetition rate was 69.3%!
"What? 69.3%?" Yu Hua looked a little surprised.
"What's the matter?"
"I used Spider Web to check this paper for plagiarism, and the final result was 36.2%...Here, this is Spider Web's plagiarism check report."
Yu Hua looked for it from the desk next to him and handed over a piece of A4 printing paper.
When the spider web duplication checking system detects the content of a paper, it will compare the paper with its own system library.
If it is found that 13 characters appear in a row, that is, seven or eight Chinese characters are similar, it will be judged as a duplication, the duplication rate will be calculated, and the duplicate-checked data will be displayed in the final report.
At the same time, repeated content will be displayed in red font in the text, and relevant documents cited in this paragraph will be marked next to the repeated area.
In general, although Spider.com is expensive, in terms of search results, the service is quite good.
Zheng Qiu looked at the duplication check report in his hand, then looked at the high duplication rate of 69.3% given on the computer, and frowned.
"If nothing else, the speed of duplication checking is a bit unbelievable. Isn't it just a fortune teller on the Internet... playing a random trick?"
The progress bar just now, even if it is full, it does not take more than 3 minutes.
3 minutes may seem like a long time, but compared to the large-scale literature database search volume, it is incredibly fast!
You should know that a single duplication check on Spidernet generally takes 30 to 60 minutes, and may even exceed 2 hours during the peak graduation period.
In comparison, such a "duplicate check" is time-consuming and somewhat childish.
What kind of computing speed and retrieval algorithm can search such a huge literature database in such a short time?
"Impossible... Such an excellent company, and they also got the result of a plagiarism check."
Yu Hua said and clicked the "Duplicate Check Report" button below.
The next moment, the two people in front of the computer were stunned.
On the screen, more than half of the paper is highlighted in red, with cited documents and corresponding jump links hanging next to it.
It's so dense that it doesn't look like a random fabrication...
"Is it true?" Zheng Qiu was a little dumbfounded.
He picked up the copy of the spider web plagiarism check report and began to compare it line by line.
"There is this one, there is this one too...hiss~all hits!"
The duplicate paragraph annotations and cited documents retrieved by the spider web are all available here, and links to the documents are thoughtfully provided, so the authenticity can be seen at a glance.
So many of these...
Through comparison between the two, a shocking conclusion was drawn.
That is the database searched by this wisdom tree...it is actually more comprehensive than a spider web!
Generally speaking, the databases of the paper plagiarism check system mainly include "academic journal databases", "dissertation databases" and "Internet databases".
Among them, "Internet database" is the most complex, referring to a large number of Internet information resources such as web pages, blogs, and forums.
The paper duplication check system will use the Internet database as an important comparison to detect whether there is similar content in the paper that has been publicly released on the Internet.
"Damn it! Where did the other party get such a huge database resource?" Yu Hua asked in confusion.
He seemed to react the next moment...
"Could it be that it's a crawler program?"
There is an awesome programmer abroad who has crawled the Internet public resources of more than 600 million websites in the world by writing powerful crawler programs!
To be continued...