6

As a student, I make mistakes all the time, and stealing my idea will not face heavy retaliation. I don't mind if someone comes up with concurrent works, but I do mind if my key idea is leaked. Many graduate students in my field therefore only use pre-print services if their papers are either publicly presented or conditionally accepted.

I am looking for a website that can do most of what ArXiv does, with an additional private option or limited visibility, for example an option to make the paper unindexed by search engines. This way, my paper cannot be seen for at least a few months. By doing so, I can at least claim that I am one of the first few researchers who got the idea right. And most importantly, I can privately correct my mistakes, if any. Publishing mistakes is embarrassing and sometimes harms one's career. Many mistakes will be revealed during the peer-review processes and I have a chance to correct them before publicizing.

Please kindly think twice before giving ethically and "politically correct" comments like no one will scoop me, everyone will cite me, and the scoopers will be punished. At least in my field, scooping and idea-stealing is commonly observable, and big-names don't care about publicizing pre-prints because they have resources to punish the scoopers. I do understand that keeping my work private might harm the community as the information becomes less transparent. However, it could also be possible that my un-reviewed flawed works misguide the community, and a paper reviewed will benefit the society more.

I personally wasted six months because of someone else's mistake in a published work.


I heard that in molecular biology there is a widely-used service to upload your research data, like protein structure and sequencing, to get a timestamp. The uploaded materials can be set to be private for a period, and fully disclosed later. A lot of people do use this function. This is not exactly what I am looking for but shares some similar ideas.

If I remember correctly, there is an interesting paper studying the scooping patterns with the aid of this database.


Solutions found! Zenodo and some other university-run pre-print services do have an embargo function.

Moreover, SSRN is a widely used repository, that, unlike arxiv, only displays the newest version to the general public, although still keeps a tag on the first submitted date!

High GPA
  • 4,442
  • 1
  • 26
  • 50
  • 35
    Hmmm. Isn't the whole purpose of ArXiv to make the papers public? Errors are a natural consequence of being human, actually. – Buffy Nov 14 '20 at 22:51
  • @Buffy I guess people also use it as a partial evidence to defend against plagiarism accusations? – High GPA Nov 14 '20 at 23:03
  • 1
    I remember several questions here discussing a very similar topic. Here’s one: https://academia.stackexchange.com/q/13216/40589 – Dan Romik Nov 15 '20 at 00:44
  • 2
    This is not a good idea, but an easy way to do it is to print your manuscript out, put it in an envelope, and have it postmarked. No website is needed. – Anonymous Physicist Nov 15 '20 at 01:33
  • 30
    Putting things on the internet is a terrible way to keep them private. – Anonymous Physicist Nov 15 '20 at 01:34
  • 25
    If you write your paper and put it in a drawer for thirty years, and somebody else publishes first, they will get the credit, even if you have proof that your paper was in a drawer the whole time. (It will invalidate any patents on the idea that somebody else files, but that probably doesn't help your academic career.) Just get your paper ready and submit it somewhere. – Peter Shor Nov 15 '20 at 02:16
  • @PeterShorp Well, I want it to be circulated within a small circle rather than the whole public, at the beginning. – High GPA Nov 15 '20 at 03:07
  • 21
    If you know who you want to circulate it to, just email it to them (possibly marked "draft—do not distribute"). If you don't, then I don't see how you expect a website could provide you with this information. – Peter Shor Nov 15 '20 at 04:11
  • 24
    Wait, wait, wait.... You want to claim credit for a result before you are sure whether it's correct?? Sorry, but that's not how this works. – JeffE Nov 15 '20 at 23:57
  • 5
    If something like this would work, then you could easily put something on the secret arxiv, circulate it to the friends, then submit it, juts to dicover then that someone else posted the same idea months before you on the same secret arxiv.... – Nick S Nov 16 '20 at 00:32
  • 3
    A private repository on GitHub would datestamp, cryptographically hash, and track changes to, your work. – jez Nov 16 '20 at 00:52
  • 1
    @PeterShor, putting it in a drawer will no longer invalidate later patents in the US, at least. The laws have changed here. – Buffy Nov 16 '20 at 14:02
  • 2
    If all you are worrying about is later claims you plagiarized and want a dated record of early work, have a notary sign and date your paper. They have legal standing. Your bank will probably do this for free and a lawyer will for a small-ish fee. – Buffy Nov 16 '20 at 14:03
  • @jez Unless I am missing something, one can rewrite git history and the timestamps associated with commits ultimately come from the committers computer. Does github record when commits were pushed? – emory Nov 16 '20 at 16:29
  • 1
    Publishing the SHA-256 of your preprint on Twitter should be enough to establish you already have it written by that time. If you want to make it fancy, publish it on some public blockchain, like Bitcoin or Ethereum. Just don't lose the exact same file you hashed. – lvella Nov 16 '20 at 17:05
  • 4
    The whole point of ArXiv is to increase transparency instead of hiding mistakes and pretending to have some superhuman ability which prevents you from going down dead ends. – Tom Nov 16 '20 at 18:22
  • 1
    You can put your paper on Zenodo and choose an embargo period during submission. You can even reserve a DOI without the paper being yet publicly visible. I think this gets you more or less what you want. – Miguel Nov 16 '20 at 23:56
  • @Miguel you are the true legend! – High GPA Nov 17 '20 at 00:51
  • @Miguel Can you modify the submitted version during the embargo? – High GPA Nov 17 '20 at 06:29
  • I'm not sure. You can probably upload a new version during the embargo, and maybe even delete the entry before the embargo is up (i.e., before public release of the record). You can check on their website. It's a free service. – Miguel Nov 17 '20 at 10:42
  • @NickS I think your example is perfectly fine as they developed the idea independently – High GPA Nov 20 '20 at 03:48
  • 1
    > SSRN is a widely used repository, that ... only displays the newest version to the general public but this is the opposite of what you wanted to achieve, as it conceals any evidence that the previous versions did already contain the novel ideas for which you are seeking some proof of priority. Earlier timestamps are only useful if the corresponding iteration of the document remains accessible. (It's also the first time I've heard of "SSRN", so a link would be nice) – Lou Knee Apr 09 '21 at 15:45
  • @LouKnee link added! I might be wrong about the description, though. – High GPA Apr 09 '21 at 22:16
  • By mistakes do you mean spelling / grammatical or more serious errors with the methodology? – JosephDoggie May 02 '22 at 18:40
  • 1
    @JosephDoggie Maybe both? – High GPA May 03 '22 at 01:15

7 Answers7

43

Basically, if I understand you correctly, you want proof of priority without publishing.

Well, this problem is well known from middle-ages and renaissance where people wanted to be able to prove they have the earliest solution without revealing what it is (so that if someone finds it, they can prove they were there first).

They often used anagrams, today you probably would use some private/public key combination for this (I am not a crypto expert, you would have to read up on this).

However, to be honest, this will look weird. The current position in science is: if you have done it, publish it and expose it to criticism, and you may gain priority. If you wish to monetize it, you can create a patent that is public, but protects your rights (in theory), or keep it a trade secret and make money out of it.

As for a website that supports timestamps for unrevealed papers, I am not aware of any, but of course if there were, you need to make sure you trust the maintainers. Furthermore, if you had one and if somebody publishes a completed paper somewhere else before your paper goes live, this will not help you. The fact that you had worked provably, say, for multiple years on the topic will at best prevent plagiarism charges when you end up publishing. But this, if it occurs, could be proven otherwise, e.g. lab diary, emails, etc. It will not make you the "first to publish". Don't forget, there are quite a few cases where people who had publications were superseded by later, but more visible publications. So, even open publications may be vulnerable to "post-scooping".

So, your question needs to be what one would need such a service for? I therefore doubt that people would set up such a system with obviously limited utility. Of course, this is not a statement that there isn't.

I understand that this does not precisely answer your question, but I hope I was able to put things in context and why your question may not have a satisfactory answer.

Captain Emacs
  • 47,698
  • 12
  • 112
  • 165
  • I don't necessarily need "first to publish". Just a valid defense against concurrent workers' potential plagiarism claims, if any. The privacy option is just an option. If I feel good about the paper, I'll make it indexed even if it is not yet accepted. – High GPA Nov 14 '20 at 23:23
  • @HighGPA How would you be accused of plagiarism? You publish too slow or what is the scenario you have in mind? – Captain Emacs Nov 14 '20 at 23:24
  • 1
    Yes I could be very slow on finalizing a paper because I sometimes care about unimportant details too much. And, the journal I chosen could be very slow in handling my paper. Does submission record serves as a creditable timestamp? – High GPA Nov 14 '20 at 23:27
  • 14
    @HighGPA Yes, in reputable journals, submission record is a very important timestamp. So, if your journal is 2 years in review (say), then you have two timestamps, one is submission, one is publication. Submission shows that you had the idea, and, in fact, the review process should help you iron out any defects. – Captain Emacs Nov 14 '20 at 23:29
  • 2
    I am not sure if this info is helpful: at least in molecular biology or biochem, there is a widely-used service to upload your research data like protein structure and sequencing to get a timestamp. The uploaded materials can be set to be private for a period, and fully disclosed later. A lot of people do use this function. – High GPA Nov 14 '20 at 23:34
  • By the way, I badly want someone else to criticize my paper. However, a criticism in private is preferred at early stages. Some of my coauthors may be very slow so I might need a timestamp. – High GPA Nov 14 '20 at 23:39
  • 13
    From the technical angle, you could use a cryptographic one-way hash function such as SHA1. Copy your text into a site like this one: https://hash.online-convert.com/sha1-generator It will pop out a string which looks like "70b557f903e46b1434dd23143f0f3e21e1c42238", which you could then make pubilc. Later, make public the text of your paper, and the hash is proof that you had it earlier. But I agree with @CaptainEmacs's advice that, at least in my field, this would look strange. – academic Nov 14 '20 at 23:42
  • 9
    @academic imho you should write your solution as an answer. I think it's a decent solution to OP's problem: yes it's weird and slightly paranoid, but if everything goes well it's never going to be used so nobody cares. If it turns out that OP actually needs to prove the timestamp on their work, it will be incredibly useful and therefore the method will be justified a posteriori. – Erwan Nov 15 '20 at 00:29
  • 7
    If you do go down the hashing route, you must be absolutely certain that the file you are preserving does not change in the intermediate period. The whole point of a hashing algorithm is that it is sensitive to small changes. So if you add some whitespace somewhere, the hash will be completely different, and you'll lose your proof. – preferred_anon Nov 15 '20 at 10:27
  • @preferred_anon you could possibly hash not the whole paper/text, but some paragraph/sentence of it which contains the gist of the idea. But I still doubt that's a lot useful. – Paŭlo Ebermann Nov 16 '20 at 00:17
  • 3
    @academic: do NOT use a website to hash, it could be hacked or even malicious; there are hundreds of programs for every possible computer since about 1970 to hash locally. But use SHA-256 or higher (standard since 2002); SHA-1 is now broken for collision (see https://shattered.io) and although that doesn't actually break this application less intelligent or informed people may not reliably make the distinction. Even better get your hash timestamped by a TSA or bitcoin, in case the published version(s) are lost or become hard to prove dating. – dave_thompson_085 Nov 16 '20 at 01:26
  • Worth noting that traditional cloud backup services typically have a revision log + the ability to upload documents. Given a trusted - in the sense of likelihood to be hacked or otherwise compromised - cloud backup service (e.g. Google Drive) would probably fit the requirements (allow switching from private to public with the original timestamp), but I agree completely with this answer that that would not be an advisable approach. – David Mulder Nov 16 '20 at 09:05
  • @preferred_anon I think the idea is that you hash version 1 of your paper. Perhaps you notice some inconsequential errors (e.g., misspelled words). You can produce version 2 but you must maintain version 1. The hash establishes the timeline of version 1. People can decide for themselves if the difference between versions 1 and 2 are consequential or not. If you make a consequential change then the hash no longer supports the timeline of version 2. – emory Nov 16 '20 at 16:36
  • @emory Of course - but in real life, people are often not so careful. It is after all easy to absent-mindedly modify a file without remembering it. Now imagine you leave the file sitting in your project folder for 10 years, as somebody might, and the probability of unwittingly making a change is not so low. – preferred_anon Nov 16 '20 at 22:33
  • @preferred_anon If one (1) think one has an idea worthy of a Fields medal; (2) is not ready to publicly release it; and (3) can not figure out how to use git to version ones documents; then one has little credibility. – emory Nov 20 '20 at 14:29
  • 3
    @emory Sorry, that's a non-sequitur. I know quite a few brilliant mathematicians that do not know how to use version control. H***, I know brilliant mathematicians which do discrete algebra and will not touch continuous spaces with a ten-foot pole. – Captain Emacs Nov 20 '20 at 15:27
  • @CaptainEmacs git does not have much to do with it. if one wants a Fields medal but does not want to publish then one has little credibility. if further one does not want to figure out git, then one has even less. – emory Nov 20 '20 at 16:18
  • @emory I agree with the "not prepared to get it to published level"; version control has nothing to do with it. However, OP asks how to protect their work. Not sure it is so easy, and the service they look for does probably not exist in this form. – Captain Emacs Nov 20 '20 at 21:00
  • 1
    @academic You've almost reinvented RFC 3161 trusted timestamping there, except in RFC 3161, the hashing is done locally, not on a remote server, and you don't make the hash public, you send it to a timestamping authority who adds a timestamp to the hash, cryptographically signs the combination of hash and timestamp, and sends the result back to you. – Daniel Hatton Apr 08 '21 at 17:00
12

Focusing just on the mistakes angle. Don't worry about having mistakes in arxiv papers, its a preprint service for a reason people know they are not full reviewed papers so my contain mistakes (not that fully reviewed papers don't also contain mistakes).

You can always update the version in the arxiv when you realize a mistake. I usually update my papers after peer review which fixes issues picked up by the referee, but I have also updated papers on arxiv before peer review finished because I made some basic mistakes that made me look bad. But you know what I'm still in academia and i doubt anyone even remembers or has the time to go look at what changed between arxiv versions.

Rob
  • 4,572
  • 16
  • 21
  • 1
    But that mistake stays there for ever for the committees to view anytime! – High GPA Nov 15 '20 at 09:35
  • 28
    So? you'll be lucky if a search committee reads the published version of your papers let alone goes back and reads each and every version you have on arxiv. – Rob Nov 15 '20 at 09:45
  • 3
    This post should be downvoted because it does not answer the question. Still, I agree with it: the whole point of arXiv, as I understand it, is to distribute new research quickly, with everyone understanding that it is more likely to contain mistakes. – Brian Drake Nov 15 '20 at 13:22
  • 5
    I haven't downvoted, but if you publish on arXiv, you should be reasonably sure that your paper does not contain glaring mistakes. It might be suboptimally presented and may contain minor glitches, but it is for everybody to see and is effectively public. OP is right to not want a paper in an immature state to be on arXiv. I would say, arXiv is for papers of which at least the authors believe to be essentially ready. – Captain Emacs Nov 15 '20 at 18:01
  • 6
    @BrianDrake Precisely. The point is to distribute new research quickly in a citable manner, which is exactly the reason why it is impossible to unpublish things on arxiv. I'd think thrice about better alternatives before citing a source that could suddenly vanish without any notice. – mlk Nov 15 '20 at 18:12
  • @Rob I do not have too much experience for hiring committees. But some PhD applicants do try to spot the technical and ethical mistakes of their competitors, and then send a mass email to possible PhD admission committee members. I personal know someone who successfully prevent his competitor to be admitted, by doing this. – High GPA Nov 15 '20 at 21:51
  • @HighGPA "ethical mistakes"? What do you mean by that? I wonder about this person that prevented that competitor to be admitted - were they admitted themselves? How did they submit the compromising material? Anonymously? – Captain Emacs Nov 16 '20 at 00:28
  • 6
    If an academic committee hires somebody based on a mass email "exposing" another candidate's mistake, I'd considered that a bullet dodged. I don't think any school "prestige" can offset a work environment where you have to snitch on your peers as if this is 1984. – juod Nov 16 '20 at 01:22
  • @CaptainEmacs By "ethical mistake" I mean the author seemed to faked a diagram with synthetic data; or the author did not cite a paper he supposed to cite; he did not credit a paper he supposed to credit. Although the author claimed that he never read that paper, the accuser still try to convince the committee that the all reasonable researcher in the branch do read the paper. In US some PhD program take one student out of 100. In the specific case, dozens of applicants united together to send out a lot of emails; some anonymously, some not. I am not sure if one of them is finally admitted – High GPA Nov 17 '20 at 01:08
  • @juod I truly agree with you but those peers are hard to be spotted. Many of those emails are anonymous. Some email senders are smart enough to publicly expose the mistakes in the paper and then motivate other applicants to send out emails. – High GPA Nov 17 '20 at 01:10
  • 1
    @HighGPA If the author faked a diagram, they deserve to be kicked out of the competition. This is not a mistake, this is fabrication, and a capital crime in science, in my opinion even worse than plagiarism, because it wastes everybody's time. As for failing to cover all the literature - what you describe is an accuser that clearly has some close interaction with the hiring committee. If they really spend time listening to denunciation stories by anonymous/applicant about omitted citations, that sounds like a pretty - excuse the word - sick environment; better not to be selected. – Captain Emacs Nov 17 '20 at 01:24
  • @CaptainEmacs The problem is, it is hard to prove that they actually faked a diagram, as Arvix does not require the full disclosure of data and code. Nothing is that black and white in real world, so I used the word "seemed". The accuser gives evidences, and the author gives counter evidences. There is no court rule on this. Regarding failing citations: this is not a minor issue; by claiming that others' contributions are yours is technically plagiarism, unless you could prove that you independently developed that idea. Just two cents. – High GPA Nov 17 '20 at 02:40
  • @CaptainEmacs Sorry that people seems to interpret my "ethical mistake" as "a mistake that is not unethical". What I meant is a mistake that raises questions regarding the authors' ethics. Truly sorry for my English skills. – High GPA Nov 17 '20 at 02:50
  • @HighGPA Basically, the only issue is plagiarism. And, yes, anyone can claim that. But even in parallel publications, there are lab books, mails etc. to disprove that. I am not aware of a service for safe deposit of notes to protect you against plagiarism. Claiming that someone fabricated data require much stronger evidence. Again, when that happens (which is rare enough), strong investigation needs to happen. Really, I would not think too much about that. If a department is rife with real or alleged data fabrication, you are better served staying away from it. – Captain Emacs Nov 17 '20 at 16:28
10

The website you are looking for probably doesn't exist and if it does it's moot because no one will check it. Then if another paper comes along with the same ideas, people will read that paper, not yours. Moreover, even if you have proof, i bet few people will actually care that you came first. (And, pardon my harshness, deservedly so, because you purposedly decided not to share your work with the community.)

The best way to avoid having mistakes on the Arxiv is:

  1. Send the draft to some expert in the field that you trust asking them for an opinion. If they agree, they will certainly spot the most embarassing mistakes. If they decline, they will likely do so in a polite way and they will be happy that you chose to contact them since that means that you value their opinion. As a bonus, they could also serve as an unbiased witness to the fact that you had the idea independently, in case you get scooped at the last minute.

  2. Use the "replacement" feature on Arxiv. You can always correct mistakes on your preprints. Sure, the older versions are still visible, but Arxiv by default shows the latest one and who is going to check an older version on purpose?

Also, the premise of your question seems a bit off to me: you would like to keep a result unpublished because you don't know that it's correct, but you'd still like to retain priority rights if someone publishes before you. But if you don't know that your ideas are correct, it seems to me that you have no results to claim priority for.

If you are worried that point 1. will actually increase your chances to get scooped, well, that's true. But academia thrives on the fact that ideas can be freely exchanged among scholars. The risk of being scooped is a price that I happily pay for a more open community. I have to re-apologize for my harshness, but if you disagree with this mindset, you should reconsider working in academia. (So I hope you agree.)

BlueElephant
  • 1,630
  • 8
  • 16
  • 2
    I haven't downvoted, but Point 1 is questionable. People have enough to do with regular reviews. Unless it is someone OP has a very good relationship with, it is unlikely to find a proper external reviewer outside of a formal publication. They can as well submit to a journal. As for your last paragraph putting OP in their place, it is really uncalled for. I happen to have the luck to work in a friendly community, the OP may have not. Even so, I sometimes did encounter some really nasty apples. If OP is in a unpleasant community (they exist, read academia.SE), their likelihood is much higher. – Captain Emacs Nov 16 '20 at 13:10
  • 2
    The best way to avoid mistakes is to have good coauthors! – Noah Snyder Nov 17 '20 at 16:42
  • Truly appreciate your words! I highly value your opinion. I do love to learn more. I agree that ideas should be shared openly, in a condition that the results are correct. I raised the original question because I frequently observe people rush to publish immature preprints to claim priority; sometimes, the claims in their abstract is correct, but the proofs or the experiments are mistaken. So if I don't rush to put a timestamp on my work, then someone else might publish something on preprint before me and later accuse me for plagiarism, even if their analysis is immature or incomplete. – High GPA Nov 18 '20 at 04:36
6

Figshare, for one, has the facility to apply an embargo to an item such that the material (e.g. an attached PDF) itself is not visible to the public, though the content description is. The timestamp is in the History section near the bottom of the item page. It looks like it even allows generating a private link for reviewers to access the material, but I've not tested this with an embargo active.

I'm not sure what you gain over saving it locally though - your selected reviewers can steal your ideas whether you send it by private mail or via some pre-print site, and you'll only find out when they publish and thus win the race.

Edit: I've reflected on this overnight, and my second paragraph above is a bit too pessimistic: I still don't think your proposed route will help you seek redress after the fact (save for literal plagiarism - wholesale lifting of content) but I can see that having a third party recording timestamps would have a deterrent effect against people stealing the ideas. It also provides them with a specific date when the embargo will end and they can use the ideas legitimately. (Doesn't affect those having the same idea independently, obviously.)

Bear in mind though that applying an embargo commits you to publishing that preprint, so if the underlying concepts are totally flawed you will still have to write a modified version admitting so and publish that version.

Lou Knee
  • 1,021
  • 4
  • 7
4

You can have this functionality without even trusting the hypothetical public server with any of your data treasure. Free public timestamp authorities (which follow RFC 3161) allow you to timestamp your article so that it is cryptographically provable that the article existed before a certain date (when you requested a timestamp from that particular public timestamp server). The technique is normally used for purposes of proving that certain data is genuine, i.e., coming from a particular originator. First the data is signed by that originator's private key (in the originator's environment), then a hash is computed and the public timestamp server is asked to "counter-sign" the data (or rather, just the hash plus that server's timestamp). This second signing of a very short byte stream happens using the public timestamp server's key in that server's environment.

Software vendors use this technique not primarily for protection of priority or of their intellectual property, but rather in order to provide a permanent cryptographical proof that their private signing act happened before the vendor's own certificate (private key) expired or was revoked, one of which is bound to eventually occur. Note that the public timestamp server does NOT have to store any record of whatever it ever countersigned, and its cooperation is NOT necessary for subsequent verification of authenticity of the data. (I have simplified how this works. There are additional transparency measures that prevent that timestamp server to maliciously backdate anything it wants to backdate. The service is not stateless.)

So, you could even build the website you want to use yourself, at your home, with the main external component being the public timestamp server.

If you ever get into a serious priority dispute, you will be able to prove your case at a pretty high standard, unless they suspect you of outright hacking of data from a "real" originator.

There is however a big hole there. You won't be able to present the corrected version of your article backdated. You will only be able to prove what the article looked like at a certain point in time. A combination of a "flawed priority article" and a "corrected subsequent article" is more likely to be taken as a proof that your "earlier" work was flawed and therefore unfinished (not yet in existence), so in the opposite sense to what you might want to attach to it.

Therefore you are better off to follow the normal practice and learn to publish pre-prints whenever you run out of the obvious ideas why your work might be flawed. Initially, your teachers will help you spot such obvious weak points and address them while you don't have a publishable article.

Jirka Hanika
  • 905
  • 6
  • 10
3

arXiv will keep every version on the website, so people can always go back and see what version 1 or version 2 looked like, even when the paper is on it's 5th version (for example).

These are some places where you can put your pre-prints, and then replace them without a version history being visible to any outsider:

  • Academia.edu
  • ResearchGate
  • Mendeley
  • FigShare
  • Zenodo
  • GitHub (if you delete your version history every time you post a new version)

Also a PDF on your own website can often be picked up by Google Scholar: I've seen this happen to my own papers automatically even when I didn't want it to happen.

Nik
  • 4,748
  • 13
  • 50
  • Can you make your Academia.edu or ResearchGate papers not indexed by google scholar? – High GPA Nov 17 '20 at 00:38
  • @HighGPA Yes! First of all Google Scholar is not great at picking up Academia.edu papers, compared to picking up arXiv papers or journal articles, then you can also manually un-index papers in Google Scholar if they do pick up a paper that you don't want indexed. – Nik Nov 17 '20 at 00:55
  • I am sorry but how do I un-index a paper on google scholar search engine or the main google search engine? Do you mean that I can remove that article from my personal google scholar page? – High GPA Nov 17 '20 at 00:58
  • 1
    @HighGPA I've seen articles removed from Google Scholar, but I would have to look up how to do it because I never did that myself. Removing them from a personal page is even easier. GitHub is not indexed by Google Scholar by default: many times I wished it was! Because I post papers on GitHub well before posting them on arXiv, but GitHub is only indexed by Google Scholar unless someone explicitly cites the GitHub repository (which is hard to do!). If you want to guarantee that it doesn't show up on Google Scholar, GitHub might be a better option. There's also FigShare and Zenodo. – Nik Nov 17 '20 at 01:13
2

If your goal is simply to timestamp the content, then you can create a hash and make that public. That is, you compute a MD5 or SHA hash of your pdf and tweet that, put it on your wikipedia page, or attach it to another arxiv submission. This provides proof that your document existed at that time, without revealing the contents of that document. At a later time you can reveal the document and others can check that the hash matches.

If your goal is to make the document "public" without it being indexed or earlier versions being available, then you can put it on a personal website and use robots.txt to tell web crawlers not to index it. That doesn't guarantee it won't be indexed, but it will stop most things.

Thomas
  • 18,342
  • 7
  • 45
  • 69