Talk:8.3 filename

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
WikiProject iconComputing Start‑class Low‑importance
WikiProject iconThis article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
StartThis article has been rated as Start-class on Wikipedia's content assessment scale.
 Low This article has been rated as Low-importance on the project's importance scale.

unicode support?[edit]

if full Unicode support didn't come until Windows 2000, how exactly were Unicode long file names supported?—Kbolino 06:46, Apr 7, 2005 (UTC)

what do you mean by "full unicode support"? 98 had some unicode support though it didn't have all apis (i don't have 95 handy to try but it wouldn't surprise me if it had some degree of support). NT4 was natively unicode though i belive it only supported the basic multilingual plane. Even now unicode support is not without its quirks in most operating systems. Plugwash 20:21, 13 Jun 2005 (UTC)
Full Unicode support is still miles away, even though were getting closerMs dos mode (talk) 21:35, 22 June 2009 (UTC)Reply[reply]
Windows 95+ supported two subsets of Unicode, defined by the system ANSI and OEM codepages. IFSMGR.VXD used UNICODE.BIN file to perform the required conversions. (talk) 12:36, 20 December 2014 (UTC)Reply[reply]

page name[edit]

this page used to be at "8.3 (computing)" imo a horrible and not very informative name.

vague rant who has never edited this page then moved it to just 8.3. which i belive is even worse since it gives very little idea of the articles subject.

if there are no objections here in a day or so. i'm going to move the page to the more discriptive name of 8.3 filename limitations. Plugwash 20:27, 13 Jun 2005 (UTC)

A more accurate title might be "8.3 FAT filename mapping" or "8.3 file naming scheme", but I don't see what's to be gained by moving. "8.3" is a moniker that the article has set out to explain, and it does that quite well. Another similar example might be 24/7.
Ghakko 23:03, 13 Jun 2005 (UTC)
I'll point out that my page move was simply made because there was no need for disambiguation, since the article 8.3 did not exist. I didn't take into account whether there might be a better title, I was just removing unnecessary disambiguation. I've no issue with the page being moved somewhere else. I've likewise no issue with it staying here. I'm not fussy. - Vague | Rant 13:31, Jun 16, 2005 (UTC)

Naming Convention Speculation?[edit]

I've read theories about this, and usually discussions about this end up saying there's no real answer.

First one from : Because to do things (like copy a file), you had to type in the entire file name (and path name), people used lots of abbreviations and concatenation to reduce typing. This is why CP/M (DOS) used 8.3 (8 charaters + a 3 character suffix).

Second: If you've come across with the name Gary Kildall during your research, count the letters in his name. This doesn't explain why it wasn't 7.4 though.

Third: 8.3, 8 + dot + 3 = 12 characters, compressed nicely with RAD50 into two words. However, CP/M does not compress filenames, and RAD50 was usually used to compress 6.3 or 9.3 filenames (some computers were 6-bit back then).

Fourth: It comes from the 12 rows on an 80 column card. One row for each letter, plus the 12th row for a data clock (based on the stereotypical IBM 029 keypunch). A single column was punched in each row to mark the letter from the character set, yielding up to 80 choices per character.

Fifth: There is evidence that Gary was one of those rare individuals with both DEC and IBM influences in his background. DEC had established the 3-character file extension as a standard on its systems, and IBM mainframes of the time had 8 characters as a common namespace size. Put the two together, and you get 8.3.

Sixth: DEC's RT-11 used 8.3 filenames, and CP/M was developed using it as a guideline. This doesn't explain where RT-11 got 8.3 though.

Seventh: Kildall chose 8.3 because he liked it.

I believe that the Mac OS X implementation of FAT32 uses a similar hashing step to the one listed as step 4 of the "algorithm" but I'm not completely sure about this. It may be worth mentioning if someone more knowledgable can verify this. 02:26, 21 January 2007 (UTC)Reply[reply]

DOS 8.3[edit]

This is usually called DOS 8.3. If the article is not going to have that official name, it should certainly redirect to here from there, and should mention the term right at the start. *** The discussion of the dot should perhaps add some more tech background. The dot does not exist in the internal representation -- there is simply a place for the 8 letters and a place for the 3 letters. So, the dot is always implicitly present. And it is also misleading to say that 8.3 names are uppercase. Is a tech sense that may be true, but in another sense they are generally defined as caseless, and can be represented arbitrarily as uppercase or lowercase. All a source of endless potential problems. 19:26, 21 January 2007 (UTC)Reply[reply]

I always thought of eight dot three file names as always being uppercase/all caps. Like the true form of 8.3 filenames are in all caps.
wmic lists all filenames and folder names as being all lowercase:
C:\>wmic datafile where name="C:\\0\\Doc\\Web\\Site\\n\\_b_ - _ z _ 8.html" list/format:list
dir lists all 8.3 filenames and foldernames as being all uppercase:
C:\0\Doc\Web\Site\n>dir /x/r
[...] _B_-_Z~1.HTM [...]
8.3 names are generated for folders too FYI. --User123o987name (talk) 06:45, 9 November 2020 (UTC)Reply[reply]

more info needed[edit]

There are endless complications with DOS 8.3 / LFN. This is a core tech subject. It is pathetic that this article is so limited, and has no links to relevant external resources. 20:12, 21 January 2007 (UTC)Reply[reply]

Move .. again[edit]

I'll agree with the discussion above that a simple title of "8.3" is vague. According to the MSDN documentation this is called a "8.3 file name" (spaces are intentional—although I could swing towards "filename" for constancy within Wikipedia). Could we agree to move it to either 8.3 file name or 8.3 filename? +mwtoews 17:38, 22 March 2007 (UTC)Reply[reply]

Moved. I chose to use "filename" over "file name" as it appears Wikipedia has more consistency with that spelling.+mwtoews 21:58, 24 March 2007 (UTC)Reply[reply]

Translating LFN to SFN[edit]

I don't understand "followed by 4 hexadecimal digits derived from an undocumented hash of the filename" how, exactly, does one determine those four digits on their own? Maybe it's my severe lack of understanding of Hexadecimals ...

TextFile.Mine = TE021F~1


00000000h: 54 65 78 74 46 69 6C 65 2E 4D 69 6E 65  ; TextFile.Mine

00000000h: 78 74 46 69 6C 65 2E 4D 69 6E 65  ; xtFile.Mine

00000000h: 54 65 78 74 46 69 6C 65 2E  ; TextFile

Jndrline (talk) 20:10, 16 July 2008 (UTC)Reply[reply]

Hex is just a way of expressing a number. The number is determined by an algorithm which is undocumented. The algorithm varies by OS, version, and sometimes service pack. —EncMstr (talk) 21:22, 16 July 2008 (UTC)Reply[reply]
I am sure I have seen such SFNs, but at the same time, I am certain I have also seen SFNs of the sort FOOBA~12.TXT (which uses less than 6 characters and more than one digit). The article does not mention that, nor anything useful about the 4-hex-digit variety. It is a little vague as it stands and needs some information about that/those. I guess I just got a new project. ಠ_ಠ Synetech (talk) 19:45, 21 August 2015 (UTC)Reply[reply]
Windows NT systems use the checksum after FOOBAR~4.TXT, but pre-NT (I'm pretty sure that includes 95 but I don't know what the last non-NT system was) systems use FOOBAR~5.TXT, which can logically be extended to double-digit numbers (so FOOBA~12.TXT like you said)... Nameless6144 (talk) 12:58, 20 May 2018 (UTC)Reply[reply]
Found a page about a successful reverse-engineering of this checksum... A Tale of Two File Names: Should it be included, or is it not encyclopedic to include information discovered through reverse-engineering? Either way, it's probably what Jndrline wanted, so I'll post it here. Nameless6144 (talk) 12:58, 20 May 2018 (UTC)Reply[reply]
Wow Nameless6144! You answered a question I'd forgotten I ever asked. I think my original intent was writing cmd/bat files, and how can you predict an 8.3 file name in certain instances. What a great read! Thanks for sharing. Jndrline (talk) 19:16, 27 September 2018 (UTC)Reply[reply] (talk) 10:43, 3 August 2018 (UTC)Reply[reply]

The example of a shortened filename 'OVI3KV~N' contains the character 'N', whereas the text below this clearly states that the character after the tilde is always a digit.

Converstion to short name misses details[edit]

The conversion section does not cover how I can manually obtain the right number after the tilde.

For example, if I have to filenames abcdefgh-cat.txt and abcdefgh-dog.txt which one will be ABCDEF~1.TXT and which one will be ABCDEF~2.TXT? How do I determine the correct number? -- (talk) 10:06, 18 June 2022 (UTC)Reply[reply]

Usually, the file that was present in the directory first will be "~1", and the file added there later will be "~2". However, the purpose of this article is not really to examine the details of the operating system source code... AnonMoos (talk) 17:56, 18 June 2022 (UTC)Reply[reply]