Why do date ranges in computing start at ancient periods?

-1

Reading the Wikipedia article for HFS+, I noticed the allowable date range is January 1, 1904 – February 6, 2040. Likewise, the range for NTFS is 1 January 1601 – 28 May 60056. To my mind this seems ridiculous, as I can't think of any case where a file would need a modified/created date set in the 1600s or 1900s. I can understand the Unix timestamp epoch, since it would be reasonable for a file to have been created or modified in the 70s, 80s, etc., but it just seems illogical to set the epoch of those timestamps so far back in the past.

svbnet

Posted 2017-03-20T01:08:02.410

Reputation: 107

2 It's due to the 400-year cycle for leap intervals, and the modulo arithmetic involved in the math surrounding leap cycles. 1601 is the beginning of the first such period in which digital files came to exist. – Frank Thomas – 2017-03-20T01:17:52.920

It's notable that ANSI adopted this same date for their date stamps, so it's used in the COBOL and other ANSI language specifications. Basically, it makes a lot of the math on the raw value a whole lot easier if you can limit the number of recursions in terms of time-adjustment cycles by establishing a base that we haven't drifted too far from (in the grand scheme of things). If you have ever tried to write code to calculate the date Easter Sunday will fall on, you will notice that the further back or forward you go, the longer and more complicated the required formula becomes. – Frank Thomas – 2017-03-20T02:30:03.507

Answers

2

It's not just NTFS; Windows internal timekeeping is done using the same time format and with the same start of epoch.

They knew they wanted a 64-bit binary time value, because the original Unix 32-bit value was already known to be a dead end (those counters will wrap in 2038), and 64-bit time values had already been used in VMS. 64 bits gives the ability to reckon about 18 billion billion different time values. Well, actually, only 9 billion billion because time values with the high bit set have a different meaning in Windows (as they do in VMS). So we really have "only" 63 bits for counting date and time of day.
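
(For the curious, those "billion billion" figures are simply 2^64 and 2^63; a one-liner in Python confirms them:)

    # 2**64 and 2**63: the "18 billion billion" and "9 billion billion" values mentioned above
    print(f"{2**64:,}")   # 18,446,744,073,709,551,616
    print(f"{2**63:,}")   #  9,223,372,036,854,775,808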

Whereas the 32-bit Unix time only counted seconds, Windows timestamps reckon in increments of 100 nanoseconds. So a time value of 1 means 100 ns past midnight on January 1, 1601.
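
To put some concrete numbers on that, here is a minimal conversion sketch in plain Python (not any Windows API; the helper names are just for illustration, and the only "magic number" is derived from the two epoch dates):

    from datetime import datetime, timezone

    WINDOWS_EPOCH = datetime(1601, 1, 1, tzinfo=timezone.utc)
    UNIX_EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)
    TICKS_PER_SECOND = 10_000_000                               # one tick = 100 ns
    EPOCH_DIFF = (UNIX_EPOCH - WINDOWS_EPOCH).total_seconds()   # 11,644,473,600 seconds

    def ticks_to_unix_seconds(ticks: int) -> float:
        """100-ns ticks since 1601-01-01 UTC -> seconds since 1970-01-01 UTC."""
        return ticks / TICKS_PER_SECOND - EPOCH_DIFF

    def unix_seconds_to_ticks(seconds: float) -> int:
        """Seconds since 1970-01-01 UTC -> 100-ns ticks since 1601-01-01 UTC."""
        return round((seconds + EPOCH_DIFF) * TICKS_PER_SECOND)

    print(unix_seconds_to_ticks(0))                    # 116444736000000000 (the Unix epoch, in ticks)
    print(ticks_to_unix_seconds(116444736000000000))   # 0.0

    # How far 63 and 64 bits of such ticks reach, counting from 1601:
    SECONDS_PER_YEAR = 365.2425 * 86400                # average Gregorian year
    for bits in (63, 64):
        years = (2 ** bits / TICKS_PER_SECOND) / SECONDS_PER_YEAR
        print(f"{bits} bits: ~{years:,.0f} years, i.e. out to roughly the year {1601 + int(years)}")
    # 63 bits: ~29,227 years -> about the year 30828 (the commonly quoted maximum for a signed value)
    # 64 bits: ~58,454 years -> about the year 60055 (in line with the "28 May 60056" NTFS limit in the question)

The 116,444,736,000,000,000-tick offset is just the 369 years between the two epochs expressed in 100-ns units.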

But why pick such a "historical" date?

Well, for one, it does make day-of-week and similar calculations a bit easier, since 1601 was the first year of the earliest 400-year Gregorian cycle to include electronic computers of any sort. There is some very authoritative support for that reason.
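
The arithmetic behind that: a full 400-year Gregorian cycle is 400 × 365 + 97 leap days = 146,097 days, which happens to be an exact number of weeks, so the weekday pattern repeats identically every 400 years. A small check in Python (which uses the proleptic Gregorian calendar for its dates):

    from datetime import date

    DAYS_PER_400_YEARS = 400 * 365 + 97                   # 97 leap days per Gregorian cycle
    print(DAYS_PER_400_YEARS, DAYS_PER_400_YEARS % 7)     # 146097 0 -> an exact number of weeks

    # Consequently 1 Jan 1601, 1 Jan 2001, 1 Jan 2401, ... all fall on the same weekday:
    print(date(1601, 1, 1).weekday(), date(2001, 1, 1).weekday())   # 0 0 (both Mondays)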

However, I have to opine that, in the context of modern computing, the additional compute power needed to cope with a different starting year would be pretty small.

January 1 1601 also happens to be the date from which ANSI dates are counted. So the "Windows date" is the same day number as the "ANSI date", which makes things a trifle easier in various places.

It's also been standardized as "year 1" of the Gregorian calendar (even though that calendar was not adopted everywhere at that time).

For a practical, working reason, though, consider: this date/time format allows historical dates and times to be represented in, say, databases right alongside present-day ones, using the same format. A genealogical database, for example, can thus store your ancestors' birth and death dates going back 400+ years, which is far longer than most such records exist in reliable form.
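
As a tiny, self-contained illustration (the helper function and the dates below are made up for the example), a 17th-century date is just another non-negative tick count in this scheme:

    from datetime import datetime, timezone

    WINDOWS_EPOCH = datetime(1601, 1, 1, tzinfo=timezone.utc)

    def to_ticks(dt: datetime) -> int:
        """Express a UTC datetime as 100-ns ticks since 1601-01-01."""
        return round((dt - WINDOWS_EPOCH).total_seconds() * 10_000_000)

    ancestor_born = datetime(1652, 7, 14, tzinfo=timezone.utc)   # hypothetical genealogy record
    record_saved = datetime(2017, 3, 20, tzinfo=timezone.utc)
    print(to_ticks(ancestor_born), to_ticks(record_saved))       # both ordinary positive values that fit in 64 bits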

There would have been no point in extending this further back, starting at say the year 1201 or even 1, because of the Julian-to-Gregorian calendar transition, which began in some countries in 1582 and continued, depending on the country, as late as 1926. All dates recorded in the ANSI time format and, by extension, in the Windows "binary" time format are assumed to be in the Gregorian calendar.

btw, VMS uses a similar scheme, but its base time is 17 Nov 1858. This was a standard picked by the Smithsonian Astrophysical Observatory as a "base date" for satellite tracking, and it is related to astronomers' earlier use of the original Julian Day scheme, which counts days from noon on January 1, 4713 B.C. On that scheme, midnight on 17 Nov 1858 falls at Julian Day 2,400,000.5; the Modified Julian Day is defined as the Julian Day minus 2,400,000.5, so 17 Nov 1858 is MJD 0. By using the MJD instead of the JD they were able to fit contemporary dates into just 18 bits, which was an important feat at the time. See this article from VMS Engineering for details.

Jamie Hanrahan

Posted 2017-03-20T01:08:02.410

Reputation: 19 777

1

Dates (and their format specification at the bit level) are not only used to label files, but also for calculations and in many other places. A historian might want to have dates from the 17th or 18th century in their Excel columns, for example; or an astronomer might want to calculate planetary alignments in those periods.

Even if the number of people that have a use for it is small, the loss is negligible: it doesn't matter much whether you can use this format for 100 or 5000 or 60000 years; it will probably not survive the next 50 years anyway.

Aganju

Posted 2017-03-20T01:08:02.410

Reputation: 9 103

"it will probably not survive the next 20 years." That's a bit pessimistic. Why do you believe this? – DavidPostill – 2017-03-20T16:34:12.293

Not because it is bad or anything, but better things will come along; if you look at the current rate at which sizes are increasing, terabytes will be the normal size for files in 20 years. I am willing to change it to 50; my point was not thousands of years. – Aganju – 2017-03-20T16:45:07.333

@Aganju, I think you underestimate the amount of COBOL still running on mainframes out there. Also, note that for high-level programs like Excel, they are free to implement whatever backing and presentation format they like for dates, so the topic really has little to do with userland programs, but instead with the low-level mechanics of the filesystem; stuff that is right on the line between software and hardware. There is a lot of number theory at that level; a lot of extra-smart people spent years coming up with extremely efficient and clever algorithms. – Frank Thomas – 2017-03-20T21:28:34.063

And the format / convention of storing years as two-digit integers probably won’t survive past 1990 … oh, wait … :-) – G-Man Says 'Reinstate Monica' – 2017-09-07T17:23:39.137

1

When examined in isolation, the wide timestamp range of NTFS may seem illogical. But when you look at the big picture, it is completely logical.

One of the things an operating system does is provide a set of functions that most programs will need. This allows programmers to concentrate on their programs instead of wasting time rewriting these common functions for each program. Any operating system that does not provide these functions is unlikely to be successful. Windows and Linux provide hundreds of such functions.

Windows includes a series of functions for representing and working with dates and times. For maximum usefulness this covers as wide a range of dates as is reasonably possible. Many programs use these functions for a wide variety of purposes.

The NTFS file system was released as part of the NT platform. Like any modern file system, it needed some way of storing file date stamps. Logically, the designers chose the same representation as the one provided to applications. This makes things simpler for developers. Of course the date range is much wider than needed for date stamps, but it costs nothing and causes no problems. Using a different system with a more restricted range of dates for date stamps would have been illogical.
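
To illustrate that "same system" point from the application side, here is a minimal sketch in plain Python (it runs on any platform; on Windows/NTFS the nanosecond value it prints is ultimately derived from the same 64-bit, 100-ns-per-tick timestamps discussed above):

    import os
    import tempfile
    from datetime import datetime, timezone

    # Create a throwaway file just so there is something to stat.
    with tempfile.NamedTemporaryFile(delete=False) as f:
        path = f.name

    st = os.stat(path)
    print(st.st_mtime_ns)                                         # modification time as integer nanoseconds since the Unix epoch
    print(datetime.fromtimestamp(st.st_mtime, tz=timezone.utc))   # the same instant as a datetime

    os.remove(path)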

LMiller7

Posted 2017-03-20T01:08:02.410

Reputation: 1 849