hSeek on text files
Bulat Ziganshin
bulat.ziganshin at gmail.com
Fri Jul 28 07:10:04 EDT 2006
Hello Duncan,
Friday, July 28, 2006, 2:52:10 PM, you wrote:
>> Simon, are you remember problem with using hTell+hSeek on handles open
>> in text (not binary) mode on Windows? afair, you was finished with the
>> decision to use NoBuffering for text files on Windows?
>>
>> but this solution is very inefficient. i now thought about dealing
>> with the same problem in my lib and found that there is another
>> solution - prohibit using of hTell/hSeek on files open in text mode
>> (on Unix, too?). i think this is better - one should either open file
>> in binary mode and use random access or open file in text mode and
>> read/write it sequentially. what you think about it?
>>
>> also, i will be glad to hear comments from other haskellers
> According to MS documentation:
> http://msdn.microsoft.com/library/default.asp?url=/library/en-us/wceappdev5/html/wce50lrffseek.asp
> For streams opened in text mode, fseek has limited use, because
> carriage return — linefeed translations can cause fseek to
> produce unexpected results.
>
> The only fseek operations guaranteed to work on streams opened
> in text mode are as follows:
>
> * Seeking with an offset of 0 relative to any of the
> origin values.
> * Seeking from the beginning of the file with an offset
> value returned from a call to ftell
> It's a good point that you might want to seek to a point previously
> remembered with hTell, so we probably don't want to ban it entirely.
> I could not find any documentation in the Win32 API describing the
> behaviour of text files vs binary files. Is this translation behaviour
> only implemented at the MS C library layer?
after writing this letter, i added one more comment to my
internal documentation - "hRewind and seeking to EOF will be also
useful". about fseek - i don't see an easy way to implement it. if
your translated data are already in buffer, then fseek can't return
just file position corresponding to buffer start + current offset in
buffer. btw, the same problem arises when we want to use UTF-8 or some
other Char<->Byte translation with buffered I/O (using iconv, f.e.)
so, for translated text
1) we can implement hRewind/hSeekToEOF
2) we can implement some form of hSavePosition/hRestorePosition if
this position will include tuple (file position of buffer start, offset
in buffer)
3) we can implement tell/seek functionality but this will be slow
--
Best regards,
Bulat mailto:Bulat.Ziganshin at gmail.com
More information about the Libraries
mailing list