NFS

NFS is the Network File Storage protocol. It’s actually a standard of protocols that allow POSIX-like file access from a client to data that’s stored on a remote server.

Although NFS itself has not been a high-performance interface for networked file systems, many concepts that it employs are in use by HPC parallel file systems. Knowing how NFS works helps understand how these other file systems work, and knowing why NFS is slow helps understand the design decisions that have guided HPC-optimized file systems.

There are also a few recent extensions to NFS that can make it a high-performance parallel client. VAST is perhaps the most interesting of such implementations since it employs NFS over RDMA¹, nconnect², and an enhanced NFS multipathing³ to enable parallel, scale-out I/O performance.

NFS version 3

NFS version 3 is famously stateless; the NFS server does not keep track of who has the file system mounted or what files are open. It even goes so far as to not implement open or close commands, making it kind of like an object store in its low-level semantics. Instead, all NFS clients rely on the NFS server to provide file handles which are magical tokens that uniquely identify files or directories.

As a result,

mounting an NFS export actually involves asking a special service on the NFS server for the root file handle.
opening a file on an NFS export involves performing recursive LOOKUP commands given the file handle of a parent directory and the name of a file or directory in that parent. This starts with the root file handle.
reading from a file involves READing data from a file handle given a byte offset and number of bytes
writing to a file involves WRITEing data to a file handle at a given byte offset
closing a file, well, doesn’t happen because you never opened the file. If you want, you can COMMIT and force both client and server to flush any cached writes down to persistent media.

Because NFSv3 is stateless but POSIX file I/O is inherently stateful, there’s a bunch of wonky bolt-on services required by NFS servers in practice to make stateless NFSv3 operate like a stateful file system. For example,

allowing clients to statefully mount the NFS export (mount -t nfs) is enabled by mountd
allowing clients to hold locks on files (e.g., flock) is enabled by lockd

In practice, NFSv3 is like a good idea in principle that forgot to include a lot of important features used in practice.

Lock Manager (lockd) and Status Monitor (statd)

The state of all outstanding NFS locks is persisted by the NFS statd on a local file system on the NFS server. These outstanding locks can be viewed in /var/lib/nfs/sm. For example, taking a file lock on a file named hello on an NFS mount on a client named cloverdale:

glock@cloverdale:/mnt/nfs$ flock hello sleep 30

results in the following appearing on the server:

root@synology:/var/lib/nfs/sm# ls -lrt
total 4
-rw------- 1 root root 92 Mar 29 19:54 cloverdale
root@synology:/var/lib/nfs/sm# cat cloverdale
0100007f 000186b5 00000003 00000010 66a89ab53b650b00804d729c00000000 192.168.50.27 synology

Note that merely opening a file for writes does not lock it; this is why casual file editing from multiple NFS clients can result in file corruption.

NFS version 4

NFSv4 is a massive improvement over NFSv3 that

rolls up the core NFSv3 service and all the required add-ons into a single standard
adds a bunch of new features that enhance performance and functionality that were completely missing from NFSv3

Hammerspace wrote a really nice whitepaper that discusses the evolution of NFSv4, pNFS, and the recent enhancements.

Parallel NFS (pNFS) from NFS v4.1

The original implementation of Parallel NFS was kind of junky, because it relied on clients getting file layout information from a single metadata server (like Lustre does) before clients can talk directly to data servers. Panasas and NetApp were proponents of this, but it never gained a lot of traction.

pNFS v4.2 with Flex Files

See Flex Files.

Linux NFS

The Linux NFS client has become the de facto standard and supports a bunch of implementation-specific features that make it useful beyond what the standard prescribes. A few:

nconnect:Establishes multiple connections between client and server, then stripes RPCs over them to get more parallelism in data transfers. Is able to boost the achieval
noalignwrite: Prevents the client from sending back more than it has written when doing a partial-page write. This allows two clients to modify non-overlapping parts of the same 4K page at the same time and not suffer from last-writer-wins consistency.⁴ It was implemented in Linux 6.12.⁵

https://www.usenix.org/legacy/events/fast02/wips.html#callaghan ↩
https://lkml.iu.edu/hypermail/linux/kernel/1907.2/02845.html ↩
https://vastdata.com/blog/meet-your-need-for-speed-with-nfs/ ↩
https://manpages.debian.org/unstable/nfs-common/nfs.5.en.html#noalignwrite ↩
According to personal communication with Sven Breuener on 2025-07-22. He referenced https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=dfb07e990a0d019d7ae9b78dd4260620ce32e79a ↩

Glenn's Digital Garden

Explorer

NFS

NFS version 3

Lock Manager (lockd) and Status Monitor (statd)

NFS version 4

Parallel NFS (pNFS) from NFS v4.1

pNFS v4.2 with Flex Files

Linux NFS

Graph View

Table of Contents

Backlinks

Glenn's Digital Garden

Explorer

NFS

NFS version 3

Lock Manager (lockd) and Status Monitor (statd)

NFS version 4

Parallel NFS (pNFS) from NFS v4.1

pNFS v4.2 with Flex Files

Linux NFS

Footnotes

Graph View

Table of Contents

Backlinks