ZFS Administration - Part X - Creating Filesystems

1

We now begin down the path that is the "bread and butter" of ZFS, known as "ZFS datasets", or filesystems. Previously, up to this point, we've been discussing how to manage our storage pools. But storage pools are not meant to store data directly. Instead, we should create filesystems that share the same storage system. We'll refer to these filesystems from now as datasets.

== Background ==

First, we need to understand how traditional filesystems and volume management work in GNU/Linux before we can get a thorough understanding of ZFS datasets. To treat this fairly, we need to assemble Linux software RAID, LVM, and ext4 or another Linux kernel supported filesystem together.

6

7

This is done by creating a redundant array of disks, and exporting a block device to represent that array. Then, we format that exported block device using LVM. If we have multiple RAID arrays, we format each of those as well. We then add all these exported block devices to a "volume group" which represents my pooled storage. If I had five exported RAID arrays, of 1 TB each, then I would have 5 TB of pooled storage in this volume group. Now, I need to decide how to divide up the volume, to create logical volumes of a specific size. If this was for an Ubuntu or Debian installation, maybe I would give 100 GB to one logical volume for the root filesystem. That 100 GB is now marked as occupied by the volume group. I then give 500 GB to my home directory, and so forth. Each operation exports a block device, representing my logical volume. It's these block devices that I format with ex4 or a filesystem of my choosing.

8

9

[[image:lvm.png||alt="lvm" height="386" width="526"]]

10

//Linux RAID, LVM, and filesystem stack. Each filesystem is limited in size.//

11

12

13

In this scenario, each logical volume is a fixed size in the volume group. It cannot address the full pool. So, when formatting the logical volume block device, the filesystem is a fixed size. When that device fills, you must resize the logical volume and the filesystem together. This typically requires a myriad of commands, and it's tricky to get just right without losing data.

14

15

ZFS handles filesystems a bit differently. First, there is no need to create this stacked approach to storage. We've already covered how to pool the storage, now we well cover how to use it. This is done by creating a dataset in the filesystem. By default, this dataset will have full access to the entire storage pool. If our storage pool is 5 TB in size, as previously mentioned, then our first dataset will have access to all 5 TB in the pool. If I create a second dataset, it too will have full access to all 5 TB in the pool. And so on and so forth.

16

17

[[image:zfs.png||alt="zfs" height="259" width="521"]]

18

//Each ZFS dataset can use the full underlying storage.//

19

20

21

Now, as files are placed in the dataset, the pool marks that storage as unavailable to all datasets. This means that each dataset is aware of what is available in the pool and what is not by all other datasets in the pool. There is no need to create logical volumes of limited size. Each dataset will continue to place files in the pool, until the pool is filled. As the cards fall, they fall. You can, of course, put quotas on datasets, limiting their size, or export ZVOLs, topics we'll cover later.

22

23

So, let's create some datasets.

== Basic Creation ==

In these examples, we will assume our ZFS shared storage is named "tank". Further, we will assume that the pool is created with 4 preallocated files of 1 GB in size each, in a RAIDZ-1 array. Let's create some datasets.

28

29

30

# zfs create tank/test

31

# zfs list

32

NAME USED AVAIL REFER MOUNTPOINT

33

tank 175K 2.92G 43.4K /tank

34

tank/test 41.9K 2.92G 41.9K /tank/test

35

36

37

Notice that the dataset "tank/test" is mounted to "/tank/test" by default, and that it has full access to the entire pool. Also notice that it is occupying only 41.9 KB of the pool. Let's create 4 more datasets, then look at the output:

38

39

40

# zfs create tank/test2

41

# zfs create tank/test3

42

# zfs create tank/test4

43

# zfs create tank/test5

44

# zfs list

45

NAME USED AVAIL REFER MOUNTPOINT

46

tank 392K 2.92G 47.9K /tank

47

tank/test 41.9K 2.92G 41.9K /tank/test

48

tank/test2 41.9K 2.92G 41.9K /tank/test2

49

tank/test3 41.9K 2.92G 41.9K /tank/test3

50

tank/test4 41.9K 2.92G 41.9K /tank/test4

51

tank/test5 41.9K 2.92G 41.9K /tank/test5

52

53

54

Each dataset is automatically mounted to its respective mount point, and each dataset has full unfettered access to the storage pool. Let's fill up some data in one of the datasets, and see how that affects the underlying storage:

# cd /tank/test3

# for i in {1..10}; do dd if=/dev/urandom of=file$i.img bs=1024 count=$RANDOM &> /dev/null; done

59

# zfs list

60

NAME USED AVAIL REFER MOUNTPOINT

61

tank 159M 2.77G 49.4K /tank

62

tank/test 41.9K 2.77G 41.9K /tank/test

63

tank/test2 41.9K 2.77G 41.9K /tank/test2

64

tank/test3 158M 2.77G 158M /tank/test3

65

tank/test4 41.9K 2.77G 41.9K /tank/test4

66

tank/test5 41.9K 2.77G 41.9K /tank/test5

67

68

69

Notice that in my case, "tank/test3" is occupying 158 MB of disk, so according to the rest of the datasets, there is only 2.77 GB available in the pool, where previously there was 2.92 GB. So as you can see, the big advantage here is that I do not need to worry about preallocated block devices, as I would with LVM. Instead, ZFS manages the entire stack, so it understands how much data has been occupied, and how much is available.

70

71

== Mounting Datasets ==

72

73

It's important to understand that when creating datasets, you aren't creating exportable block devices by default. This means you don't have something directly to mount. In conclusion, there is nothing to add to your /etc/fstab file for persistence across reboots.

74

75

So, if there is nothing to add do the /etc/fstab file, how do the filesystems get mounted? This is done by importing the pool, if necessary, then running the "zfs mount" command. Similarly, we have a "zfs unmount" command to unmount datasets, or we can use the standard "umount" utility:

# umount /tank/test5

# mount | grep tank

tank/test on /tank/test type zfs (rw,relatime,xattr)

81

tank/test2 on /tank/test2 type zfs (rw,relatime,xattr)

82

tank/test3 on /tank/test3 type zfs (rw,relatime,xattr)

83

tank/test4 on /tank/test4 type zfs (rw,relatime,xattr)

84

# zfs mount tank/test5

85

# mount | grep tank

86

tank/test on /tank/test type zfs (rw,relatime,xattr)

87

tank/test2 on /tank/test2 type zfs (rw,relatime,xattr)

88

tank/test3 on /tank/test3 type zfs (rw,relatime,xattr)

89

tank/test4 on /tank/test4 type zfs (rw,relatime,xattr)

90

tank/test5 on /tank/test5 type zfs (rw,relatime,xattr)

91

92

93

By default, the mount point for the dataset is "/<pool-name>/<dataset-name>". This can be changed, by changing the dataset property. Just as storage pools have properties that can be tuned, so do datasets. We'll dedicate a full post to dataset properties later. We only need to change the "mountpoint" property, as follows:

94

95

96

# zfs set mountpoint=/mnt/test tank/test

97

# mount | grep tank

98

tank on /tank type zfs (rw,relatime,xattr)

99

tank/test2 on /tank/test2 type zfs (rw,relatime,xattr)

100

tank/test3 on /tank/test3 type zfs (rw,relatime,xattr)

101

tank/test4 on /tank/test4 type zfs (rw,relatime,xattr)

102

tank/test5 on /tank/test5 type zfs (rw,relatime,xattr)

103

tank/test on /mnt/test type zfs (rw,relatime,xattr)

104

105

106

== Nested Datasets ==

107

108

Datasets don't need to be isolated. You can create nested datasets within each other. This allows you to create namespaces, while tuning a nested directory structure, without affecting the other. For example, maybe you want compression on /var/log, but not on the parent /var. there are other benefits as well, with some caveats that we will look at later.

109

110

To create a nested dataset, create it like you would any other, by providing the parent storage pool //and// dataset. In this case we will create a nested log dataset in the test dataset:

111

112

113

# zfs create tank/test/log

114

# zfs list

115

NAME USED AVAIL REFER MOUNTPOINT

116

tank 159M 2.77G 47.9K /tank

117

tank/test 85.3K 2.77G 43.4K /mnt/test

118

tank/test/log 41.9K 2.77G 41.9K /mnt/test/log

119

tank/test2 41.9K 2.77G 41.9K /tank/test2

120

tank/test3 158M 2.77G 158M /tank/test3

121

tank/test4 41.9K 2.77G 41.9K /tank/test4

122

tank/test5 41.9K 2.77G 41.9K /tank/test5

123

124

125

== Additional Dataset Administration ==

126

127

Along with creating datasets, when you no longer need them, you can destroy them. This frees up the blocks for use by other datasets, and cannot be reverted without a previous snapshot, which we'll cover later. To destroy a dataset:

128

129

130

# zfs destroy tank/test5

131

# zfs list

132

NAME USED AVAIL REFER MOUNTPOINT

133

tank 159M 2.77G 49.4K /tank

134

tank/test 41.9K 2.77G 41.9K /mnt/test

135

tank/test/log 41.9K 2.77G 41.9K /mnt/test/log

136

tank/test2 41.9K 2.77G 41.9K /tank/test2

137

tank/test3 158M 2.77G 158M /tank/test3

138

tank/test4 41.9K 2.77G 41.9K /tank/test4

139

140

141

We can also rename a dataset if needed. This is handy when the purpose of the dataset changes, and you want the name to reflect that purpose. The arguments take a dataset source as the first argument and the new name as the last argument. To rename the tank/test3 dataset to music:

142

143

144

# zfs rename tank/test3 tank/music

145

# zfs list

146

NAME USED AVAIL REFER MOUNTPOINT

147

tank 159M 2.77G 49.4K /tank

148

tank/music 158M 2.77G 158M /tank/music

149

tank/test 41.9K 2.77G 41.9K /mnt/test

150

tank/test/log 41.9K 2.77G 41.9K /mnt/test/log

151

tank/test2 41.9K 2.77G 41.9K /tank/test2

152

tank/test4 41.9K 2.77G 41.9K /tank/test4

== Conclusion ==

This will get you started with understanding ZFS datasets. There are many more subcommands with the "zfs" command that are available, with a number of different switches. Check the manpage for the full listing. However, even though this isn't a deeply thorough examination of datasets, many more principles and concepts will surface as we work through the series. By the end, you should be familiar enough with datasets that you will be able to manage your entire storage infrastructure with minimal effort.

----

(% style="text-align: center;" %)

163

Posted by Aaron Toponce on Monday, December 17, 2012, at 6:00 am.

164

Filed under [[Debian>>url:https://web.archive.org/web/20210430213505/https://pthree.org/category/debian/]], [[Linux>>url:https://web.archive.org/web/20210430213505/https://pthree.org/category/linux/]], [[Ubuntu>>url:https://web.archive.org/web/20210430213505/https://pthree.org/category/ubuntu/]], [[ZFS>>url:https://web.archive.org/web/20210430213505/https://pthree.org/category/zfs/]].

165

Follow any responses to this post with its [[comments RSS>>url:https://web.archive.org/web/20210430213505/https://pthree.org/2012/12/17/zfs-administration-part-x-creating-filesystems/feed/]] feed.

166

You can [[post a comment>>url:https://web.archive.org/web/20210430213505/https://pthree.org/2012/12/17/zfs-administration-part-x-creating-filesystems/#respond]] or [[trackback>>url:https://web.archive.org/web/20210430213505/https://pthree.org/2012/12/17/zfs-administration-part-x-creating-filesystems/trackback/]] from your blog.

167

For IM, Email or Microblogs, here is the [[Shortlink>>url:https://web.archive.org/web/20210430213505/https://pthree.org/?p=2849]].

----

[[https:~~/~~/web.archive.org/web/20210430213505/https:~~/~~/pthree.org/2012/12/17/zfs-administration-part-x-creating-filesystems/>>https://web.archive.org/web/20210430213505/https://pthree.org/2012/12/17/zfs-administration-part-x-creating-filesystems/]]

173

Wiki source code of ZFS Administration - Part X - Creating Filesystems