1.前言

本文所述关于文件管理的系列文章主要是对陈莉君老师所讲述的文件系统管理知识讲座的整理。

Linux可以支持不同的文件系统，它源于unix文件系统，也是unix文件系统的一大特色。

本文将以不同文件系统之间的拷贝为实例进行讲述

2. 实例：文件拷贝

图不同文件系统之间的拷贝

图文件拷贝对应的C语言片段

3.打开文件

3.1 open函数

文件读写之前都要先打开文件，打开函数的原型如下：

open通过路径名、标志和mask信息，打开或创建文件，最后返回此文件对应的fd
用户态下调用open，进入系统调用处理程序后，会调用内核相应的系统调用服务例程

3.2 打开文件的内核实现

　从整体流程来看，open的内核实现如下：

进程从用户态获取路径名到内核缓冲区；
然后查找到父目录；如果设置了O_CREAT标志，则继续查找路径最后一个分量
最后获取对应文件的打开文件结构
将这个结构与当前进程的打开文件表联系起来，返回相应的fd。

4. do_sys_open

 long do_sys_open(int dfd, const char __user *filename, int flags, umode_t mode)

 {

     struct open_flags op;

     int fd = build_open_flags(flags, mode, &op);

     struct filename *tmp;

     if (fd)

         return fd;

     tmp = getname(filename);

     if (IS_ERR(tmp))

         return PTR_ERR(tmp);

     fd = get_unused_fd_flags(flags);

     if (fd >= ) {

         struct file *f = do_filp_open(dfd, tmp, &op);

         if (IS_ERR(f)) {

             put_unused_fd(fd);

             fd = PTR_ERR(f);

         } else {

             fsnotify_open(f);

             fd_install(fd, f);

         }

     }

     putname(tmp);

     return fd;

 }

open系统调用服务例程的核心为do_sys_open

4.1 do_filp_open

 struct file *do_filp_open(int dfd, struct filename *pathname,

         const struct open_flags *op)

 {

     struct nameidata nd;

     int flags = op->lookup_flags;

     struct file *filp;

     set_nameidata(&nd, dfd, pathname);

     filp = path_openat(&nd, op, flags | LOOKUP_RCU);

     if (unlikely(filp == ERR_PTR(-ECHILD)))

         filp = path_openat(&nd, op, flags);

     if (unlikely(filp == ERR_PTR(-ESTALE)))

         filp = path_openat(&nd, op, flags | LOOKUP_REVAL);

     restore_nameidata();

     return filp;

 }

当内核要访问一个文件时，第一步需要找到这个文件，这由do_filp_open完成
在do_filp_open的实现中，查找文件过程由path_openat调用path_init和link_path_walk完成
这两个函数将用户传进来的用字符串表示的文件路径，转换成一个dentry结构，建立好相应的inode，并返回file对象

4.2 fd_install

 void fd_install(unsigned int fd, struct file *file)

 {

     __fd_install(current->files, fd, file);

 }

 void __fd_install(struct files_struct *files, unsigned int fd,

         struct file *file)

 {

     struct fdtable *fdt;

     might_sleep();

     rcu_read_lock_sched();

     while (unlikely(files->resize_in_progress)) {

         rcu_read_unlock_sched();

         wait_event(files->resize_wait, !files->resize_in_progress);

         rcu_read_lock_sched();

     }

     /* coupled with smp_wmb() in expand_fdtable() */

     smp_rmb();

     fdt = rcu_dereference_sched(files->fdt);

     BUG_ON(fdt->fd[fd] != NULL);

     rcu_assign_pointer(fdt->fd[fd], file);

     rcu_read_unlock_sched();

 }

do_sys_open完成以上处理后，将获取到的file结构体通过fd_install到当前进程的打开文件表中。其索引为fd

4.3 do_sys_open剩余操作

do_sys_open的剩余将进程关联的file的描述符返回用户

用户随后通过文件描述符，来访问这些数据结构

如上打开文件的核心是查找文件

5.查找文件

1. 打开文件的核心为查找，通常内核将查找过程分为两部分：

查找起始位置信息

主要是判断是系统根目录还是当前工作目录，以获取后面循环查找的起始位置（如/home/clj/file1.c中的“/”）

循环查找路径名后续分量

以起始位置开始，循环查找后续每个路径分量

2. 循环查找路径分量的过程，涉及多级cache.

循环查找后续路径分量，首先从dentry cache开始查找，在dentry cache中查找对应的dentry，若找到则直接返回；

若没有找到，则必须去底层文件系统查找对应的dentry

5.1 dentry cache的引入

由于块设备速度比较慢，可能需要很长时间才能找到与一个文件名关联的inode信息，所以引入dentry cache

5.2 dentry cache的描述

缓存的组织

散列表：包含了所有活动的dentry对象。散列表由dentry_hashtable组织，dentry通过d_hash连入散列表中；

LRU链表：dentry结构体中由d_lru链表组织。LRU链表中的元素同时也在dentry cache中；

缓存的查找

缓存由d_hash计算散列值，通过值对应的索引从dentry_hashtable中查找相应的队列；

再从队列头循环查找对应的dentry;

并将其从LRU中移除

图 dentry cache组织图

5.3 快速查找关键结构体qstr

 struct qstr {

     union {

         struct {

             HASH_LEN_DECLARE;

         };

         u64 hash_len;

     };

     const unsigned char *name;

 };

5.4 dentry cache查找关键代码

 struct dentry *__d_lookup(const struct dentry *parent, const struct qstr *name)

 {

     unsigned int len = name->len;

     unsigned int hash = name->hash;

     const unsigned char *str = name->name;

     struct hlist_bl_head *b = d_hash(parent, hash);

     struct hlist_bl_node *node;

     struct dentry *found = NULL;

     struct dentry *dentry;

     /*

      * Note: There is significant duplication with __d_lookup_rcu which is

      * required to prevent single threaded performance regressions

      * especially on architectures where smp_rmb (in seqcounts) are costly.

      * Keep the two functions in sync.

      */

     /*

      * The hash list is protected using RCU.

      *

      * Take d_lock when comparing a candidate dentry, to avoid races

      * with d_move().

      *

      * It is possible that concurrent renames can mess up our list

      * walk here and result in missing our dentry, resulting in the

      * false-negative result. d_lookup() protects against concurrent

      * renames using rename_lock seqlock.

      *

      * See Documentation/filesystems/path-lookup.txt for more details.

      */

     rcu_read_lock();

     hlist_bl_for_each_entry_rcu(dentry, node, b, d_hash) {

         if (dentry->d_name.hash != hash)

             continue;

         spin_lock(&dentry->d_lock);

         if (dentry->d_parent != parent)

             goto next;

         if (d_unhashed(dentry))

             goto next;

         /*

          * It is safe to compare names since d_move() cannot

          * change the qstr (protected by d_lock).

          */

         if (parent->d_flags & DCACHE_OP_COMPARE) {

             int tlen = dentry->d_name.len;

             const char *tname = dentry->d_name.name;

             if (parent->d_op->d_compare(parent, dentry, tlen, tname, name))

                 goto next;

         } else {

             if (dentry->d_name.len != len)

                 goto next;

             if (dentry_cmp(dentry, str, len))

                 goto next;

         }

         dentry->d_lockref.count++;

         found = dentry;

         spin_unlock(&dentry->d_lock);

         break;

 next:

         spin_unlock(&dentry->d_lock);

      }

      rcu_read_unlock();

      return found;

 }

Linux文件系统3--打开文件的更多相关文章

Linux操作系统中打开文件数量的查看方法
Linux操作系统中打开文件数量的查看方法ulimit -n 4096也就是限制用户的最大文件打开数为4096个在网上查了关于怎么查看文件打开数的文章大致有两种说法/proc/sys/fs/file ...
【Linux学习】Linux文件系统5—查看文件内容命令
Linux文件系统5-查看文件内容命令 cat: 由第一行开始显示文件内容 more: 一页一页地显示文件内容,空格键可以继续翻页显示下一页内容 less:与more类似,但是可以往前翻页 head: ...
Linux下Firefox打开文件jnlp文件
ubuntu(linux)打开jnlp文件咘咘 2019-05-20 15:12:48 1331 收藏展开前提条件是安装有java环境．whereis java 查看自己java安装目录．本人是在 ...
Linux文件系统与日志文件
目录一.inode和block 1.1.inode和block概述 1.2.inode的内容 inode包含文件的元信息: 查看inode号两种方式目录文件的结构 1.3.inode的号码用户通 ...
深入理解Linux文件系统与日志文件
目录: 一.inode与block 二.inode内容三.inode的号码四.inode的大小五.链接文件六.inode节点耗尽故障处理七.恢复EXT类型的文件编译安装extundelete ...
Linux文件系统之删除文件、文件夹（rm,rmdir）
rm命令,rmdir命令 rm命令Remove,功能:1)删除目录,2)删除文件. (可以递归的删除指定目录的所有文件及子目录) 注意:rm是一个危险的命令,使用的时候要特别当心,尤其对于初学者来说 ...
【Linux】系统打开文件最大数量限制（进程打开的最大文件句柄数设置）
利用ulimit命令可以对资源的可用性进行控制. -H选项和-S选项分别表示对给定资源的硬限制(hard limit)和软限制(soft limit)进行设置. 硬限制(hard limit)一旦被设 ...
Linux记录-lsof打开文件工具常用操作
lsof `which httpd` //那个进程在使用apache的可执行文件 lsof /etc/passwd //那个进程在占用/etc/passwd lsof /dev/hda6 //那个进程 ...
Linux文件系统之复制文件cp（文件复制）
cp 命令(文件复制) cp命令用来将一个或多个源文件或者目录复制到指定的目的文件或目录.它可以将单个源文件复制成一个指定文件名的具体的文件或一个已经存在的目录下.cp命令还支持同时复制多个文件, ...

随机推荐

【题解】 Codeforces Edu44 F.Isomorphic Strings （字符串Hash）
题面戳我 Solution 我们按照每个字母出现的位置进行\(hash\),比如我们记录\(a\)的位置:我们就可以把位置表示为\(0101000111\)这种形式,然后进行字符串\(hash\) 每 ...
洛谷 P1378 油滴扩展改错
P1378 油滴扩展题目描述在一个长方形框子里,最多有\(N(0≤N≤6)\)个相异的点,在其中任何一个点上放一个很小的油滴,那么这个油滴会一直扩展,直到接触到其他油滴或者框子的边界.必须等一个油 ...
修复Mysql主从不同步shell
使用第三方工具MySQL Enterprise Monitor,MySQL企业版监控工具.MONyog – MySQL Monior and Advisor,MONyog大家都不陌生,windows下 ...
CF710F String Set Queries
CF710F String Set Queries 支持字符串的插入和删除...SAM也干不了这个事所以可以用cdq分治+AC自动机O(nlogn)解决但是本题强制在线~~~ 我们还有一个工具,叫 ...
[Cqoi2014]数三角形——组合数
Description: 给定一个nxm的网格,请计算三点都在格点上的三角形共有多少个.下图为4x4的网格上的一个三角形. 注意三角形的三点不能共线. Hint: 1<=m,n<=1000 ...
A1041. Be Unique
Being unique is so important to people on Mars that even their lottery is designed in a unique way. ...
SQL语句中 int 溢出 + Asp语句中 Long 溢出
晚上5点多,同事在QQ告诉我,一个用户向他反应,在他登录的时候显示错误信息,我们在管理平台查看该用户的基本信息时,也显示错误信息. 经过初步分析,原来是在执行 SQL语句的时候发生Int溢出: sql ...
Visio画UML类图、序列图 for Java
参考文档: 1.百度搜索: 怎样用Visio 2007画C++类图连接 https://jingyan.baidu.com/article/9f7e7ec07286e16f281554f7.html ...
[持续更新] 文章列表 Last Update: 8/21/2017
1.前端 HTML5快速学习二 Canvas@20141125 HTML5快速学习一@20141122 2.ASP.NET(MVC) MVC5+EF6 完整教程17--升级到EFCore2.0@201 ...
django基础之数据库操作
Django 自称是“最适合开发有限期的完美WEB框架”.本文参考<Django web开发指南>,快速搭建一个blog 出来,在中间涉及诸多知识点,这里不会详细说明,如果你是第一次接触D ...

Linux文件系统3--打开文件