Libata error messages

 
 

Overview

All libata error messages produced by the kernel use a standard format:

ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata3.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0
res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
ata3.00: status: { DRDY }

Prefix

The prefix

ata3.00:

decodes as

ata prefix, indicating this is a libata port or device message
3 port number, counting from one (1)
00 device number, usually zero unless Port Multiplier or PATA master/slave is involved
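The prefix breakdown above can be sketched as a small parser. This is an illustrative helper, not part of libata: the name `parse_prefix` is hypothetical, and the sketch assumes any dmesg timestamp has already been stripped and that port-level messages omit the `.NN` device suffix.

```python
import re

# Matches "ata3.00:" (device message) and, by assumption, "ata3:" (port message).
PREFIX_RE = re.compile(r"^ata(?P<port>\d+)(?:\.(?P<dev>\d+))?:")

def parse_prefix(line):
    """Return (port, device) for a libata log line, or None if it has no ata prefix.

    device is None when the line carries no ".NN" device suffix.
    """
    m = PREFIX_RE.match(line)
    if m is None:
        return None
    dev = m.group("dev")
    return int(m.group("port")), (int(dev) if dev is not None else None)

print(parse_prefix("ata3.00: status: { DRDY }"))
```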

Exception line

The exception line gives an overview of the EH (Error Handler) state.

exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen

Emask Error classification bitmask (AC_ERR_xxx in source code)
SAct SATA SActive register
SErr SATA SError register
action ATA_EH_xxx actions, like revalidate, softreset, hardreset (see include/linux/libata.h)
frozen if present, indicates the port was frozen for EH
t<number> number of retries
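As a rough sketch, the fields listed above can be pulled out of an exception line with a regular expression. The function name `parse_exception` is hypothetical, and the sketch assumes the optional `frozen` and `t<number>` tokens appear in that order at the end of the line.

```python
import re

HEX = r"0x[0-9a-fA-F]+"
EXCEPTION_RE = re.compile(
    r"exception Emask (?P<emask>" + HEX + r") SAct (?P<sact>" + HEX + r") "
    r"SErr (?P<serr>" + HEX + r") action (?P<action>" + HEX + r")"
    r"(?P<frozen> frozen)?(?: t(?P<tries>\d+))?"
)

def parse_exception(line):
    """Return the exception-line fields as a dict, or None if the line does not match."""
    m = EXCEPTION_RE.search(line)
    if m is None:
        return None
    tries = m.group("tries")
    return {
        "emask": int(m.group("emask"), 16),
        "sact": int(m.group("sact"), 16),
        "serr": int(m.group("serr"), 16),
        "action": int(m.group("action"), 16),
        "frozen": m.group("frozen") is not None,  # True when " frozen" is present
        "tries": int(tries) if tries is not None else None,
    }

info = parse_exception(
    "ata3.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen"
)
print(info)
```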

Input taskfile

The "cmd" line gives the ATA command (taskfile) sent to the device:

cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0

This lists ATA registers in the following order:

ea Command (FLUSH CACHE EXT EAh, Non-Data)
/ (separator)
00 Feature
00 NSect
00 LBA L
00 LBA M
00 LBA H
/ (separator)
00 HOB Feature
00 HOB NSect
00 HOB LBA L
00 HOB LBA M
00 HOB LBA H
/ (separator)
a0 Device/Head
tag NCQ tag
0 NCQ tag number, or listed as zero if NCQ is not active/applicable.
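The register order above maps directly onto the `/` and `:` separators, so a minimal sketch of a decoder might look like the following. The dictionary keys and the name `parse_cmd` are chosen for this example only; the sketch assumes the register string is the token after `cmd` and the tag number is the token after `tag`.

```python
# Register names per "/"-separated group, in the order listed above.
CMD_GROUPS = [
    ["command"],
    ["feature", "nsect", "lbal", "lbam", "lbah"],
    ["hob_feature", "hob_nsect", "hob_lbal", "hob_lbam", "hob_lbah"],
    ["device"],
]

def parse_cmd(line):
    """Return the input taskfile registers and NCQ tag as a dict, or None."""
    tokens = line.split()
    if "cmd" not in tokens or "tag" not in tokens:
        return None
    regs_text = tokens[tokens.index("cmd") + 1]
    regs = {}
    for names, group in zip(CMD_GROUPS, regs_text.split("/")):
        for name, value in zip(names, group.split(":")):
            regs[name] = int(value, 16)  # register values are printed in hex
    regs["tag"] = int(tokens[tokens.index("tag") + 1])
    return regs

regs = parse_cmd("ata3.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0")
print(hex(regs["command"]), hex(regs["device"]), regs["tag"])
```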

Output taskfile, error summary

The next line contains a current dump of the ATA device's registers, along with an error summary:

res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)

In order:

40 Status
/ (separator)
00 Error
00 NSect
01 LBA L
4f LBA M
c2 LBA H
/ (separator)
00 HOB Error
00 HOB NSect
00 HOB LBA L
00 HOB LBA M
00 HOB LBA H
/ (separator)
00 Device/Head
Emask 0x4 ATA command's internal error mask (AC_ERR_xxx in source code)
(timeout) An English summary of the error, such as

  • timeout
  • HSM violation
  • media error

See below for a full list.
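A "res" line can be split in the same way as a "cmd" line, with the error mask and English summary captured alongside the registers. This is only a sketch: `parse_res` and the dictionary keys are names invented for the example, and the pattern assumes the summary is the parenthesised text right after the Emask value.

```python
import re

# Register names per "/"-separated group, in the order listed above.
RES_GROUPS = [
    ["status"],
    ["error", "nsect", "lbal", "lbam", "lbah"],
    ["hob_error", "hob_nsect", "hob_lbal", "hob_lbam", "hob_lbah"],
    ["device"],
]

RES_RE = re.compile(
    r"res (?P<regs>[0-9a-fA-F:/]+) Emask (?P<emask>0x[0-9a-fA-F]+) "
    r"\((?P<summary>[^)]*)\)"
)

def parse_res(line):
    """Return the output taskfile registers, Emask and summary, or None."""
    m = RES_RE.search(line)
    if m is None:
        return None
    result = {}
    for names, group in zip(RES_GROUPS, m.group("regs").split("/")):
        for name, value in zip(names, group.split(":")):
            result[name] = int(value, 16)
    result["emask"] = int(m.group("emask"), 16)
    result["summary"] = m.group("summary")
    return result

res = parse_res("res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)")
print(hex(res["status"]), res["summary"])
```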

Error classes

These are the possible values for the internal error mask in each error message mentioned above.

The AC_ERR_xxx values (ATA Completion Errors) are defined in include/linux/libata.h.

0x20 host bus error Host<->chip bus error (i.e. PCI, if on PCI bus)
0x10 ATA bus error chip<->device bus error
0x4 timeout Controller failed to respond to an active ATA command. This can have any number of causes. Most often it is due to an unrelated interrupt subsystem bug that failed to deliver an interrupt when one was expected from the hardware (try booting with 'pci=nomsi', 'acpi=off', or 'noapic').
0x2 HSM violation Hardware failed to respond in an expected manner. "HSM" stands for Host State Machine, a software-based finite state machine required by ATA that expects certain hardware behaviors, based on the current ATA command and other hardware-state programming details.
0x40 internal error Hardware flagged an impossible condition, most likely due to software misprogramming.
0x8 media error Software detected a media error
0x80 invalid argument Software marked ATA command as invalid, for some reason
0x1 device error Hardware indicates an error with last command. This error is delivered directly from the ATA device. If you see a lot of these, that is often an indication of a hardware problem.
0x100 unknown error Uncategorized error (should never happen)
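Because Emask is a bitmask, more than one class can be set at once. The table above can be turned into a lookup along these lines; `decode_emask` is a hypothetical helper name, and returning every set bit (rather than a single summary string) is a choice made for this sketch.

```python
# (bit, summary) pairs copied from the table above, lowest bit first.
EMASK_BITS = [
    (0x001, "device error"),
    (0x002, "HSM violation"),
    (0x004, "timeout"),
    (0x008, "media error"),
    (0x010, "ATA bus error"),
    (0x020, "host bus error"),
    (0x040, "internal error"),
    (0x080, "invalid argument"),
    (0x100, "unknown error"),
]

def decode_emask(emask):
    """Return the summaries for every error class bit set in emask."""
    return [name for bit, name in EMASK_BITS if emask & bit]

print(decode_emask(0x4))
print(decode_emask(0x11))
```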

ATA status expansion

The final line

status: { DRDY }

expands the ATA status register returned in the output taskfile into its component bits:

Busy Device busy (all other bits invalid)
DRDY Device ready. Normally 1, when all is OK.
DRQ Data ready to be sent/received via PIO
DF Device fault
ERR Error (see Error register for more info)
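The expansion of the Status byte can be sketched as below. The bit values are the conventional ATA Status register assignments and are an assumption here, since the list above gives only the names; `decode_status` is a hypothetical helper. When Busy is set the sketch reports only Busy, following the note that all other bits are then invalid.

```python
# Conventional ATA Status register bits (assumed; not stated in the text above).
ATA_BUSY = 0x80
STATUS_BITS = [
    (0x40, "DRDY"),
    (0x20, "DF"),
    (0x08, "DRQ"),
    (0x01, "ERR"),
]

def decode_status(status):
    """Return the names of the bits set in an ATA Status register value."""
    if status & ATA_BUSY:
        return ["Busy"]  # other bits are not meaningful while the device is busy
    return [name for bit, name in STATUS_BITS if status & bit]

print("status: { " + " ".join(decode_status(0x40)) + " }")
```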

ATA error expansion

If any bits in the Error register are set, the register is expanded into its component bits, for example:

error: { ICRC ABRT }

ICRC Interface CRC error during Ultra DMA transfer - often either a bad cable or power problem, though possibly an incorrect Ultra DMA mode setting by the driver
UNC Uncorrectable error - often due to bad sectors on the disk
IDNF Requested address was not found
ABRT Command aborted - either command not supported, unable to complete, or interface CRC (with ICRC)
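The Error register can be expanded the same way. The bit values below are the usual ATA Error register assignments and are an assumption, as the list above names the bits without giving their positions; `decode_error` is a hypothetical helper.

```python
# Conventional ATA Error register bits (assumed; not stated in the text above).
ERROR_BITS = [
    (0x80, "ICRC"),
    (0x40, "UNC"),
    (0x10, "IDNF"),
    (0x04, "ABRT"),
]

def decode_error(error):
    """Return the names of the bits set in an ATA Error register value."""
    return [name for bit, name in ERROR_BITS if error & bit]

print("error: { " + " ".join(decode_error(0x84)) + " }")
```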

SATA SError expansion

If any bits in the SATA SError register are set, the register is expanded into its component bits, for example:

SError: { PHYRdyChg CommWake }

These bits are set by the SATA host interface in response to error conditions on the SATA link. Unless a drive hotplug or unplug operation occurred, it is generally not normal to see any of these bits set. If they are, it usually points strongly toward a hardware problem (often a bad SATA cable or a bad or inadequate power supply).

RecovData Data integrity error occurred, but the interface recovered
RecovComm Communications between device and host temporarily lost, but regained
UnrecovData Data integrity error occurred, interface did not recover
Persist Persistent communication or data integrity error
Proto SATA protocol violation detected
HostInt Host bus adapter internal error
PHYRdyChg PhyRdy signal changed state
PHYInt PHY internal error
CommWake COMWAKE detected by PHY (PHY woken up)
10B8B 10b to 8b decoding error occurred
Dispar Incorrect disparity detected
BadCRC Link layer CRC error occurred
Handshk R_ERR handshake response received in response to frame transmission
LinkSeq Link state machine error occurred
TrStaTrns Transport layer state transition error occurred
UnrecFIS Unrecognized FIS (frame information structure) received
DevExch Device presence has changed
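To decode a raw SErr value from the exception line, the names above can be paired with bit positions. The positions below follow the SError register layout in the SATA specification and are an assumption, since the list above gives names only; `decode_serror` is a hypothetical helper.

```python
# (bit position, name) pairs; positions assumed from the SATA SError layout.
SERR_BITS = [
    (0, "RecovData"),
    (1, "RecovComm"),
    (8, "UnrecovData"),
    (9, "Persist"),
    (10, "Proto"),
    (11, "HostInt"),
    (16, "PHYRdyChg"),
    (17, "PHYInt"),
    (18, "CommWake"),
    (19, "10B8B"),
    (20, "Dispar"),
    (21, "BadCRC"),
    (22, "Handshk"),
    (23, "LinkSeq"),
    (24, "TrStaTrns"),
    (25, "UnrecFIS"),
    (26, "DevExch"),
]

def decode_serror(serr):
    """Return the names of the bits set in a SATA SError register value."""
    return [name for pos, name in SERR_BITS if serr & (1 << pos)]

print("SError: { " + " ".join(decode_serror(0x50000)) + " }")
```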
