sfpshow - fault-finding on Brocade Fibre Channel Switches

So you've hit a situation where a Fibre Channel link light has gone out and you don't know what is wrong. Alternately, perhaps you have extremely high latency on just one ESX hosts with regular storage disconnections (even though the link light looks fine)

At a high level, it is likely to be one of the following, but you don't want to go through and replace/re-seat/test each of these components:

  • FC HBA (PCI Host Bus Adapter in the server or storage)
  • SFP (Small Form-Factor Pluggable optical/copper transciever)
  • Fibre Channel patch cable
  • Structured Cabling

If you're using Brocade Fibre channel switches, there is some simple troubleshooting you can perform using statistics collected in the switch

sfpshow

This command will print stats pertaining to a particular port on the switch.
Ideally if you can capture this output regularly (e.g. monthly via
"supportshow"), it will give you an excellent baseline to compare
against when a fault occurs.

Below is an example of a relatively healthy port

switch01> sfpshow 3
Identifier: 3 SFP
Connector: 7 LC
Transceiver: 540c404000000000 200,400,800_MB/s M5,M6 sw Short_dist
Encoding: 1 8B10B
Baud Rate: 85 (units 100 megabaud)
Length 9u: 0 (units km)
Length 9u: 0 (units 100 meters)
Length 50u: 5 (units 10 meters)
Length 62.5u:2 (units 10 meters)
Length Cu: 0 (units 1 meter)
Vendor Name: BROCADE
Vendor OUI: 00:05:1e
Vendor PN: 57-1000012-01
Vendor Rev: A
Wavelength: 850 (units nm)
Options: 003a Loss_of_Sig,Tx_Fault,Tx_Disable
BR Max: 0
BR Min: 0
Serial No: UAF109320000071
Date Code: 090802
Temperature: 42 Centigrade
Current: 7.656 mAmps
Voltage: 3281.3 mVolts
RX Power: -4.9 dBm (320.6 uWatts)
TX Power: -3.3 dBm (464.9 uWatts)

Some things to note:

Field Meaning
Transciever This shows the capabilities of the SFP. Supported speeds are in Megabytes per second, so this needs to be multiplied by 10 to get the megabits (or divided by 100 for gigabits) - 800 = 8Gbps
Length {microns} This shows the maximum supported transmission distance for the specified type of cable. 9u is Single-Mode (the value is zero because this SFP doesn't support Single Mode), 50u is OM2/OM3/OM4 cable, 62.5u is OM1 cable. The cable specifications are printed on the cable. This doesn't take patching/connector attenuation into account - if you have lots of patch points etc, this transmission distance will be smaller.
Wavelength 850nm will be Short Wave & Multi Mode. Worth double checking when making a new connection, in case the SFPs at each end don't match or the cable isn't supported
Current Very cool - the ports come with a multimeter!

This parameter is important - if the current is zero ("0.000") mAmps,
the SFP in the FC switch is faulty and needs to be replaced.
For Short-Wave SFPs, normal current is typically around 7.000 mAmps
For long wave transcievers, this value will be higher (e.g. 26mAmps for 40Km transciever)

Voltage Standard operating voltage for an SFP is 3.3V (3,300 Millivolts).
If a single SFP shows a low vaultage, there is an electrical fault in
the SFP and it should be replaced. I don't really come across this
scenario too often though - often voltage will be OK when current is
zero.
RX Power This is the measurement of incoming light signal, converted to
electrical power by the SFP (and with a calculated signal). Not sure if
this is peak or average signal (please provide feedback if you know
more), however more importantly, the normal operating specifcations are
well documented by the vendors.

"x.xdBm" = Signal power in decibel milliwatts. A signal power of 0 dBm
is one milliwatt, a signal power of 3 dBm is 2milliwatts, 6 dBm is 4
milliwatts, etc

Typically any signal value below -12db (e.g. -13db, -20db, etc) would indicate a fault outside of the FC Switch SFP.

Some remedial actions to try include:
- Clean the cable connectors - with the correct cleaning apparatus
- Clean the source and destination SFP optical interface - with the correct cleaning apparatus
- Replace the patch cables
- Replace the structured cabling
- Replace the SFP (in the HBA) in the server/storage
- Replace the HBA in the server/storage

TX Power Likewise, this is a measurement of the electrical power which is converted into optical transmission out of the SFP.

Typically, TX Signal Power less than -12dBm would indicate that the
SFP is unable to generate the light signal required for transmission. Or
more simply, the SFP in the FC switch is faulty and needs to be
replace.

Normal SFP Specifications

Each vendor will publish the normal operating specifications for
SFPs. These are also very useful when troubleshooting faults, as
operations outside of these parameters will narrow down the search.

Brocade specs:
http://www.brocade.com/products/all/transceivers/product-details/transceiver-modules/specifications.page

Further Reading

Bill White has put together a really good article on Link Budget Loss
which explains the calculations behind DB Loss for connections, cables,
lengths, etc. Definitely worth a read (it is a short doc, easy to
digest).
http://www.redbooks.ibm.com/abstracts/tips0087.html?Open

Examples of faults

1) This SFP shows zero voltage and an output power of -17dBm. In this situation, the SFP itself is faulty

switch01> sfpshow 8
Identifier: 3 SFP
Connector: 7 LC
Transceiver: 540c404000000000 200,400,800_MB/s M5,M6 sw Short_dist
Encoding: 1 8B10B
Baud Rate: 85 (units 100 megabaud)
Length 9u: 0 (units km)
Length 9u: 0 (units 100 meters)
Length 50u: 5 (units 10 meters)
Length 62.5u:2 (units 10 meters)
Length Cu: 0 (units 1 meter)
Vendor Name: BROCADE
Vendor OUI: 00:05:1e
Vendor PN: 57-1000012-01
Vendor Rev: A
Wavelength: 850 (units nm)
Options: 003a Loss_of_Sig,Tx_Fault,Tx_Disable
BR Max: 0
BR Min: 0
Serial No: UAF1093100007HE
Date Code: 090728
Temperature: 38 Centigrade
Current: 0.000 mAmps
Voltage: 3303.9 mVolts
RX Power: -2.1 dBm (623.5 uWatts)
TX Power: -17.2 dBm (19.2 uWatts)

2) This SFP shows a low incoming power - indicating that it can't detect a light signal from the source. Perhaps there is nothing plugged in, or there is a remote fault in the server HBA or interconnecting optical cables. Either way, there is no light reaching the SFP. The SFP is otherwise healthy

switch03> sfpshow 37
Identifier: 3 SFP
Connector: 7 LC
Transceiver: 540c402000000000 200,400,800_MB/s M5,M6 sw Inter_dist
Encoding: 1 8B10B
Baud Rate: 85 (units 100 megabaud)
Length 9u: 0 (units km)
Length 9u: 0 (units 100 meters)
Length 50u: 5 (units 10 meters)
Length 62.5u:3 (units 10 meters)
Length Cu: 0 (units 1 meter)
Vendor Name: BROCADE
Vendor OUI: 00:05:1e
Vendor PN: 57-1000012-01
Vendor Rev: A
Wavelength: 850 (units nm)
Options: 001a Loss_of_Sig,Tx_Fault,Tx_Disable
BR Max: 0
BR Min: 0
Serial No: UAA209331552692
Date Code: 090811
Temperature: 36 Centigrade
Current: 7.100 mAmps
Voltage: 3314.2 mVolts
RX Power: -24.8 dBm (3.3 uWatts)
TX Power: -2.6 dBm (552.8 uWatts)

FC Switch sfpshow的更多相关文章

  1. Brocade FC Switch 光信号强度查看

    步骤: 1.先运行,porterrshow 查看口状态,然后'statsclear' and 'slotstatsclear' 2.查看:porterrshow,看下port 口的情况 3.收集sup ...

  2. 8.mvc core上传文件

    以下方法均是个人,仅供参考 public interface IFileHelper { /// <summary> /// 保存文件 (返回 Test.jpg) 出错就返回 error| ...

  3. LAN-LAN FREE-SERVER FREE

    注:本文转自http://taotao1240.blog.51cto.com/731446/839702 CU上有个板块叫“存储备份”,可见存储和备份是紧紧联系的.在看备份的时候,经常能看到LAN f ...

  4. 网络存储技术(3) based on zt

    各种术语介绍 一   ESCON    1991 年,IBM公司在S/390服务器中推出了ESCON(Enterprise System Connection)技术.它是基于光纤介质,最大传输速率达1 ...

  5. 网络存储技术介绍(2) ( based on zt)

    http://www.educity.cn/tx/429084.html 互联网技术DAS.NAS和SAN存储方案的比较 按照设备位置和接入方式,磁盘存储可以分为内置存储和外挂存储,外挂存储又分为直连 ...

  6. 深入浅出谈存储:如何区别NAS、SAN与DAS

    深入浅出谈存储:如何区别NAS.SAN与DAS 2012年02月17日16:51 来源:新浪博客 作者:林沛满 编辑:曾智强 查看全文 赞(0)评论(1) 分享 标签: DAS , SAN , 存储系 ...

  7. 存储的一些基本概念(HBA,LUN)

    有些新手总是在各式各样的概念里绕来绕去,弄的不亦乐乎.所以我就把我的一些理解写了下来,供您参考.我说的不局限于任何一种具体产品和厂家,也可能有些说法和某些厂家的说法不一样,但是我觉得应该算的上是本原的 ...

  8. Device Mapper Multipath(DM-Multipath)

    Device Mapper Multipath(DM-Multipath)能够将server节点和存储阵列之间的多条I/O链路配置为一个单独的设备.这些I/O链路是由不同的线缆.交换机.控制器组成的S ...

  9. FPGA与安防领域

    安防主要包括:闭路监控系统.防盗报警系统.楼宇对讲系统.停车厂管理系统.小区一卡通系统.红外周界报警系统.电子围栏.巡更系统.考勤门禁系统.安防机房系统.电子考场系统.智能门锁等等. 在监控系统中,F ...

随机推荐

  1. Python 读取window下UTF-8-BOM 文件

    with open('target.txt', 'r', encoding='utf_8_sig') as fp: print(fp.read())

  2. Server.Transfer 页面之间传值

    server.transfer 特点: 1:大家熟悉的一个特点,用server.transfer 跳转到新页面时,浏览器的地址是没有改变的(因为重定向完全在服务器端进行,浏览器根本不知道服务器已经执行 ...

  3. Alt+Shift+R组合键,用来在一个java文件中批量的重命名变量。

    myeclipse和eclipse集成编译软件,都提供了一个快捷键用来批量重命名变量:Alt+Shift+R组合键,用来在一个java文件中批量的重命名变量.扩展知识:如果想要重命名文件名,又不想手动 ...

  4. Linux之FTP服务

     一.ftp服务 ftp是一个文件传输协议(File Transfer Protocal).lftp相当于一个浏览器,用来向服务器发送请求的. 进行FTP服务的相关操作的时候,要先修改 vim /et ...

  5. 从JDK源码角度看Byte

    Java的Byte类主要的作用就是对基本类型byte进行封装,提供了一些处理byte类型的方法,比如byte到String类型的转换方法或String类型到byte类型的转换方法,当然也包含与其他类型 ...

  6. Linux 释放物理内存和虚拟内存

    1.查看内存占用情况 $ free -m -h total used free shared buff/cache available Mem: .7G .0G .9G 385M 780M .0G S ...

  7. 框架重构:测试中的DateTime.Now

    存在的问题 DateTime.Now是C#语言中获取计算机的当前时间的代码: 但是,在对使用了DateTime.Now的方法进行测试时,由于计算机时间的实时性,期望值一直在变化.如:计算年龄. pub ...

  8. ubuntu安装依赖:0.8.1-1ubuntu4.4 正要被安装以及vm nested解决方法

    刚才在ubuntu10.04虚拟机上安装kvm,提示0.8.1-1ubuntu4.4 正要被安装,查了一下,有一种解决方法: 进入“系统->系统管理->更新管理器->设置”,在弹出的 ...

  9. erl_0016 《硝烟中的erlang》 读书笔记003 “error_logger 爆炸”

    error_logger 爆炸 具有讽刺意味的是,负责错误日志的进程竟然是最为脆弱的之一.在Erlang的缺省安装中,error_logger39负责记录日志(写入磁盘或者发送到网络上),它的速度要比 ...

  10. 逻辑回归 logistic regression(1)逻辑回归的求解和概率解释

    本系列内容大部分来自Standford公开课machine learning中Andrew老师的讲解,附加自己的一些理解,编程实现和学习笔记. 第一章 Logistic regression 1.逻辑 ...