KDD: Knowledge Discovery and Data Mining (KDD)

Insititute: 复旦大学,中科大

Problem: time series prediction; modelling extreme events;

overlook the existence of extreme events, which result in weak performance when applying them to real time series.

为什么研究extreme events: Extreme events are rare and random, but do play a critical role in many real applications, such as the forecasting of financial crisis and natural disasters.

the weakness of deep learning methods roots in the conventional form of quadratic loss平方损失; --------> this paper use the extreme value theory极值理论 and develop a new form of loss for detecting the future occurrence of extreme events: extreme value loss.

普通预测: quadratic loss

极值预测:extreme value loss

Use memory network to memorize extreme events in historical records. EVL + memory network

Introduction:

time series prediction: classical research topic.

applications: climate prediction and stocks price monitoring;

Statistical methods: autoregressive moving average ARMA; nonlinear autoregressive exogenous NARX;

RNN (LSTM and GRU, gated recurrent unit); Compared with traditional methods, one of the major advantages of RNN structure is that it enables deep non-linear modeling of temporal patterns.

data imbalance and extreme events are harmful to deep learning models????; 值得验证

what are extreme events in time series: extremely small or large values of irregular and rare occurrences.

How to find extreme events? use certain thresholds to label extreme events

the randomness of extreme events have limited degrees of freedom (DOF)

end-to-end framewark.

underfitting and overfitting training problem;

Related work:

extreme events: 极大阈值 + 极小阈值

重尾分布

extreme value theory;

PROBLEMS CAUSED BY EXTREME EVENTS

conclusion: such a model would perform relatively poor if the true distribution of data in series is heavy-tailed.

underfit and overfit phenomenon

PREDICTING TIME-SERIES DATA WITH EXTREME EVENTS

Two factors: memorizing extreme events and modelling tail distribution;   memory network  to memorize the characteristic of extreme events; EVL

Memory network module:

1. Assumption: As pointed out by Ghil et al., extreme events in time-series data often show some form of temporal regularity [19]. 极值事件是有时间规律的,这是前提,如果没有这个前提,那么极值事件的研究是没有意义的。对于自然界的事物,如果没有规律性,那么无法进行建模。

2. windows sequence, wj -------- then use GRU to embed each window into feature space. wj as input,

Qi as the extreme events vector.  add attention mechanism as a part of the weight and update the output.

Extreme value loss:

Optimization: a direct thought is to combine the predicted outputs ot with the prediction of the occurrence of extreme events,

方差损失上增加了一个对于极值事件的惩罚项。

??这只能算作loss function,怎么算作optimization呢?

Effectiveness of Time Series Prediction

针对两个真实数据库(climate and stock),一个伪造数据库上进行了实验,以rooted mean square error作为度量指标,在预测上约准确了0.01-0.08

但是看结果输出图,在极值上的预测结果确实好了。

Supplementary knowledge:

1. 张老师是战略能力很强,但是由于科研不在一线,导致战术可能会出现偏差。

2. 做交叉领域的文章时,i.可以做方法,ii.可以和领域结合,在领域里make sense, 有影响. 但如果四不像,两边都不会要。

3. 科研过程是一个严谨的流程体系,有一定的方法规律可循,不是瞎打一耙。

4. 其实真正重要的还是loss function怎么定,optimization 如何做,以及tailed distribution的一些现象。本质是数学问题,而非学习各种网络框架,最终还是要看deep learning 那本书和微积分。

PP: Modeling extreme events in time series prediction的更多相关文章

  1. PP: A dual-stage attention-based recurrent neural network for time series prediction

    Problem: time series prediction The nonlinear autoregressive exogenous model: The Nonlinear autoregr ...

  2. (转)LSTM NEURAL NETWORK FOR TIME SERIES PREDICTION

    LSTM NEURAL NETWORK FOR TIME SERIES PREDICTION Wed 21st Dec 2016   Neural Networks these days are th ...

  3. (zhuan) LSTM Neural Network for Time Series Prediction

    LSTM Neural Network for Time Series Prediction Wed 21st Dec 2016 Neural Networks these days are the ...

  4. PP: Composite visual mapping for time series visualization

    However: The conventional visual mapping maps each data attribute onto a single visual channel Purpo ...

  5. PP: Multi-Horizon Time Series Forecasting with Temporal Attention Learning

    Problem: multi-horizon probabilistic forecasting tasks; Propose an end-to-end framework for multi-ho ...

  6. PP: Time series clustering via community detection in Networks

    Improvement can be done in fulture:1. the algorithm of constructing network from distance matrix. 2. ...

  7. Autocorrelation in Time Series Data

    Why Time Series Data Is Unique A time series is a series of data points indexed in time. The fact th ...

  8. 【转载】Chaotic Time-Series Prediction

    原文地址:https://cn.mathworks.com/help/fuzzy/examples/chaotic-time-series-prediction.html?requestedDomai ...

  9. PP: Soft-DTW: a differentiable loss function for time-series

    Problem: new loss Label: new loss; Abstract: A differentiable learning loss; Introduction: supervise ...

随机推荐

  1. SAP 对HU做货物移动报错-Only 0 serial numbers entered instead of 30 -

    SAP 对HU做货物移动报错-Only 0 serial numbers entered instead of 30 - 元旦刚过,就收到客户的业务人员报错说,当其对HU做转库(同一个公司代码下工厂到 ...

  2. [VB.NET Tips]创建匿名类型列表

    在调用一些Web API时经常要发送或接收一些数据,在构造Json时可能要创建一些类. 很多都是在调用相关方法才使用到这些类,那使用匿名类型是个不错的选择.如果要传些表结构数据时,就要创建List. ...

  3. mysql 查询出现 "this is incompatible with sql_mode=only_full_group_by"错误解决方案,以及个人rpm方式重装所遇到的问题备份

    一.错误说明        这个错误发生在mysql 5.7 版本及以上版本会出现的问题:        mysql .7版本默认的sql配置是:sql_mode="ONLY_FULL_GR ...

  4. Nginx 和Apache 中的虚拟主机的概念

    在部署环境的时候,有时候会引用到虚拟主机的概念,什么是虚拟主机呢,博主之前一直把虚拟主机的概念没搞清楚,导致在部署的时候,一直动不动就404 ,或者500,或者服务器不通 所以,什么是虚拟主机呢? 虚 ...

  5. mybatis配置---> mybatisConfig.xml 配置加接数据源

    mybatisConfig.xml 配置主要作用是连接数据源配置的前提是在完成mybatis的jar包基础之上进行的同时要确保数据用户名和密码是否正确 一:密码写在 mybatisConfig.xml ...

  6. python格式化输出保留2位小数

    我是小白就不用多说了,学习python做了个练习题,结果运行了一遍,发现输入金额后得到的有很多位小数, 虽然不知道为什么,但是看得很不舒服, 就想到应该把让小数点后只保留2位数 找到了方法:将{0}改 ...

  7. Map的底层实现原理

    一,前言 1.1,概述 ​ 现实生活中,我们常会看到这样的一种集合:IP地址与主机名,身份证号与个人,系统用户名与系统用户对象等,这种一一对应的关系,就叫做映射(K-V).Java提供了专门的集合类用 ...

  8. java设计模式学习笔记--接口隔离原则

    接口隔离原则简述 客户端不应该依赖它不需要的接口,即一个类对另一个类的依赖应建立在最小的接口上 应用场景 如下UML图 类A通过接口Interface1依赖类B,类C通过接口Interface1依赖类 ...

  9. JavaScript中条件分支语句和循环语句的使用,用简洁的代码实现强大功能

    if()  else if()   else() alert() 弹出警告框 prompt() 输入框,确定:返回输入信息:取消:返回null <!DOCTYPE html> <ht ...

  10. 【MySql】数据库连接异常:Thelastpacketsentsuccessfullytotheserverwas0millisecondsago

    参考链接:http://blog.sina.com.cn/s/blog_7540bf5f0102xjpk.html 最近新入职,用了新版的mysql8数据库,结果连接数据库时出现了问题,报了几个异常, ...