Author: r Sys.info()[['user']]
Edited: r format(Sys.time(),'%x %X')
#Global settings
Sys.setlocale("LC_CTYPE","Chs")
source("~/Github/recharts/R/echartR.R")
#source("C:/HMSProjects/Data Analytics/R_scripts/CommonFunctions.R")
knitr::opts_chunk$set(message=FALSE,warning=FALSE,results='asis')
本工具来源于百度开发的国内顶尖水平的开源d3-js
可视项目Echarts(Github Repo)。Yang Zhou和Taiyun Wei基于该工具开发了recharts包,经Yihui Xie修改后,可通过htmlwidgets
传递js参数,大大简化了开发难度。但此包开发仍未完成。为了赶紧上手用,基于该包做了一个函数echartR
,用于制作基础Echart交互图。需要R版本>=3.2.0.
This tool originates from a top-tier d3-js
visualization project of China: Baidu Echarts(Github Repo). Yang Zhou and Taiyun Wei developed an experimental R package recharts based on it, which then evoluted into yihui/recharts by Yihui Xie to pass js parameters through htmlwidgets
. The package is sill uder development. I developed a function echartR
based on this package to make basic Echarts interation charts. This function requires R>=3.2.0.
echartR
的主要工作是将Echarts参数封装成list,Yihui Xie的原型函数echart
被用来处理这个list。echart
函数的基本用法如下,除了数据本身,并没有提供其他参数的设置方法。
echartR
majorly packs Echarts parameters into a list while echart
, the prototype function developed by Yihui Xie, is used to parse the list. The basic examples of echart
is as follows, which does not provide parameters entries other than dataset itself.
if (! 'recharts' %in% installed.packages()[,1]){
install.packages('recharts',
repos = c('http://yihui.name/xran', 'http://cran.rstudio.com')
)
}
library(recharts)
echart(iris, ~Sepal.Length, ~Sepal.Width)
echart(iris, ~Sepal.Length, ~Sepal.Width, series = ~Species)
echartR则主要细化了部分参数的设置方法,可以做出如下效果的动图:
- 安装 Install
devtools
(install.packages('devtools')
) - 安装 Insall recharts (
install_github('yihui/recharts')
) - 下载 Download
echartR.R
脚本文件到本地位置 script file to local disk: https://github.com/madlogos/recharts/blob/master/R/echartR.R) - 调用 Source
echartR
脚本 script (假设我将脚本放在本地 suppose I stored the script to local GitHub Repo:source("~/Github/recharts/R/echartR.R")
)
echartR(data, x=NULL, y, z=NULL, series=NULL, weight=NULL,
xcoord=NULL, ycoord=NULL, x1=NULL, xcoord1=NULL, ycoord1=NULL,
type="scatter", stack=FALSE,
title=NULL, subtitle=NULL, title_url=NULL, subtitle_url=NULL,
symbolList=NULL, dataZoom=NULL,
dataRange=NULL, splitNumber=NULL, dataRangePalette=NULL,
xAxis=list(lab=NULL,color=NULL,splitLine=T,banded=F,rotate=0), xlab=NULL,
yAxis=list(lab=NULL,color=NULL,splitLine=T,banded=F,rotate=0), ylab=NULL,
xAxis1=list(lab=NULL,series=NULL,reverse=F,color=NULL,splitLine=T,
banded=F,rotate=0), xlab1=NULL,
yAxis1=list(lab=NULL,series=NULL,reverse=F,color=NULL,splitLine=T,
banded=F,rotate=0), ylab1=NULL,
xyflip=FALSE, AxisAtZero=TRUE, scale=TRUE,
palette='aetnagreen', tooltip=TRUE, legend=TRUE, toolbox=TRUE,
pos=list(title=6, legend=11, toolbox=1, dataZoom=6, dataRange=8, roam=2),
calculable=TRUE, asImage=FALSE,
markLine=NULL, markLinesmooth=NULL, markPoint=NULL,
theme=list(backgroundColor=NULL, borderColor=NULL,
borderWidth=1),
...))
-
data: 数据集 dataset
-
x: x变量,直方图、饼图、环形图、玫瑰图可省略。x variable, only omitable for histograms, pie, ring and rose charts。
-
y: y变量 y variable
-
z: z变量,只接受时间/日期变量,并打开时间轴。z variable, only accept data/time variable to open time axis
-
series: Series(系列)变量 series variable
-
weight: 权重变量,可用于直方图、气泡图等 weight variable, used in histogram, bubble, etc
-
xcoord: 纬度坐标变量,仅用于点/线标注的map。Lattitude variable, only for point/line-marking map.
-
ycoord: 经度坐标变量,仅用于点/线标注的map。Longitude variable, only for point/line-marking map.
-
x1: 备用自变量,仅用于线标注的地图、力导向图和和弦图。Backup x variable, only for line-marking map, force andchord chart.
-
xcoord1: 备用纬度坐标变量,仅用于线标注的map。Backup lattitude variable, only for line-marking map.
-
ycoord1: 备用经度坐标变量,仅用于线标注的map。Backup Longitude variable, only for line-marking map.
-
type: 默认 default
scatter
,可选 options 'scatter', 'bubble', 'bar', 'line', 'linesmooth', 'map', 'k', 'pie', 'ring', 'rose','area', 'areasmooth', 'chordribbon', 'chord', 'force', 'tree', 'treemap', 'wordcloud', 'heatmap', 'histogram', 'funnel', 'pyramid', 'radar', 'radarfill'- 如选择map,则控制项必须写作一个长度为3的向量:c('map',
mapType
,markType
)。mapType可选'world'、'china',或简体中文表示的具体中国地名。markType
为area时,用区块颜色表示效应大小;为point时,用点在地图上做标注;为line时,用线条在地图上做标注。默认为c('map','china','area')。Ifmap
was chosen, the control option should be a vector of length 3: c('map',mapType
,markType
).mapType
could be either 'world' or 'china', of which simplified Chinese names are required for 'china'. WhenmarkType
equals to 'area', the function colors polygons to show the effects; while equals to 'point', it ticks pins on the map; while equals to 'line', it ticks lines on the map.
- 如选择map,则控制项必须写作一个长度为3的向量:c('map',
-
stack: 默认FALSE,是否堆积。用于制作堆积条图、柱图、线图和面积图等直角坐标系图形。Default to FALSE (do not stack). Used in stacked column, bar, line and area chart, etc.
-
title: 标题 title of the figure
-
subtitle: 副标题 subtitle of the figure
-
pos: 图形要素的位置,写作一个列表,用1-12点钟方向指示。标题默认6点位,即底部居中位置。图例、工具箱、值域漫游轴、数据缩放轴、地图漫游控件默认位置分别是11、1、6、8、2点钟位。Position of image elements which are integers 1-12. Title position default to 6 (o'clock), which means bottom middle. Defaults to legend, toolbox, dataRange, dataZoom, roamController are 11, 1, 6, 8, 2, respectively.
-
title_url: 标题链接,url of the title
-
subtitle_url: 副标题链接, url of the subtitle
-
symbolList: 图形标志。可使用数组循环使用,如数组长度小于series水平数,则以最后一个标志填充。如设置为NULL或不设置,则循环显示Echarts默认的标志图形列表:c('circle','rectangle','triangle','diamond','emptyCircle','emptyRectangle','emptyTriangle','emptyDiamond')。也可任意指定'heart','droplet','pin','arrow','star5','star6'等非标图形。设为'none'则不显示。A vector assigning symbols. You can use an array of symbols. If the length of the symbols array is smaller than number of levels of the series, the last symbol will be used to extend the array. If you set symbolList NULL or leave it unset, the function circulates the default symbol list of Echarts: c('circle','rectangle','triangle','diamond','emptyCircle','emptyRectangle','emptyTriangle','emptyDiamond'). You can also assign non-standard symbols, such as 'heart','droplet','pin','arrow','star5','star6', 'star7', etc. When assigned to 'none', no symbols are shown.
-
dataZoom: 数据缩放轴,默认FALSE. The axis to zoom data. Default to FALSE. 如
dataZoom=TRUE
,默认范围为0-100%。也可用一个长度为2的向量控制初始范围,如c(30,70)
显示初始30-70%。IfdataZoom=TRUE
, the default range is 0-100%. You can assign a vector with length of 2 todataZoom
to control the initial range. E.g.,c(30,70)
means from 30% to 70% at the initial view. -
dataRange: 数据范围漫游范围,默认不打开。如要打开,设置dataRange=c(
高值标签
,低值标签
) The range to zoom the data. Default to FALSE. Set dataRange=c(High value label
,Low value label
) to enable dataRange. -
splitNumber: 如打开数据漫游,可指定数据范围切分段数,默认为连续漫游轴(0)。在直方图里,如设定splitNumber,则将数据切分成splitNumber个块。When dataRange is on, assign splitNumber to cut the range into discrete sections. Default to 0 (continuous range). In histogram, if splitNumber is set, the y variable will be cut into splitNumber groups.
-
dataRangePalette: 如打开数据漫游,可单独指定漫游色板(同palette功能),否则采用Echarts默认值。You can independently assign palettes to dataRange (similar to overall palette). Default to NULL (applies echarts defaults).
-
xAxis: x轴参数,写作一个列表,默认为
list(lab=NULL,color=NULL,splitLine=T,banded=F,rotate=0)
,lab为标题,color为颜色,splitLine为分割线,banded为间隔区块,rotate为轴标签旋转(-90 ~ 90)。x Axis parameters in a list, default tolist(lab=NULL,color=NULL,splitLine=T,banded=F,rotate=0)
. -
yAxis: y轴参数,参考xAxis。parameters of y Axis. Refer to xAxis.
-
xlab: 也可忽略xAxis项,单独通过xlab指定x轴标题。如
xAxis[['lab']]
和xlab
冲突,取xlab。You can also omit xAxis, directly assign xAxis title. xlab has a higher priority thanxAxis[['lab']]
. -
ylab: 也可忽略yAxis项,单独通过ylab指定y轴标题。如
yAxis[['lab']]
和ylab
冲突,取ylab。You can also omit yAxis, directly assign yAxis title. ylab has a higher priority thanyAxis[['lab']]
. -
xAxis1: 次级x轴参数,多系列数据时有效。写作一个列表,默认为
list(lab=NULL,series=NULL,reverse=F,color=NULL,splitLine=T,banded=F,rotate=0)
。其中,series指定放于次坐标轴的数据系列,可写系列名或序号,如c(1,2)或c('male','female');reverse设为TRUE时将数据颠倒显示。Secondary x axis, effecitve only if there are multiple series. Written in a list, default tolist(lab=NULL,series=NULL,reverse=F,color=NULL,splitLine=T,banded=F,rotate=0)
.series
set which series to put on secondary x-axis, which can be either a name vector or an index vector, e.g., c(1,2) or c('male','female'). The data is fliped up whenreverse
is set TRUE. -
yAxis1:级次y轴参数,参考xAxis1。
-
xlab1:次级x轴标题,优先级高于xAxis1列表中的lab项。Title of secondary x-axis, with higher priority than
lab
in xAxis1 list, -
ylab1:次级y轴标题,优先级高于yAxis1列表中的lab项。Title of secondary y-axis, with higher priority than
lab
in yAxis1 list, -
xyflip: 默认FALSE,是否翻转坐标轴。Flip x,y-axies. Default to FALSE.
-
AxisAtZero: 默认FALSE,坐标轴是否交叉于零点。Axes cross at zero. Default to FALSE.
-
scale: 默认TRUE,是否基于最大、最小值调整坐标尺度。Rescale the axes based on min and max values. Default to TRUE.
-
palette: 调色板。默认aetnagreen。Overall palette. Default to 'aetnagreen'.
- 种类 Palette names:
- Aetna palettes: 可用 Including 'aetnagreen', 'aetnablue', 'aetnaviolet', 'aetnaorange', 'aetnateal', 'aetnacranberry'
- RColorBrewer palettes: 可用 Including 'BrBG', 'PiYG', 'PRGn', 'PuOr', 'RdBu', 'RdGy', 'RdYlBu', 'RdYlGn', 'Spectral', 'Accent', 'Dark2', 'Paired', 'Pastel1', 'Pastel2', 'Set1', 'Set2', 'Set3', 'Blues', 'BuGn', 'BuPu', 'GnBu', 'Greens', 'Greys', 'Oranges', 'OrRd', 'PuBu', 'PuBuGn', 'PuRd', 'Purples', 'RdPu', 'Reds', 'YlGn', 'YlGnBu', 'YlOrBr', 'YlOrRd'
- ggthemes palettes: 'calc', 'economist', 'economist_white', 'economist_stata','excel', 'exel_fill', 'excel_old', 'excel_new', 'few', 'fivethirtyeight', 'gdocs', 'pander', 'tableau', 'stata', 'stata1','stata1r','statamono', 'tableau20', 'tableau10medium', 'tableaugray', 'tableauprgy', 'tableaublrd', 'tableaugnor', 'tableaucyclic', 'tableau10light', 'tableaublrd12', 'tableauprgy12', 'tableaugnor12','hc','darkunica', 'solarized','solarized_red', 'solarized_yellow', 'solarized_orange','solarized_magenta','solarized_violet', 'solarized_blue', 'solarized_cyan', 'solarized_green', 'wsj', 'wsj_rgby', 'wsj_red_green', 'wsj_black_green', 'wsj_dem_rep', 'colorblind', 'trafficlight'
- 其他Other palettes: 'rainbow', 'terrain', 'topo', 'heat', 'cm'
- 用法 Usage:
- 可以不指定,使用函数默认。Do not set the value and function defaults will be loaded
- 可以
palette=NULL
,使用Echarts默认。Setpalette=NULL
to use Echarts defaults - 可以
palette=palette name
指定上述任何一种色板。Setpalette=palette name
to assign any palette listed above - 规定色板的同时跟个数限定,限定色板颜色的个数,如
palette='calc(3)'
,会从calc色板中随机取3种颜色。Setpalette=palette name(number)
to restrict number of colors within the palette (e.g.,palette='calc(3)'
picks 3 colors out of 'calc' RANDOMLY) - 可以
palette=c(color1,color2,color3,...)
自定义色板向量,向量可以是颜色名,也可以是Hex表达式。可以用colors()
函数查看所有支持的颜色名称,demo(colors)
查看颜色效果。Setpalette=c(color1,color2,color3,...)
to define a palette vector, made of which either color names or Hex expressions. Usecolors()
to check available color names and check the effects usingdemo(colors)
.
- 种类 Palette names:
-
tooltip: 默认TRUE,鼠标指针特效。Mouse tip effects swtich. Default to TRUE.
-
legend: 图例,默认TRUE。可以写作一个长度为2的列表,
list(mode='single/multiple',select=选中的系列)
。如list(mode='single',select='Male')
表示单选,初始仅选中显示Male系列。legend=FALSE
则不显示图例。Whether to show the legend. Default to TRUE. Can also be a list length 2:list(mode='single/multiple',select=vector(...))
. E.g.,list(mode='single',select='Male')
means singular choice, with series 'Male' selected at the initial view.legend=FALSE
closes legend. -
toolbox: 默认TRUE,是否显示工具箱。Echarts Tool box switch. Default to TRUE.
-
calculable: 默认TRUE,是否支持拖曳重算(Echarts专利) Calculable switch (Echarts patent).
-
asImage: 默认FALSE,是否显示为静态图。renderAsImage switch.Deafult to FALSE.
-
markLine: 显示标线,默认不显示。格式写作一个4或8列的数据框或矩阵 Show markline, default to NULL. The grammar is a data.frame or matrix with 4 or 8 columns:
- 缩略格式 Short form :
series name/index * line name Line type Light effect String or number String / NA min/max/average/lm TRUE / FALSE - 完整格式 Full form :
series name/index * line name Value P0 x P0 y P1 x P1 y Light effect String or number String/NA num x val y val x val y val TRUE / FALSE - 例子 Examples
- 如
t(c('male',NA,'average',F))
或t(c(1,NA,'average',F))
都可表示male数据系列平均值标线,只用于line, linesmooth, bar, scatter, bubble。lm
可出线性回归标线,只用于散点或气泡图。如t(c(1,NA,'average',T))
则表示male系列开启炫光特效。E.g., botht(c('male',NA,'average',F))
andt(c(1,NA,'average',F))
refer to an average markline of the series 'male', only available for line, linesmooth, bar, scatter, bubble charts. 'lm' refers to linear regresson markline which is only available for scatters and bubbles.t(c(1,NA,'average',T))
opens light effects of series 'male'. - 如
t(c('male',NA,100,0,5,100,5,F))
表示在'male'数据系列中画一条经过P0(0,5)和P1(100,5)的直线。E.g.,t(c('male',NA,100,0,5,100,5,F))
refers to a markline through P0(0,5) and P1(100,5) as of sereis 'male'. 在line, bar, k, scatter图中,'P0 x','P0 y','P1 x','P1 y'均被理解为直角坐标系的定位。在map中,这些坐标值必须写作经纬度。t(c('male',NA,100,0,5,100,5,T))
可打开male系列的炫光特效。 In line, bar, k and scatter charts, 'P0 x','P0 y','P1 x','P1 y' are comprehended as coordinates. In map charts, these coordinates should be lattitudes and longitudes.t(c('male',NA,100,0,5,100,5,T))
opens light effects of series 'male'.
-
markLinesmooth: 如要用平滑标线,可用markLinesmooth替代markLine。语法完全一样。Used this instead of
markLine
for smooth marklines with totally identical grammar. -
markPoint: 显示标注点,默认不显示。格式写作一个4或6列的数据框或矩阵 Show markpoints, default to NULL. The grammar is a data.frame or matrix with 4 or 6 columns:
- 缩略格式 Short form :
series name/index * Point name Point type Light effect String or number String / NA min/max TRUE / FALSE - 完整格式 Full form :
series name/index * Point name Value P x P y Light effect String or number String / NA num x val y val TRUE / FALSE - 例子 Examples
- 如
t(c('male',NA,'min',F))
或t(c(1,NA,'min',F))
都可表示male数据系列最小值标注,只用于line, linesmooth, bar, scatter, bubble。t('male',NA,'min',T)
则表示male系列开启炫光特效。E.g., botht(c('male',NA,'min',F))
andt(c(1,NA,'min',F))
refer to a min markpoint of the series 'male', only available for line, linesmooth, bar, scatter, bubble charts.t(c('male',NA,'min',T))
opens light effects of series 'male'. - 如
t(c('male',NA,100,0,5,F))
表示在'male'数据系列中标注点P(0,5)。E.g.,t(c('male',NA,100,0,5,F))
refers to a markpoint at P(0,5) as of sereis 'male'. 在line, bar, k, scatter图中,'P x','P y', ... 均被理解为直角坐标系的定位。在map中,这些坐标值必须写作经纬度。t(c('male',NA,100,0,5,T))
可打开male系列的炫光特效。 In line, bar, k and scatter charts, 'P x','P y',... are comprehended as coordinates. In map charts, these coordinates should be lattitudes and longitudes.t(c('male',NA,100,0,5,T))
opens light effects of series 'male'.
-
theme: 主题元素设置,语法为
theme=list(backgroundColor=color name/value, borderColor=color name/value, borderWidth=1)
,color默认均为NULL,width默认为1。
以下为静态图示例,真实d3-js效果请前往>>>>>>>>路由器上建的个人站点,不稳定、较慢,见谅
#Global settings
Sys.setlocale("LC_CTYPE","Chs")
source("~/Github/recharts/R/echartR.R")
#source("C:/HMSProjects/Data Analytics/R_scripts/CommonFunctions.R")
knitr::opts_chunk$set(message=FALSE,warning=FALSE,results='asis')
没有数据系列,因此指定显示第1个系列的均数线(markLine=c(1,NA,'average',F)
)。
echartR(data = iris, x = ~Sepal.Width, y = ~Petal.Width,
type = 'scatter', palette='solarized_magenta',
title = 'Scatter - Sepal Width vs Petal Width',
subtitle = "(source: iris)", xlab = 'Sepal Width', ylab = 'Petal Width',
markLine=t(c(1,'Mean',"average",F)))
指定series,且显示范围从零点开始(scale=FALSE
)。在第2个数据系列(versicolor)中打开最大(max)、最小(min)、均线(average)三根标线;在第1、2、3数据系列中分别标注max、min、max点,且打开第3个系列的眩光特效(markPoint=rbind(c(1,'Max','max',F),c(2,'Min','min',F), c(3,'Max','max',T))
)。
通过xAxis和yAxis设置,去掉了所有纵轴。
echartR(data = iris, x = ~Sepal.Width, y = ~Petal.Width, series = ~Species,
type = 'scatter', palette='wsj_dem_rep', symbolList='circle',
scale=F, xAxis = list(lab='Sepal Width', color='darkgray'),
yAxis = list(lab='Petal Width',color='none'),
title = 'Scatter - Sepal Width vs Petal Width, by Species',
subtitle ='(source: iris)',
markLine=rbind(c(2,'Mean','average',F),c('versicolor','Maximum','max',F),
c(2,'Minimum','min',F)),
markPoint=rbind(c(1,'Max','max',F),c(2,'Min','min',F), c(3,'Max','max',T)))
使用三套非标准图形(箭头、心形、八角星)区分数据系列(c('arrow','heart','star8')
)。命令指定随机选择excel_old色版中的1种颜色,但由于数据集有3个水平(series),颜色个数限制在执行中被忽略。
同时拟合3条回归线(markLine列表写作short form,类型指定为'lm',并开启炫光特效)。
echartR(data = iris, x = ~Sepal.Width, y = ~Petal.Width, series = ~Species,
type = 'scatter', palette='excel_old(1)',
symbolList=c('arrow','heart','star8'),
title = 'Scatter - Sepal Width vs Petal Width, by Species',
subtitle = '(source: iris)', xlab = 'Sepal Width', ylab = 'Petal Width',
markLine = rbind(c('setosa','Linear Reg Coef','lm',T),
c(2,'Linear Reg Coef','lm',T), c(3,'Linear Reg Coef','lm',T)))
大规模散点图(2000个点以上)同样可以秒级出图(但html文件会变得很大)。
x <- rnorm(2001)*2
e <- vector()
for (i in 1:2001) e <- c(e,rnorm(1,0,x[i]+abs(min(x))))
df <- data.frame(x, sin=sin(x)+e/20, cos=cos(x)+e/20)
df <- melt(df,id="x")
echartR(df,x=~x,y=~value,series=~variable,type='scatter',
palette='wsj_red_green',symbolList='circle', AxisAtZero=F,
title='Scatter of 2,001 points', subtitle = 'Large-scale scatter')
气泡图同样来源于散点图。type改为bubble
即可,如不指定weight
变量,函数默认指定y为气泡权重。
echartR(data = iris, x = ~Sepal.Width, y = ~Petal.Width, weight = ~Petal.Length,
type = 'bubble', palette='solarized_cyan',
title = paste("Bubble - Sepal Width vs Petal Width,",
"weighed by Petal Length"),
subtitle = '(source: iris)',
xlab = 'Sepal Width', ylab = 'Petal Width')
用symbolList=c('circle','emptyCircle')
设置第1个数据系列为实心圆、剩下的数据系列以空心圆作为标志图形。palette设置为tableauGnOr(3),只随机取该色板的3种颜色。
再做一个基于全集的线性回归(lm()
)。由于无法显示在任何一个系列上,因此echartR生成了一个新的数据系列。前一节多系列散点图中,三个系列Sepal Width与Petal Width都正相关,而本例中,合并起来两者却呈负相关。这个例子形象地展示了分层分析的重要性。
fit <- lm(Petal.Width~Sepal.Width,iris)
pred <- data.frame(Sepal.Width=c(min(iris$Sepal.Width),max(iris$Sepal.Width)))
yhat <- round(predict(fit,pred),2)
echartR(data = iris, x = ~Sepal.Width, y = ~Petal.Width,
weight = ~Petal.Length, series = ~Species,
symbolList=c('emptyCircle','circle'),
type = 'bubble', palette='tableaugnor(4)',
title = paste('Bubble - Sepal Width vs Petal Width, by Species,',
'weighed by Petal Length'),
subtitle = '(source: iris)', xlab = 'Sepal Width', ylab = 'Petal Width',
markLine=t(c('Reg Coef.','slope',round(fit$coefficients[[2]],2),
pred[1,1],yhat[[1]],pred[2,1],yhat[[2]],F)))
先生成一个汇总数据集dtiris
。
library(reshape2)
dfiris <- iris
dfiris$id <- row.names(iris)
dfiris <- melt(dfiris,id=c("Species","id"))
names(dfiris) <- c("Species","id","Param","Value")
dtiris <- dcast(dfiris[,c(1,3,4)],Species+Param~.,value.var="Value",mean)
names(dtiris) <- c("Species","Param","Mean")
knitr::kable(dcast(dtiris,Param~Species,sum,value.var="Mean"),
format='html',caption="Table: Mean of parameters (iris)")
Param | setosa | versicolor | virginica |
---|---|---|---|
Sepal.Length | 5.006 | 5.936 | 6.588 |
Sepal.Width | 3.428 | 2.770 | 2.974 |
Petal.Length | 1.462 | 4.260 | 5.552 |
Petal.Width | 0.246 | 1.326 | 2.026 |
echartR(data = dtiris, x = ~Param, y = ~Mean, series = ~Species,
type = 'bar', palette='fivethirtyeight',
title = "Column - Parameter Mean by Species",
subtitle = '(source: iris)',legend_pos=3,
xlab = 'Parameter', ylab = 'Mean')
echartR(data = dtiris, x = ~Param, y = ~Mean,
series = ~Species, stack=T,
type = 'bar', palette='pander',
title = "Column - Parameter Mean by Species",
subtitle = '(source: iris)',
xlab = 'Parameter', ylab = 'Mean', legend_pos=3)
堆积与否,也可以简单地通过工具栏的平铺
和堆积
按钮切换。非常强大。
条图和柱图的区别只在于xyflip
开关选项。
echartR(data = dtiris, x = ~Species, y = ~Mean, series = ~Param,
type = 'bar', palette='stata1', xyflip=T,
title = 'Species-specific Mean by Parameters',
subtitle = '(source: iris)', legend_pos=3,
xlab = 'Species', ylab = 'Mean')
palette设为calc的随机4种颜色。打开stack (stack=TRUE
)。
echartR(data = dtiris, x = ~Param, y = ~Mean,
series = ~Species, stack=T, xyflip=T,
type = 'bar', palette='calc(5)',
title = 'Parameter Mean by Species',
subtitle = '(source: iris)',
xlab = 'Parameter', ylab = 'Mean', legend_pos=3)
直方图是柱图的一种特例,只需要指定y变量。可通过splitNumber
指定直方数(默认9)。xyflip
设为TRUE则成为等价的横条图。
echartR(airquality, y=~Temp, type='histogram', splitNumber=13,
palette='pastel2', title='Histogram of temperature',
subtitle = '(source: airquality)')
用mtcars
作为作图数据集。
dtcars <- mtcars
dtcars$car <- row.names(dtcars)
dtcars$transmission <- as.factor(dtcars$am)
levels(dtcars$transmission) <- c("Automatic","Manual")
dtcars$cylinder <- as.factor(dtcars$cyl)
dtcars$carburetor <-as.factor(dtcars$carb)
echartR(dtcars, x = ~transmission, y = ~car, type='pie',
palette='darkunica',
title='Number of cars by transmission',
subtitle = '(source: mtcars)')
环形图是饼图的变形,只需将type
改为'ring'。Echarts中只需要把饼图的半径参数扩展为包含内、外径的长度为2的向量即可。
echartR(dtcars, x = ~cylinder, y = ~car, type='ring',
palette='hc', title='Number of Cylinders',
subtitle = '(source: mtcars)')
echartR(dtcars, x = ~cylinder, y = ~car, type='rose',
palette='colorblind', title='Number of Cylinders',
subtitle = '(source: mtcars)')
打开时间轴(z=~Month
),按月动画显示每日风力数据。
#airquality$Date <- strptime(paste(2015,airquality$Month,airquality$Day,sep="-"),
# format="%F", tz="Asia/Taipei")
airquality$strDate <- with(airquality,paste(2015,Month,Day,sep="-"))
airquality$TempG <- cut(airquality$Temp,breaks=c(0,60,70,80,100))
airquality <- rbind(airquality,c(rep(NA,4),6,31),c(rep(NA,4),9,31))
# Jun, Sep has no 31th, suppl with NAs
echartR(airquality, x = ~Day, y= ~Wind, z=~Month, type='line',
symbolList='none',
palette='tableauBlRd', xlab = 'Days', ylab = 'Wind',
title='Day-specific Wind by month (airquality)',
pos=list(title=12,toolbox=3))
线段平滑(type='linesmooth'
),不显示标志图形(symbolList='none'
)。此外开启双坐标轴(yAxis1
),将系列1、2(Ozone、Solar.R)显示于次y轴。Echarts对缺失值默认不填补,因此有很多断线。需要在数据前处理时自行作插值运算。打开数据缩放,设置初始显示30-70%(dataZoom=c(20,50)
)。
airq <- melt(airquality[,c("Ozone","Solar.R","Wind","Temp","strDate")],
id=c("strDate"))
#airQ <- melt(airquality[,c("Wind","Temp","Date")],
# id=c("Date"))
echartR(airq, x = ~strDate, y= ~value, series= ~variable, type='linesmooth',
symbolList='none', dataZoom=c(20,50),
palette='tableauPrGy', ylab = 'Wind, Temp',xlab="Day",
title='Climate measures by day', subtitle = '(source: airquality)',
yAxis1=list(lab="Ozone, SolorR",series=c(1,2)))
打开时间轴(z=~Month
),按月显示每日天气数据。
airq <- melt(airquality[c("Ozone","Solar.R","Wind","Temp","Month","Day")],
id=c("Month","Day"))
echartR(airq, x= ~Day, y= ~value, series= ~variable, z= ~Month, type='linesmooth',
symbolList='none', pos=list(toolbox=3),
palette='tableauPrGy', xlab = 'Day', ylab = 'Measure',
title='Climate measures by day', subtitle = '(source: airquality)')
设置xAxis和yAxis,关闭所有纵线(color='none'
),并显示横向间条(banded=T
)。
echartR(dfiris, x = ~id, y= ~Value, series= ~Param, type='line',stack=T,
palette='tableauBlRd12', xlab = 'Sample ID', ylab = 'Measure',
yAxis=list(color='none',banded=T),
title='Parameter measures', subtitle = '(source: iris)')
线段平滑,不显示标志图形
echartR(dfiris, x = ~id, y= ~Value, series= ~Param, type='linesmooth',stack=T,
palette='tableauGnOr12', xlab = 'Sample ID', ylab = 'Measure',
symbolList='none', yAxis=list(color='none'),
title='Parameter measures', subtitle = '(source: iris)')
Echarts中,面积图本质上被定义为线图,只需通过itemStyle
参数渲染颜色。
echartR(dfiris, x = ~id, y= ~Value, series= ~Param, type='area',
palette='brbg', xlab = 'Sample ID', ylab = 'Measure',
symbolList='emptyDiamond',title='Parameter measures',
yAxis=list(color='none'), subtitle = '(source: iris)')
线段平滑type='areasmooth'
,并打开数据缩放dataZoom=TRUE
,初始显示40%-80%。
echartR(dfiris, x = ~id, y= ~Value, series= ~Param, type='areasmooth',
palette='PiYG', xlab = 'Sample ID', ylab = 'Measure',
symbolList='none', dataZoom=c(40,80),yAxis=list(color='none'),
title='Parameter measures', subtitle = '(source: iris)')
echartR(dfiris, x = ~id, y= ~Value, series= ~Param, type='area',stack=T,
palette='PRGn', xlab = 'Sample ID', ylab = 'Measure',
symbolList='emptyCircle',yAxis=list(color='none'),
title='Parameter measures', subtitle = '(source: iris)')
线段平滑(type='areasmooth'
)。自定义色板向量。
echartR(dfiris, x = ~id, y= ~Value, series= ~Param, type='areasmooth',stack=T,
palette=c('red','yellow','limegreen','skyblue'),
xlab = 'Sample ID', ylab = 'Measure',
symbolList='none',yAxis=list(color='none'),
title='Parameter measures', subtitle = '(source: iris)')
echartR(dtcars, x = ~carburetor, y = ~car, type='funnel',
palette='RdBu', title='Number of carburetors of cars',
subtitle = '(source: mtcars)',pos=list(legend=12))
金字塔图即逆序漏斗图。
echartR(dtcars, x = ~carburetor, y = ~car, type='pyramid',
palette='RdGy', title='Number of carburetors of cars',
subtitle = '(source: mtcars)',pos=list(legend=12))
雷达图就是极坐标系下的线图/面积图,通过Echarts的polar
参数模块控制。
browser <- as.data.frame(
matrix(c(2001,390,208,15,9,0.5,2002,380,204,20,18,2,2003,370,200,25,27,4.5,
2004,360,196,30,36,8,2005,350,192,35,45,12.5,2006,340,188,40,54,18,
2007,330,184,45,63,24.5,2008,320,180,50,72,32,2009,310,176,55,81,40.5,
2010,300,172,60,90,50,2011,290,168,65,99,60.5,2012,280,164,70,108,72,
2013,270,160,75,117,84.5,2014,260,156,80,126,98,2015,250,152,85,135,112.5,
2016,240,148,90,144,128,2017,230,144,95,153,144.5,2018,220,140,100,162,162,
2019,210,136,105,171,180.5,2020,200,132,110,180,200,2021,190,128,115,189,220.5,
2022,180,124,120,198,242,2023,170,120,125,207,264.5,2024,160,116,130,216,288,
2025,150,112,135,225,312.5,2026,140,108,140,234,338,2027,130,104,145,243,364.5,
2028,120,100,150,252,392),byrow=T,ncol=6)
)
names(browser) <- c("Year","IE8-","IE9+","Safari","Firefox","Chrome")
browser <- melt(browser,id="Year")
echartR(browser, x= ~variable, y= ~value, series= ~Year, type='radar',
palette=paste0('heat(',nlevels(as.factor(browser$Year)),")"),
pos=list(legend=9), symbolList='none',
title='Browser Mkt Occup Ratio', subtitle= 'Totally virtual data')
player <- data.frame(name=c(rep("Philipp Lahm",8),rep("Dani Alves",8)),
para=rep(c("Passing%","Key passing","Comp crosses",
"Crossing%","Successful dribbles",
"Dispossessed","Dribbled past","Fouls"),2),
value=c(89.67, 1.51, 0.97, 24.32, 0.83, 0.86, 1.15, 0.47,
86.62, 2.11, 0.99, 20.78, 1.58, 1.64, 0.9, 1.71))
echartR(player, x= ~para, y= ~value, series= ~name, type='radarfill',
symbolList='none', palette=c('firebrick1','dodgerblue'),
title='Lahm vs Alves', subtitle= '(by @mixedknuts)')
R和Rstudio的中文转码在Windows里一直是老大难。在本文档,Rstudio的默认编码采用CP936编码,文档用UTF-8。如直接在程序内读数据集,用iconv转码后,落单的汉字仍然会显示为乱码。可先下载ChinaGDP.txt到本地,readLines
读入。
最令人orz的地方是Chrome不能正确显示地图,变成了一堆锯齿。
#gdp <- readLines("https://raw.githubusercontent.com/madlogos/Shared_Doc/master/Shared_Documents/ChinaGDP.txt")
gdp <- readLines("ChinaGDP.txt")
dtgdp <- unlist(strsplit(gdp,split=","))
dtgdp <- as.data.frame(t(matrix(dtgdp,nrow=3)),stringsAsFactors=F)
names(dtgdp) <- c('Year','Prov',"GDP")
dtgdp$GDP <- as.numeric(dtgdp$GDP)
knitr::kable(dcast(dtgdp,Prov~Year,sum,value.var="GDP"), format='html',
caption="Table: 2012-2014 GDP of Provinces in China (Million USD)")
Prov | 2012 | 2013 | 2014 |
---|---|---|---|
安徽 | 272666 | 307416 | 339401 |
北京 | 283238 | 314871 | 347249 |
福建 | 312107 | 351347 | 391609 |
甘肃 | 89508 | 101208 | 111273 |
广东 | 904046 | 1003746 | 1103605 |
广西 | 206497 | 232158 | 255144 |
贵州 | 108550 | 129284 | 150599 |
海南 | 45236 | 50805 | 56989 |
河北 | 420990 | 456976 | 478953 |
河南 | 468900 | 519212 | 568786 |
黑龙江 | 216896 | 232237 | 244829 |
湖北 | 352482 | 398316 | 445514 |
湖南 | 350958 | 395622 | 440328 |
吉林 | 189136 | 209608 | 224715 |
江苏 | 856368 | 955269 | 1059587 |
江西 | 205131 | 231520 | 255724 |
辽宁 | 393607 | 437216 | 466018 |
内蒙古 | 251574 | 271788 | 289274 |
宁夏 | 37090 | 41417 | 44802 |
青海 | 29997 | 33925 | 37460 |
山东 | 792289 | 882974 | 967419 |
山西 | 191886 | 203485 | 207714 |
陕西 | 228969 | 259078 | 287978 |
上海 | 319710 | 348804 | 383554 |
四川 | 378183 | 424026 | 464555 |
天津 | 204259 | 232031 | 255950 |
西藏 | 11105 | 13041 | 14990 |
新疆 | 118896 | 134991 | 150812 |
云南 | 163318 | 189255 | 208612 |
浙江 | 549154 | 606609 | 653668 |
重庆 | 180746 | 204364 | 232230 |
dtgdp$Prov <- as.factor(enc2native(dtgdp$Prov))
dtgdp$Year<- as.factor(dtgdp$Year)
开启数据漫游,并定义色板。subtitle_url
链接为本例数据源。通过markPoint
参数也可以在区块标注地图中标注某些点,但效果并不是很好。
top3 <- dcast(dtgdp[dtgdp$Prov %in% c("广东","江苏","山东"),c("Prov","GDP")],
Prov~., sum)
top3 <- cbind(top3,rep("Top3",3),c(32.04,23.16,36.65),c(118.78,113.23,117.00),
rep(T,3))
top3 <- top3[,c(3,1,2,4:6)]
names(top3) <- c("Series","Prov","GDP","Xcoord","Ycoord","Effect")
for (i in 1:2) top3[,i]<-as.character(top3[,i])
echartR(dtgdp, x = ~Prov, y = ~GDP, series= ~Year,
type=c('map','china','area'), palette='gdocs',
title="GDPs of China Provinces, 2012-2014 (Million USD)",
subtitle='(source: Wikipedia)',
subtitle_url="https://raw.githubusercontent.com/madlogos/Shared_Doc/master/Shared_Documents/ChinaGDP.txt",
dataRangePalette=c('red','orange','yellow','green','limegreen'),
dataRange=c('High',"Low"),pos=list(toolbox=3),markPoint=top3)
用世界GDP前20粗糙演示世界地图。漫游轴切分为10档(splitNumber=10
)。
worldgdp <- data.frame(
country=c('United States of America','China','Japan','Germany',
'United Kingdom','France','Brazil', 'Italy','India','Russia',
'Canada','Australia','South Korea','Spain','Mexico','Indonesia',
'Netherlands','Turkey','Saudi Arabia','Switzerland'),
GDP=c(17418925,10380380,4616335,3859547,2945146,2846889,2353025,2147952,
2049501,1857461,1788717,1444189,1416949,1406855,1282725,888648,866354,
806108,752459,712050))
echartR(worldgdp, x = ~country, y = ~GDP, type=c('map','world','area'),
title="Nations with top 20 GDPs, 2014 (Million USD)",
subtitle = '(source: Wikipedia)',
subtitle_url="https://en.wikipedia.org/wiki/List_of_countries_by_GDP_(nominal)",
dataRangePalette='rainbow(5)', dataRange=c("High","Low"),
splitNumber=10, pos=list(toolbox=3))
将China PM2.5数据集下载到本地,readLines
读入。
#chinapm2 <- readLines('https://raw.githubusercontent.com/madlogos/Shared_Doc/master/Shared_Documents/China%20PM2.5.txt')
chinapm25 <- enc2native(unlist(strsplit(paste0(readLines("China PM2.5.txt"), collapse=","),",")))
chinapm25 <- as.data.frame(matrix(chinapm25,byrow=T,ncol=4),stringsAsFactors=F)
names(chinapm25) <- c("City","PM25","ycoord","xcoord")
for (i in 2:4) chinapm25[,i] <- as.numeric(chinapm25[,i])
top5 <- head(chinapm25[order(chinapm25$PM25,decreasing=T),],5)
top5$Name <- "Top 5"
top5$effect <- T
top5 <- top5[,c(5,1,2,4,3,6)]
echartR(chinapm25, x=~City, y=~PM25, xcoord=~xcoord, ycoord=~ycoord,
type=c('map','china','point'),title='PM2.5 in Chinese cities',
subtitle="(source: PM25.in)",subtitle_url="http://pm25.in/",
dataRange=c("High","Low"), pos=list(toolbox=3),
dataRangePalette=c('maroon','red','orange','yellow','lightgreen','green'),
markPoint=top5)
线标注地图比较特殊,数据集要包括y(数量)、x(起点地名)、x1(终点地名)、series(数据系列)、xcoord(起点纬度)、ycoord(起点经度)、xcoord1(终点纬度)、ycoord1(终点经度)。额外标注的话,markLine
数据集格式变为下表(第2列格式为'起点'/'终点'):
series name/index * | Path | Value | P0 x | P0 y | P1 x | P1 y | Light effect |
---|---|---|---|---|---|---|---|
String or number | From/To | num | x val | y val | x val | y val | TRUE / FALSE |
下载南航航班.txt到本地。readLines
读入1-2行。
作点标注的时候,按markPoint
语法将数据集处理成6列:series, name, value, xcoord, ycoord, effect。
flight <- readLines("CZflight.txt")[1]
flight <- as.data.frame(matrix(unlist(strsplit(flight,",")),byrow=T,ncol=2),
stringsAsFactors=F)
names(flight) <- c("From","To")
flightCoord <- readLines("CZflight.txt")[2]
flightCoord <- as.data.frame(matrix(unlist(strsplit(flightCoord,",")),byrow=T,ncol=3),
stringsAsFactors=F)
for (i in 2:3) flightCoord[,i] <- as.numeric(flightCoord[,i])
names(flightCoord) <- c("City","Ycoord","Xcoord")
flight <- merge(flight,flightCoord,by.x="From",by.y="City",all.x=T)
flight <- merge(flight,flightCoord,by.x="To",by.y="City",all.x=T)
Tier1 <- dcast(flight,From~.,length)
Tier1 <- merge(Tier1,unique(flight[,c("From","Xcoord.x","Ycoord.x")]),by="From",all.x=T)
Tier1$effect <- T
Tier1 <- Tier1[,c(1,1:5)]
flight$y <- NA
echartR(flight, x=~From, x1=~To, y=~y, series=~From, xcoord=~Xcoord.x, ycoord=~Ycoord.x,
xcoord1=~Xcoord.y, ycoord1=~Ycoord.y, type=c('map','china','line'),
pos=list(toolbox=3), title="南方航空公司主要航班线路",markPoint=Tier1)
上个例子只作标线而未注数量。也可以对这些标线的数值进行渲染。仍以CZflight.txt为例,读取3-5行(我偷懒了,存在同一个文件里)。本例通过theme
设置背景为灰黑色(#1b1b1b),边界线为暗蓝色royalblue4。legend中设置单选模式,初始选中数据系列'北京'。数据集处理的具体策略是x - ycoord1变量放背景数据集,包含所有迁徙关系;markLine放三个突出系列的数量,markPoint则放终点的数据。最后的效果同时包括了背景标线、突出标线和点标。全部打开炫光特效。
migrate <- as.data.frame(matrix(unlist(strsplit(readLines("CZflight.txt")[3],",")),
byrow=T,ncol=2),stringsAsFactors=F)
names(migrate) <- c("From","To")
migrateCoord <- as.data.frame(matrix(unlist(strsplit(readLines("CZflight.txt")[4],",")),
byrow=T,ncol=3),stringsAsFactors=F)
for (i in 2:3) migrateCoord[,i] <- as.numeric(migrateCoord[,i])
names(migrateCoord) <- c("City","Ycoord","Xcoord")
migrate <- merge(migrate,migrateCoord,by.x="From",by.y="City",all.x=T)
migrate <- merge(migrate,migrateCoord,by.x="To",by.y="City",all.x=T)
migrate$series <- "全国"
# markLine source data
migrateEm <- as.data.frame(matrix(unlist(strsplit(readLines("CZflight.txt")[5],",")),
byrow=T,ncol=3),stringsAsFactors=F)
migrateEm[,3] <- as.numeric(migrateEm[,3])
names(migrateEm) <- c("From","To","NFlights")
#migrate <- merge(migrate,migrateEm,by=c("From","To"),all.x=T)
#migrate$Val[is.na(migrate$Val)] <- "-"
migrate$NFlights <- NA
migrateEm <- merge(migrateEm, migrateCoord, by.x="From",by.y="City",all.x=T)
migrateEm <- merge(migrateEm, migrateCoord, by.x="To",by.y="City",all.x=T)
# markLine dataset (8 col)
markline <- migrateEm[,c(2,1,3,5,4,7,6)]
markline$To <- paste(markline$From,markline$To,sep="/")
markline$effect <- T
# markPoint dataset (6 col)
markpoint <- migrateEm[,c(2,1,3,7,6)]
markpoint$effect <- T
# plot
echartR(migrate, x=~From, x1=~To, y=~NFlights, series=~series, xcoord=~Xcoord.x,
ycoord=~Ycoord.x, xcoord1=~Xcoord.y, ycoord1=~Ycoord.y,
type=c('map','china','line'), palette=c("Gray","Orange","Green","Purple"),
pos=list(toolbox=3), title="百度迁徙(模拟数据)", dataRange=c("High","Low"),
dataRangePalette=c("red","orange",'gold','green1','aquamarine2','royalblue4'),
legend=list(mode='single',select=c('北京')),
markLinesmooth=markline, markPoint=markpoint,
theme=list(backgroundColor="#1b1b1b", borderColor="royalblue4"))
直接knitr时readLines读取网页会报错,此例将百度热词榜网页 的源代码保存为本地txt后再读取。
#baiduhot <- paste0(readLines("http://top.baidu.com/buzz?b=1"),collapse="")
baiduhot <- paste0(readLines("Baidu Hot Words.txt"),collapse="")
hotword <- gsub(".+?<a class=\"list-title\"[^>]+?>([^<>]+?)</a>.+?<span class=\"icon-(rise|fair|fall)\">(\\d+?)</span>.+?","\\1\t\\3\t",baiduhot)
hotword <- enc2native(gsub("^(.+?)\t{4,}.+$","\\1",hotword))
hotword <- t(matrix(unlist(strsplit(hotword,"\t")),nrow=2))
hotword <- as.data.frame(hotword,stringsAsFactors=F)
names(hotword) <- c("Keyword","Freq")
hotword$Freq <- as.numeric(hotword$Freq)
hotword <- hotword[order(hotword$Freq,decreasing=T),]
knitr::kable(hotword, format='html', row.names=F,
caption="Table: Baidu hot words Aug 18 (descending order by Freq)")
Keyword | Freq |
---|---|
霆锋让杨幂骑脖子 | 545761 |
文章澄清牵手门 | 329896 |
幼童在车里热死 | 194836 |
葛天反击刘翔粉丝 | 187192 |
老汉被指强奸入狱 | 185177 |
男子惨遭挤压身亡 | 178935 |
孕妇离奇死亡 | 106216 |
海参阅兵照爆红 | 103025 |
大妈为狗刷卡占座 | 96443 |
女子家门口被性侵 | 85931 |
拒加领导微信被炒 | 83213 |
曝王思聪进娱乐圈 | 76823 |
女客梦中惊醒 | 54671 |
男子大脑消失过半 | 43599 |
中国国际小姐大赛 | 39775 |
全国宜居城市排行 | 33747 |
抗战纪念币发行 | 33563 |
天津爆炸头七 | 28963 |
9月3日双倍工资 | 27298 |
奔驰女拒交停车费 | 22560 |
65岁女子产4胞胎 | 21674 |
英国疯抢状元笔记 | 17742 |
王思聪恋情坐实 | 16641 |
杨栋梁被查 | 16311 |
扎破手指险送命 | 15738 |
曼谷市中心爆炸 | 15444 |
男童跌入开水盆 | 14827 |
城管碰瓷执法 | 13619 |
明星真人秀片酬 | 12979 |
2万卖初夜反遭抢 | 12249 |
扮蝙蝠侠行善身亡 | 11928 |
天津回应毒气传闻 | 11305 |
金星喷范冰冰粉丝 | 9470 |
网曝何猷君将大婚 | 9115 |
袁咏仪儿子上学 | 7282 |
男子当街注射毒品 | 6530 |
世界大学学术排名 | 6488 |
郑州小伙出租云朵 | 6454 |
熊跃辉被调查 | 6232 |
安倍或访华 | 6123 |
天津下雨出现泡沫 | 5884 |
三星上海发布会 | 5137 |
张艺兴已被涂黑 | 4990 |
谎称自杀被救索赔 | 4301 |
唐嫣被指炒作恋情 | 3722 |
幼师用脚猛夹男童 | 3683 |
网曝柏青孙子失联 | 3538 |
女子逼小学生让座 | 3245 |
全球华人富豪榜 | 3010 |
惠若琪错过世界杯 | 2981 |
echartR(hotword[1:30,], x=~Keyword, y=~Freq, type="wordcloud",
title="Baidu Word Search Top 30", palette=NULL,
title_url="http://top.baidu.com/buzz?b=1",
subtitle="Tuesday, Auguest 18, 2015")
全部借用Echarts的例图数据,略有改动。
和弦图需要用到x,x1和y变量。其中x和x1是连接关系,y为权重。
本例用xAxis=list(rotate=90)
将标签转向。
deutsch <- data.frame(player=c('Kruse','Kramer','Neuer','Boateng','Lahm','Kroos',
'Muller','Gotze','Badstuber','Hummels','Weidenfeller',
'Reus','Gundogan'),
hire=c(rep('Monchengladbach',2),rep('Bayern',7),
rep('Dortmund',4)),
weight=rep(1,13),
role=c('Fw','Mf','Gk','Df','Df','Mf','Mf','Fw','Df','Df','Gk','Df','Md'),
stringsAsFactors=F)
echartR(deutsch,x=~player, y=~weight, x1=~hire, type='chord', xAxis=list(rotate=90),
title='Deutsch Soccer Team - Clubs',pos=list(legend=10,title=5))
echartR(deutsch,x=~player, y=~weight, x1=~hire, type='chordribbon',
title='Deutsch Soccer Team - Clubs',pos=list(legend=10,title=5))
也可以对矩阵作图。矩阵数据$x_{i,j}$表示i到j的关系数据。
grpmtx <- matrix(c(11975, 5871, 8916, 2868, 1951, 10048, 2060, 6171, 8010, 16145,
8090, 8045, 1013, 990, 940, 6907),byrow=T,nrow=4)
dimnames(grpmtx) <- list(LETTERS[1:4],LETTERS[1:4])
knitr::kable(as.data.frame(grpmtx),format='html',
caption="Table: Group A-D Mutual Links")
A | B | C | D | |
---|---|---|---|---|
A | 11975 | 5871 | 8916 | 2868 |
B | 1951 | 10048 | 2060 | 6171 |
C | 8010 | 16145 | 8090 | 8045 |
D | 1013 | 990 | 940 | 6907 |
grpmtx <- melt(grpmtx)
echartR(grpmtx,x=~Var1, y=~value, x1=~Var2, type='chordribbon',
title='Group A-D mutual links',pos=list(legend=10,title=5))
下载中东格局数据集到本地,read.csv
读入。
多系列和弦图必须用于两两之间关系。德国国家足球队的例子不适用。
mideast <- read.csv("MidEast.csv",header=T,stringsAsFactors=F)
names(mideast[,2:16]) <- mideast$X
mideast <- melt(mideast,id="X")
mideast$attd <- gsub("(.+)/\\d+","\\1",mideast$value)
mideast$attd[mideast$attd==''] <- NA
mideast$attd <- factor(mideast$attd,levels=unique(mideast$attd))
mideast$wt <- gsub(".+/(\\d+)","\\1",mideast$value)
mideast$wt <- as.numeric(mideast$wt)
echartR(mideast,x=~X, y=~wt, x1=~variable, series=~attd, type='chordribbon',
title='Relationship in Mid-east', subtitle='(source: Caixin)',
palette=c('#FBB367','#80B1D2','#FB8070','#CC99FF','#B0D961','#99CCCC','#BEBBD8',
'#FFCC99','#8DD3C8','#FF9999','#CCEAC4','#BB81BC','#FBCCEC','#CCFF66',
'#99CC66','#66CC66','#FF6666','#FFED6F','#ff7f50','#87cefa'),
pos=list(legend=10,title=5,toolbox=2),
subtitle_url='http://international.caixin.com/2013-09-06/100579154.html')
下载Network.txt到本地,readLines
读入。
将x整理成关系起点/关系终点/关系名称
格式,x1整理成关系起点赋值/关系终点赋值
格式,series整理成起点节点类型/终点节点类型
格式。
netNode <- as.data.frame(matrix(unlist(strsplit(readLines('Network.txt')[1],",")),
byrow=T,ncol=3),stringsAsFactors=F)
names(netNode) <- c("name","category","value")
netLink <- as.data.frame(matrix(unlist(strsplit(readLines('Network.txt')[2],",")),
byrow=T,ncol=4),stringsAsFactors=F)
names(netLink) <- c("from","to","relation","weight")
netLink$weight <- as.numeric(netLink$weight)
netLink <- merge(netLink,netNode,by.x="from",by.y="name",all.x=T)
netLink <- merge(netLink,netNode,by.x="to",by.y="name",all.x=T)
rm(netNode)
netLink$category.x <- factor(netLink$category.x,
levels=c("Root","Node 1","Node 2","Node 3","Leaf"))
# Order the categories
netLink <- netLink[order(netLink$category.x),]
netLink$Link <- with(netLink,paste(from,to,relation,sep="/"))
netLink$NodeVal <- with(netLink,paste(value.x,value.y,sep="/"))
netLink$Series <- with(netLink,paste(category.x,category.y,sep="/"))
按此格式读入数据集作图。
echartR(netLink,x=~Link,y=~weight,x1=~NodeVal,series=~Series,type='force',
title='绍兴俞氏社会网络',pos=list(title=5,legend=10),
palette=c('brown','green4','green3','lawngreen','olivedrab1'))
K线图必须将日期整理在x,开盘、收盘、最低、最高标签整理在x1(且按该顺序排序),价格整理在y。
stockidx <- data.frame(
date=c('1/24','1/25','1/28','1/29','1/30','1/31','2/1','2/4','2/5','2/6','2/7',
'2/8','2/18','2/19','2/20','2/21','2/22','2/25','2/26','2/27','2/28','3/1',
'3/4','3/5','3/6','3/7','3/8','3/11','3/12','3/13','3/14','3/15','3/18',
'3/19','3/20','3/21','3/22','3/25','3/26','3/27','3/28','3/29','4/1','4/2',
'4/3','4/8','4/9','4/10','4/11','4/12','4/15','4/16','4/17','4/18','4/19',
'4/22','4/23','4/24','4/25','4/26','5/2','5/3','5/6','5/7','5/8','5/9',
'5/10','5/13','5/14','5/15','5/16','5/17','5/20','5/21','5/22','5/23',
'5/24','5/27','5/28','5/29','5/30','5/31','6/3','6/4','6/5','6/6','6/7',
'6/13'),
open=c(2320.26,2300,2295.35,2347.22,2360.75,2383.43,2377.41,2425.92,2411,2432.68,
2430.69,2416.62,2441.91,2420.26,2383.49,2378.82,2322.94,2320.62,2313.74,
2297.77,2322.32,2364.54,2332.08,2274.81,2333.61,2340.44,2326.42,2314.68,
2309.16,2282.17,2255.77,2269.31,2267.29,2244.26,2257.74,2318.21,2321.4,
2334.74,2318.58,2299.38,2273.55,2238.49,2229.46,2234.9,2232.69,2196.24,
2215.47,2224.93,2236.98,2218.09,2199.91,2169.63,2195.03,2181.82,2201.12,
2236.4,2242.62,2187.35,2213.19,2203.89,2170.78,2179.05,2212.5,2227.86,
2242.39,2246.96,2228.82,2247.68,2238.9,2217.09,2221.34,2249.81,2286.33,
2297.11,2303.75,2293.81,2281.45,2286.66,2293.4,2323.54,2316.25,2320.74,
2300.21,2297.1,2270.71,2264.43,2242.26,2190.1),
close=c(2302.6,2291.3,2346.5,2358.98,2382.48,2385.42,2419.02,2428.15,2433.13,
2434.48,2418.53,2432.4,2421.56,2382.91,2397.18,2325.95,2314.16,2325.82,
2293.34,2313.22,2365.59,2359.51,2273.4,2326.31,2347.18,2324.29,2318.61,
2310.59,2286.6,2263.97,2270.28,2278.4,2240.02,2257.43,2317.37,2324.24,
2328.28,2326.72,2297.67,2301.26,2236.3,2236.62,2234.4,2227.74,2225.29,
2211.59,2225.77,2226.13,2219.55,2206.78,2181.94,2194.85,2193.8,2197.6,
2244.64,2242.17,2184.54,2218.32,2199.31,2177.91,2174.12,2205.5,2231.17,
2235.57,2246.3,2232.97,2246.83,2241.92,2217.01,2224.8,2251.81,2282.87,
2299.99,2305.11,2302.4,2275.67,2288.53,2293.08,2321.32,2324.02,2317.75,
2300.59,2299.25,2272.42,2270.93,2242.11,2210.9,2148.35),
low=c(2287.3,2288.26,2295.35,2337.35,2347.89,2371.23,2369.57,2417.58,2403.3,2427.7,
2394.22,2414.4,2415.43,2373.53,2370.61,2309.17,2308.76,2315.01,2289.89,
2292.03,2308.92,2330.86,2259.25,2270.1,2321.6,2304.27,2314.59,2296.58,
2264.83,2253.25,2253.31,2250,2239.21,2232.02,2257.42,2311.6,2314.97,2319.91,
2281.12,2289,2232.91,2228.81,2227.31,2220.44,2217.25,2180.67,2215.47,2212.56,
2217.26,2204.44,2177.39,2165.78,2178.47,2175.44,2200.58,2232.26,2182.81,
2184.11,2191.85,2173.86,2161.14,2179.05,2212.5,2219.44,2235.42,2221.38,
2225.81,2231.36,2205.87,2213.58,2210.77,2248.41,2281.9,2290.12,2292.43,
2274.1,2270.25,2283.94,2281.47,2321.17,2310.49,2299.37,2294.11,2264.76,
2260.87,2240.07,2205.07,2126.22),
high=c(2362.94,2308.38,2346.92,2363.8,2383.76,2391.82,2421.15,2440.38,2437.42,
2441.73,2433.89,2443.03,2444.8,2427.07,2397.94,2378.82,2330.88,2338.78,
2340.71,2324.63,2366.16,2369.65,2333.54,2328.14,2351.44,2352.02,2333.67,
2320.96,2333.29,2286.33,2276.22,2312.08,2276.05,2261.31,2317.86,2330.81,
2332,2344.89,2319.99,2323.48,2273.55,2246.87,2243.95,2253.42,2241.34,
2212.59,2234.73,2233.04,2242.48,2226.26,2204.99,2196.43,2197.51,2206.03,
2250.11,2245.12,2242.62,2226.12,2224.63,2210.58,2179.65,2222.81,2236.07,
2240.26,2255.21,2247.86,2247.67,2250.85,2239.93,2225.19,2252.87,2288.09,
2309.39,2305.3,2314.18,2304.95,2292.59,2301.7,2322.1,2334.33,2325.72,
2325.53,2313.43,2297.1,2276.86,2266.69,2250.63,2190.1))
stockidx <- melt(stockidx,id="date")
stockidx <- stockidx[order(stockidx$variable),]
echartR(stockidx,x=~date,x1=~variable,y=~value,type='k',title='2013年上半年上证指数',
dataZoom=c(0,50))
仪表盘的数据集比较简单,标签列在x,数值在y,单位在x1。如果要自定义表盘颜色,可以在x列标记'axisStyle',则x1列为颜色(Hex值或颜色名),y为切断点。此外可单独通过splitNumber
制定刻度数量。
gauge <- data.frame(x=c("完成率",rep("axisStyle",3)),
unit=c("%","forestgreen","orange","red2"),KPI=c(74,0.5,0.8,1))
echartR(gauge,x=~x,y=~KPI,x1=~unit,type='gauge')
echartR
比较接近于lattice
,而不是ggplot2
图层叠加的思路。每次使用都要一次性编辑所有需要的参数;- 函数本身写得比较笨重,技术还不行;
- 当没有数据系列的时候,如显示图例,会被拆成一串单字节字符;
- 未实现的功能:
- candlestick(k)还没有开发;Tree不打算开发了;
- 仍然不支持时间格式的坐标轴(series中数据结构有问题);
- tooltip不够智能;
- 进阶功能(包括多图联动等)仍未开发;
- 不支持多类型混搭;
- 注意:如要改进,千万不要在函数代码中
set.seed()
,这会全局锁定种子数,导致knitr时每做一图都按该种子随机化htmlwidget id
。最终的文档中,某些图可能会无法按指定代码出图,而是重复其他的图(串id)。