Plot frequencies of a variable as bar graph, histogram, box plot etc.

sjp.frq(var.cnt, title = "", weight.by = NULL, title.wtd.suffix = NULL,
sort.frq = c("none", "asc", "desc"), type = c("bar", "dot", "histogram",
"line", "density", "boxplot", "violin"), geom.size = NULL,
geom.colors = "#336699", errorbar.color = "darkred", axis.title = NULL,
axis.labels = NULL, xlim = NULL, ylim = NULL, wrap.title = 50,
wrap.labels = 20, grid.breaks = NULL, expand.grid = FALSE,
show.values = TRUE, show.n = TRUE, show.prc = TRUE,
show.axis.values = TRUE, show.ci = FALSE, show.na = FALSE,
show.mean = FALSE, show.mean.val = TRUE, show.sd = TRUE,
mean.line.type = 2, mean.line.size = 0.5, inner.box.width = 0.15,
inner.box.dotsize = 3, normal.curve = FALSE, normal.curve.color = "red",
normal.curve.size = 0.8, normal.curve.alpha = 0.4, auto.group = NULL,
coord.flip = FALSE, vjust = "bottom", hjust = "center",
y.offset = NULL, prnt.plot = TRUE)

var.cnt Vector of counts, for which frequencies or means will be plotted or printed. character vector, used as plot title. Depending on plot type and function, will be set automatically. If title = "", no title is printed. For effect-plots, may also be a character vector of length > 1, to define titles for each sub-plot or facet. Vector of weights that will be applied to weight all cases. Must be a vector of same length as the input vector. Default is NULL, so no weights are used. Suffix (as string) for the title, if weight.by is specified, e.g. title.wtd.suffix=" (weighted)". Default is NULL, so title will not have a suffix when cases are weighted. Determines whether categories should be sorted according to their frequencies or not. Default is "none", so categories are not sorted by frequency. Use "asc" or "desc" for sorting categories ascending or descending order. Specifies the plot type. May be abbreviated. "bar"for simple bars (default) "dot"for a dot plot "histogram"for a histogram (does not apply to grouped frequencies) "line"for a line-styled histogram with filled area "density"for a density plot (does not apply to grouped frequencies) "boxplot"for box plot "violin"for violin plots size resp. width of the geoms (bar width, line thickness or point size, depending on plot type and function). Note that bar and bin widths mostly need smaller values than dot sizes. User defined color for geoms, e.g. geom.colors = "#0080ff". Color of confidence interval bars (error bars). Only applies to type = "bar". In case of dot plots, error bars will have same colors as dots (see geom.colors). Character vector of length one or two (depending on the plot function and type), used as title(s) for the x and y axis. If not specified, a default labelling is chosen. To set multiple axis titles (e.g. with type = "eff" for many predictors), axis.title must be a character vector of same length of plots that are printed. In this case, each plot gets an own axis title (applying, for instance, to the y-axis for type = "eff"). Note: Some plot types do not support this argument. In such cases, use the return value and add axis titles manually with labs, e.g.: plot.list[[1]] + labs(x = ...) character vector with labels used as axis labels. Optional argument, since in most cases, axis labels are set automatically. Numeric vector of length two, defining lower and upper axis limits of the x scale. By default, this argument is set to NULL, i.e. the x-axis fits to the required range of the data. numeric vector of length two, defining lower and upper axis limits of the y scale. By default, this argument is set to NULL, i.e. the y-axis fits to the required range of the data. numeric, determines how many chars of the plot title are displayed in one line and when a line break is inserted. numeric, determines how many chars of the value, variable or axis labels are displayed in one line and when a line break is inserted. numeric; sets the distance between breaks for the axis, i.e. at every grid.breaks'th position a major grid is being printed. logical, if TRUE, the plot grid is expanded, i.e. there is a small margin between axes and plotting region. Default is FALSE. Logical, whether values should be plotted or not. logical, if TRUE, adds total number of cases for each group or category to the labels. logical, if TRUE (default), percentage values are plotted to each bar If FALSE, percentage values are removed. logical, whether category, count or percentage values for the axis should be printed or not. Logical, if TRUE), adds notches to the box plot, which are used to compare groups; if the notches of two boxes do not overlap, medians are considered to be significantly different. logical, if TRUE, NA's (missing values) are added to the output. Logical, if TRUE, a vertical line in histograms is drawn to indicate the mean value of the variables. Only applies to histogram-charts. Logical, if TRUE (default), the mean value is printed to the vertical line that indicates the variable's mean. Only applies to histogram-charts. Logical, if TRUE, the standard deviation is annotated as shaded rectangle around the mean intercept line. Only applies to histogram-charts. Numeric value, indicating the linetype of the mean intercept line. Only applies to histogram-charts and when show.mean = TRUE. Numeric, size of the mean intercept line. Only applies to histogram-charts and when show.mean = TRUE. width of the inner box plot that is plotted inside of violin plots. Only applies if type = "violin". Default value is 0.15 size of mean dot insie a violin or box plot. Applies only when type = "violin" or "boxplot". Logical, if TRUE, a normal curve, which is adjusted to the data, is plotted over the histogram or density plot. Default is FALSE. Only applies when histograms or density plots are plotted (see type). Color of the normal curve line. Only applies if normal.curve = TRUE. Numeric, size of the normal curve line. Only applies if normal.curve = TRUE. Transparancy level (alpha value) of the normal curve. Only applies if normal.curve = TRUE. numeric value, indicating the minimum amount of unique values in the count variable, at which automatic grouping into smaller units is done (see group_var). Default value for auto.group is NULL, i.e. auto-grouping is off. See group_var for examples on grouping. logical, if TRUE, the x and y axis are swapped. character vector, indicating the vertical position of value labels. Allowed are same values as for vjust aesthetics from ggplot2: "left", "center", "right", "bottom", "middle", "top" and new options like "inward" and "outward", which align text towards and away from the center of the plot respectively. character vector, indicating the horizontal position of value labels. Allowed are same values as for vjust aesthetics from ggplot2: "left", "center", "right", "bottom", "middle", "top" and new options like "inward" and "outward", which align text towards and away from the center of the plot respectively. numeric, offset for text labels when their alignment is adjusted to the top/bottom of the geom (see hjust and vjust). logical, if TRUE (default), plots the results as graph. Use FALSE if you don't want to plot any graphs. In either case, the ggplot-object will be returned as value. ## Value (Insisibily) returns the ggplot-object with the complete plot (plot) as well as the data frame that was used for setting up the ggplot-object (data). ## Note This function only works with variables with integer values (or numeric factor levels), i.e. scales / centred variables with decimals may result in unexpected behaviour. ## See also ## Examples library(sjlabelled) data(efc) # boxplot sjp.frq(efce17age, type = "box")
# histogram
sjp.frq(efc$e17age, type = "hist", show.mean = TRUE) # violin plot sjp.frq(efc$e17age, type = "v")
# bar plot
sjp.frq(efc$e42dep) library(sjmisc) # grouped variable ageGrp <- group_var(efc$e17age)
ageGrpLab <- group_labels(efc$e17age) sjp.frq(ageGrp, title = get_label(efc$e17age), axis.labels = ageGrpLab)
# plotting confidence intervals. expand grid and v/hjust for text labels
sjp.frq(
efc$e15relat, type = "dot", show.ci = TRUE, sort.frq = "desc", coord.flip = TRUE, expand.grid = TRUE, vjust = "bottom", hjust = "left" ) # Simulate ggplot-default histogram sjp.frq(efc$c160age, type = "h", geom.size = 3)
# histogram with overlayed normal curve
sjp.frq(efc\$c160age, type = "h", show.mean = TRUE, show.mean.val = TRUE,
normal.curve = TRUE, show.sd = TRUE, normal.curve.color = "blue",
normal.curve.size = 3, ylim = c(0,50))#> Warning: Removed 4 rows containing missing values (geom_bar).