-
read_bigwig()
now uses cpp11bigwig on CRAN. Theset_strand
param was removed to be more consistent with expected bigWig contents. -
read_gtf()
was deprecated. The rtracklayer package used for this functionality is no longer a dependency of valr due to errors from CRAN AddressSantizer checks of the UCSC c-library code vendored in rtracklayer. -
valr now depends on R >= 4.0.0.
-
Address NOTE on CRAN about Rd link targets.
-
Change maintainer email address.
- Make vdiffr dependency optional during package testing.
- Added
bed_genomecov()
to compute interval coverage across a genome.
read_bed
and related functions now automatically calculate the fields. Use ofn_fields
was deprecated.
-
bed_closest()
now reports all x intervals, even when there are no closest y intervals (e.g. when there is no matching chromosome in y intervals). These intervals are returned populated withNA
for.overlap
,.dist
and y interval locations. -
Reimplemented
bed_closest()
to use binary search rather than an interval tree search. The closest y interval can be missed with the previous search strategy in high depth interval trees. -
Fix off by one error when using
max_dist
argument inbed_cluster()
(#401).
-
Removed
SystemRequirements
from DESCRIPTION to eliminate a NOTE on CRAN. -
bed_coverage()
now reports intervals fromx
with no matching group iny
(#395).
-
Updated intervalTree header to commit f0c4046
-
valr now uses cli for more consistent errors and messages during interactive use.
-
deprecated
genome
argument tobed_makewindows()
was removed.
- Handle
max_dist
for first intervals inbed_cluster()
(#388)
-
Fixed intron score numbering error in
create_introns
(#377 @sheridar) -
Fixed bug in handling of list inputs for
bed_intersect()
(#380 @sheridar) -
Added
read_bigwig
andread_gtf
functions to import data into valr compatible tibbles (#379) -
Kent Riemondy is now maintainer.
- Update to prepare for readr 2.0.0
-
RMariaDB
has replaced the deprecatedRMySQL
package as the database backend. -
valr now imports Rcpp, which should have always been the case, but was masked by its Import by readr, which recently dropped use of Rcpp.
- Fixed rchk unprotect error (#365)
-
trbl_interval()
andtrbl_genome()
customtibble
subclasses have been deemed unnecessary and have been removed from the package. -
coercing
GRanges
to avalr
compatible data.frame now uses thegr_to_bed()
function rather thanas.trbl_interal()
methods.
-
dplyr version < 0.8.0 is no longer supported due to unnecessary code bloat and challenges with handling multiple grouping structures (#359).
-
The
sort_by
argument ofbed_random()
has been changed tosorted
, and will now by default usebed_sort()
to sort the output, rather than rely on naming the sorting columns. Sorting can be suppressed by usingsorted = FALSE
. -
bed_sort()
now uses base R sorting with theradix
method for increased speed. (#353) -
tbls
processed bybed_merge()
orbed_sort()
no longer store eithermerged
orsorted
as attributes, due to these attributes being rarely checked in the codebase and potential sources of unexpected behavior.
-
Fixed
bed_closest()
to prevent erroneous intervals being reported when adjacent closest intervals are present in they
table. (#348) -
Factor columns that are not used for grouping are returned as factors rather than inappropriately being coerced to integer vectors (#360)
- Internal
Rcpp
functions have been reorganized to remove all dependencies ondplyr
C++ functions.
-
Due to internal refactoring of Rcpp functions, only data.frames containing Numeric, Logical, Integer, Character, and List column types are supported. Columns containing Raw, Complex, or other R classes are not supported and will issue an error.
-
Factors are now disallowed from grouping variables in multiset operations to avoid sort order discrepancies, and compatibility with factor handling in
dplyr
v.0.8.0. Factors will now be internally type-converted to character and a warning is issued.
- Changed the behavior of
as.tbl_interval()
to callas_tibble()
only on non-tibble input, which prevents groups from being stripped fromtibble()
input (#338).
-
Added new function,
bed_partition()
, which is similar tobed_merge()
but collapses intervals to elemental intervals rather than the maximal overlapping region.bed_partition()
also can compute summaries of data from overlapping intervals. See examples inbed_partition()
and timings invignette('benchmarks')
@kriemo. -
Several explicit comparisons to the Bioconductor GenomicRanges library are included for users considering using valr. See examples in
as.tbl_interval()
and timings invignette('benchmarks')
.
-
All relevant tests from bedtool2 were ported into valr. Bugs identified in corner cases by new tests were fixed (#328 @raysinesis)
-
bed_jaccard()
now works with grouped inputs (#216) -
Update dplyr header files to v0.7
-
bed_intersect()
and internalintersect_impl
were refactored to enable return of non-intersecting intervals. -
The genome argument to
bed_makewindows()
was deprecated and will produce a warning if used. Also error handling was added to check and warn if there are intervals smaller than the requested window size inmakewindows_impl()
(#312 @kriemo)
-
Fixed off by one error in reported distances from
bed_closest()
. Distances reported now are the same asbedtools closest
behavior (#311). -
bed_glyph()
acceptstrbl_intervals
named other thanx
andy
(#318). -
bed_makewindows()
now returns the number of windows specified bynum_win
when the input intervals are not evenly divisble intonum_win
, consistent withbedtools
behavior. -
The output of
findOverlaps()
is now sorted insubtract_impl()
to prevent reporting intervals that should have been dropped when callingbed_subtract()
(#316 @kriemo)
-
A manuscript describing valr has been published in F1000Research.
-
New S3 generic
as.tbl_interval()
convertsGenomicRanges::GRanges
objects totbl_interval
. -
New
create_tss()
for creating transcription start sites. -
Improve documentation of interval statistics with more complex examples.
bed_sort()
has been de-deprecated to reducearrange
calls in library code.
bed_merge()
now reports start/end columns if spec is provided (#288)
-
New
create_introns()
,create_utrs5()
andcreate_utrs3()
functions for generating features from BED12 files. -
Speed-ups in
bed_makewindows()
(~50x),bed_merge()
(~4x), andbed_flank()
(~4x) (thanks to @kriemo and @sheridar). Thanks to the sponsors of the Biofrontiers Hackathon for the caffeine underlying these improvements.
- intervals from
bed_random()
are now sorted properly.
-
Package dplyr v0.5.0 headers with valr to remove dplyr LinkingTo dependency.
-
bed_intersect()
now accepts multiple tbls for intersection (#220 @kriemo). -
new
tbl_interval()
andtbl_genome()
that wrap tibbles and enforce strict column naming.trbl_interval()
andtrbl_genome()
are constructors that taketibble::tribble()
formatting andis.tbl_interval()
andis.tbl_genome()
are used to check for valid classes.
- intervals returned from
bed_random()
are sorted bychrom
andstart
by default.
- Merge intervals in
bed_jaccard()
and use numeric values for calculation (fixes #204).
- Deprecate
bed_sort()
in favor of usingdplyr::arrange()
explicitly (fixes #134).
-
add
src/init.c
that callsR_registerRoutines
andR_useDynamicSymbols
to address NOTE in r-devel -
Deprecate
dist
parameter inbed_closest()
in favor of using user supplied functions (#182 @kriemo) -
Make
.id
values sequential across chroms inbed_cluster()
output (#171) -
Transfer repository to http://github.com/rnabioco/valr, update links and docs.
-
Move shiny app to new repo (http://github.com/rnabioco/valrdata).
-
Add Kent Riemondy to LICENSE file.
bed_merge()
now merges contained intervals (#177)
-
test / vignette guards for Suggested RMySQL
-
fixed memory leak in absdist.cpp
-
fixed vignette entry names
- initial release on CRAN