Skip to contents

Separates locus tags mentioned in full text and expands ranges like YPO1970-74 into new rows

Usage

separate_tags(txt, pattern, column = "text")

Arguments

txt

a table

pattern

regular expression to match locus tags like YPO[0-9-]+ or the locus tag prefix like YPO.

column

column name to search, default "text"

Value

a tibble with locus tag, matching text and rows.

Author

Chris Stubben

Examples

x <- data.frame(row = 1, text = "some genes like YPO1002 and YPO1970-74")
separate_tags(x, "YPO")
#>            id      match row                                   text
#> 1     YPO1002    YPO1002   1 some genes like YPO1002 and YPO1970-74
#> 1.1   YPO1970 YPO1970-74   1 some genes like YPO1002 and YPO1970-74
#> 1.1.1 YPO1971 YPO1970-74   1 some genes like YPO1002 and YPO1970-74
#> 1.1.2 YPO1972 YPO1970-74   1 some genes like YPO1002 and YPO1970-74
#> 1.1.3 YPO1973 YPO1970-74   1 some genes like YPO1002 and YPO1970-74
#> 1.1.4 YPO1974 YPO1970-74   1 some genes like YPO1002 and YPO1970-74