Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error in Join solo LTR Copy Number Estimation table #10

Open
zyqzyqzyq opened this issue Oct 22, 2019 · 3 comments
Open

Error in Join solo LTR Copy Number Estimation table #10

zyqzyqzyq opened this issue Oct 22, 2019 · 3 comments

Comments

@zyqzyqzyq
Copy link

Hello, @HajkD,
I keep getting this error in the Join solo LTR Copy Number Estimation table after Finished LTR CNV estimation

Filter hit results...
Estimate CNV for each LTR sequence...
Finished LTR CNV estimation!
Join solo LTR Copy Number Estimation table: nrow(df) = 8387 candidates.
unique(ID) = 8387 candidates.
unique(orf.id) = 8387 candidates.
Error: Column `cn_3ltr` must be length 8387 (the number of rows) or one, not 0
Stop executing 

Then , I checked the intermediate files ,find G_soloLTRs_3ltr.bed and G_solo_LTRs_5ltr.bed have a Slight difference (also find in your source code here)

# write estimated solo LTR loci to LTRpred output folder
            cn2bed(
                solo.ltr.cn$pred_3ltr,
                type = "solo",
                filename = paste0(chopped.foldername,"**_soloLTRs_3ltr**"),
                output = output.path
            )         

            cn2bed(
                solo.ltr.cn$pred_5ltr,
                type = "solo",
                filename = paste0(chopped.foldername,"**_solo_LTRs_5ltr**"),
                output = output.path
            )

So, if this code mistake result in the error ? thank you!

Best,
zyq

@HajkD
Copy link
Owner

HajkD commented Oct 22, 2019

Hi @zyqzyqzyq

Thank you for contacting me.

The code you refer to isn't an error. It simply returns the solo-LTR copies found with BLAST: one file using the 3' LTR as BLAST query against the reference genome and one file using the 5' LTR as BLAST query against the reference genome (see Figure 1 for a scheme of 3' and 5' LTRs).

# write estimated solo LTR loci to LTRpred output folder
# this file output includes BLAST hits found with annotated 3' LTRs as BLAST query 
            cn2bed(
                solo.ltr.cn$pred_3ltr,
                type = "solo",
                filename = paste0(chopped.foldername,"**_soloLTRs_3ltr**"),
                output = output.path
            )  
       
# this file output includes BLAST hits found with annotated 5' LTRs as BLAST query 
            cn2bed(
                solo.ltr.cn$pred_5ltr,
                type = "solo",
                filename = paste0(chopped.foldername,"**_solo_LTRs_5ltr**"),
                output = output.path
            )

The sequences in both files differ, because 3' LTRs and 5' LTRs are not always 100% identical. This is quantified in the LTRpred output column ltr_similarity (see here).

So the error Error: Column cn_3ltr must be length 8387 (the number of rows) or one, not 0 must derive from another cause.

Could you please provide me with an example LTRpred command call in which this error occurs? This will allow me to further troubleshoot.

Many thanks!

@zyqzyqzyq
Copy link
Author

Thank you for your answer, this is my LTRpred command call :

library(LTRpred)
setwd = ("/CAT2/zhengyongqiang/escl/default-wenjianjia")
LTRpred(
genome.file = "/CAT2/zhengyongqiang/escl/default-wenjianjia/aaa/G.arboreum_CRI-A2_assembly_v1.0.fasta",
copy.number.est = TRUE,
cores = 20
)

What result in the error Error: Column cn_3ltr must be length 8387 (the number of rows) or one, not 0 Thank you very much !

@HajkD
Copy link
Owner

HajkD commented Oct 23, 2019

Ok, I will have a look at this and see if I can reproduce this error.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants