Skip to content

The running time of flatxml #2

@HarwayZ

Description

@HarwayZ

The faltxml is a excellent work for parsing XML data. But there is a problem that I encountered during using the fxml_toDataFrame function. This function need a longer time for running multi files with parsing the whole file to a dataframe. such as:

##
elemids <- unique(test$elemid.)
  for(i in elemids){
    yy <- fxml_toDataFrame(test,siblings.of = i)
    if (nrow(yy) >= 1) { 
      j <- j+1
      yy.dat[[j]] <- yy
    print(c(i,j))
  } else next()
}

For a test data with elemids > 300. The loop above need > 1 min. I work with file > 1000, which take more than 10 hours. Would you please give some advice?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions