当我期望R中的数据帧输出时,为什么Rccp返回类似列表的输出?

如何解决当我期望R中的数据帧输出时,为什么Rccp返回类似列表的输出?

我正在尝试编写一个.cpp文件,该文件接受一个输入向量,并输出一个两列数据帧,其中包含来自输​​入向量的所有可能组合。我的输出给出了所需的值,但没有给出数据框。如何更改.cpp文件以获取数据帧输出?

我的 possible_combos.cpp 文件如下所示:

#include <Rcpp.h>
using namespace Rcpp;

// [[Rcpp::export]]
GenericVector C_all_combos(GenericVector a) {
  int vec_length = a.size();
  int vec_length_sq = vec_length*vec_length; 
  GenericVector expand_vector_a(vec_length_sq);
  GenericVector expand_vector_b(vec_length_sq);  
  for (int i=0; i<vec_length_sq; i++) { expand_vector_a[i] = a[i / vec_length]; };
  for (int i=0; i<vec_length_sq; i++) { expand_vector_b[i] = a[i % vec_length]; };
  DataFrame my_df = DataFrame::create(Named("v_1") = expand_vector_a,Named("v_2") = expand_vector_b);
return my_df;
}

/*** R
C_all_combos(c(1,"Cars",2.3))
  */

运行Rcpp::sourceCpp("possible_combos.cpp")所需的输出是:

    v_1    v_2
    1       1
    1       Cars
    1       2.3
    Cars    1
    Cars    Cars
    Cars    2.3
    2.3     1
    2.3     Cars
    2.3     2.3

但是我得到的是:

    v_1..1. v_1..1..1 v_1..1..2 v_1..Cars. v_1..Cars..1 v_1..Cars..2 v_1..2.3. v_1..2.3..1 v_1..2.3..2
1       1         1         1       Cars         Cars         Cars       2.3         2.3         2.3
  v_2..1. v_2..Cars. v_2..2.3. v_2..1..1 v_2..Cars..1 v_2..2.3..1 v_2..1..2 v_2..Cars..2 v_2..2.3..2
1       1       Cars       2.3         1         Cars         2.3         1         Cars         2.3

感谢任何提示!我熟悉expand.grid()之类的出色R函数,但想尝试其他方法。

解决方法

主要问题是Rcpp::GenericVectorlist,因此行为与R一致。我在下面显示了此信息,并给出了一种使用模板函数针对每种输入类型的特殊情况的解决方案

#include <Rcpp.h>
using namespace Rcpp;

// essentially your code
// [[Rcpp::export]]
DataFrame C_all_combos(GenericVector a) {
  size_t const vec_length = a.size(),vec_length_sq = vec_length * vec_length; 
  GenericVector expand_vector_a(vec_length_sq),expand_vector_b(vec_length_sq);  
  
  for (size_t i = 0; i < vec_length_sq; i++){ 
    expand_vector_a[i] = a[i / vec_length];
    expand_vector_b[i] = a[i % vec_length];
  }
  
  return DataFrame::create(_["v_1"] = expand_vector_a,_["v_2"] = expand_vector_b,_["stringsAsFactors"] = false);
}

// template function used in the new solution
template<class T>
DataFrame C_all_combos_gen(T a) {
  size_t const vec_length = a.size(),vec_length_sq = vec_length * vec_length; 
  T expand_vector_a(vec_length_sq),_["stringsAsFactors"] = false);
}

// export particular versions
// [[Rcpp::export]]
DataFrame C_all_combos_int(IntegerVector a){
  return C_all_combos_gen<IntegerVector>(a);
}

// [[Rcpp::export]]
DataFrame C_all_combos_char(CharacterVector a){
  return C_all_combos_gen<CharacterVector>(a);
}

// [[Rcpp::export]]
DataFrame C_all_combos_num(NumericVector a){
  return C_all_combos_gen<NumericVector>(a);
}

// [[Rcpp::export]]
DataFrame C_all_combos_log(LogicalVector a){
  return C_all_combos_gen<LogicalVector>(a);
}

我们现在可以运行以下R代码

  1. 说明您的代码中的行为与R一致。
  2. 表明该解决方案有效。
######
# the issue with your code. Repeat your call
C_all_combos(c(1,"Cars",2.3))
#R>   v_1..1. v_1..1..1 v_1..1..2 v_1..Cars. v_1..Cars..1 v_1..Cars..2 v_1..2.3. v_1..2.3..1 v_1..2.3..2 v_2..1. v_2..Cars. v_2..2.3. v_2..1..1 v_2..Cars..1 v_2..2.3..1 v_2..1..2
#R> 1       1         1         1       Cars         Cars         Cars       2.3         2.3         2.3       1       Cars       2.3         1         Cars         2.3         1
#R>   v_2..Cars..2 v_2..2.3..2
#R> 1         Cars         2.3

# amounts to doing the following in R which yields the same
all_combs <- expand.grid(v_1 = c(1,2.3),v_2 = c(1,stringsAsFactors = FALSE)
data.frame(v_1 = as.list(all_combs$v_2),v_2 = as.list(all_combs$v_1))
#R>   v_1..1. v_1..1..1 v_1..1..2 v_1..Cars. v_1..Cars..1 v_1..Cars..2 v_1..2.3. v_1..2.3..1 v_1..2.3..2 v_2..1. v_2..Cars. v_2..2.3. v_2..1..1 v_2..Cars..1 v_2..2.3..1 v_2..1..2
#R> 1       1         1         1       Cars         Cars         Cars       2.3         2.3         2.3       1       Cars       2.3         1         Cars         2.3         1
#R>   v_2..Cars..2 v_2..2.3..2
#R> 1         Cars         2.3

######
# here is a solution with the template function
C_all_combos_R <- function(a){
  if(is.logical(a))
    return(C_all_combos_log(a))
  else if(is.integer(a))
    return(C_all_combos_int(a))
  else if(is.numeric(a))
    return(C_all_combos_num(a))
  else if(is.character(a))
    return(C_all_combos_char(a))
  
  stop("C_all_combos_R not implemented")
}

# it works
C_all_combos_R(c(1,2.3))
#R>    v_1  v_2
#R> 1    1    1
#R> 2    1 Cars
#R> 3    1  2.3
#R> 4 Cars    1
#R> 5 Cars Cars
#R> 6 Cars  2.3
#R> 7  2.3    1
#R> 8  2.3 Cars
#R> 9  2.3  2.3

在C ++等中进行类型检查

您还可以在C ++中进行所有类型检查,避免使用昂贵的整数除法和模运算,并避免像AEF这样的DataFrame构造函数

#include <Rcpp.h>
using namespace Rcpp;

template<int T>
SEXP C_all_combos_gen_two(Vector<T> a) {
  size_t const vec_length = a.size(),vec_length_sq = vec_length * vec_length; 
  Vector<T> expand_vector_a(vec_length_sq),expand_vector_b(vec_length_sq);  
  
  size_t i(0L);
  for(size_t jj = 0L; jj < vec_length; ++jj)
    for(size_t ii = 0L; ii < vec_length; ++i,++ii){
      expand_vector_a[i] = a[jj];
      expand_vector_b[i] = a[ii];
    }
  
  List out = List::create(_["v_1"] = expand_vector_a,_["v_2"] = expand_vector_b);
  
  out.attr("class") = "data.frame";
  out.attr("row.names") = Rcpp::seq(1,vec_length_sq);
  
  return out;
}

// [[Rcpp::export]]
SEXP C_all_combos_cpp(SEXP a){
  switch( TYPEOF(a) ){
  case INTSXP : return C_all_combos_gen_two<INTSXP>(a);
  case REALSXP: return C_all_combos_gen_two<REALSXP>(a);
  case STRSXP : return C_all_combos_gen_two<STRSXP>(a);
  case LGLSXP : return C_all_combos_gen_two<LGLSXP>(a);
  case VECSXP : return C_all_combos_gen_two<VECSXP>(a);
  default: Rcpp::stop("C_all_combos_cpp not implemented");
  }
  
  return DataFrame();
}

新版本产生

C_all_combos_cpp(c(1,2.3))
#R>    v_1  v_2
#R> 1    1    1
#R> 2    1 Cars
#R> 3    1  2.3
#R> 4 Cars    1
#R> 5 Cars Cars
#R> 6 Cars  2.3
#R> 7  2.3    1
#R> 8  2.3 Cars
#R> 9  2.3  2.3

,与AEF's解决方案相比,它是快速的

C_all_combos_cpp(c(1,2.3))

options(digits = 3)
library(bench)
mark(C_all_combos_cpp = C_all_combos_cpp(c(1,2.3)),AEF              = C_all_combos_aef(c(1,check = FALSE)
#R> # A tibble: 2 x 13
#R>   expression            min   median `itr/sec` mem_alloc `gc/sec` n_itr  n_gc total_time  
#R>   <bch:expr>       <bch:tm> <bch:tm>     <dbl> <bch:byt>    <dbl> <int> <dbl>   <bch:tm> 
#R> 1 C_all_combos_cpp   4.05µs   5.49µs   169097.    6.62KB     16.9  9999     1     59.1ms
#R> 2 AEF               15.76µs  16.96µs    57030.    2.49KB     45.7  9992     8    175.2ms

larger_num <- rnorm(100)
mark(C_all_combos_cpp = C_all_combos_cpp(larger_num),AEF              = C_all_combos_aef(larger_num),check = FALSE)
#R> # A tibble: 2 x 13
#R>   expression            min   median `itr/sec` mem_alloc `gc/sec` n_itr  n_gc total_time
#R>   <bch:expr>       <bch:tm> <bch:tm>     <dbl> <bch:byt>    <dbl> <int> <dbl>   <bch:tm>
#R> 1 C_all_combos_cpp   30.9µs   37.7µs    20817.     198KB     88.0  6862    29      330ms
#R> 2 AEF               167.9µs  178.4µs     5558.     199KB     21.5  2585    10      465ms

为完整起见,这是多余的C ++代码

// [[Rcpp::export]]
SEXP C_all_combos_aef(GenericVector a) {
  int vec_length = a.size();
  int vec_length_sq = vec_length * vec_length;
  GenericVector expand_vector_a(vec_length_sq);
  GenericVector expand_vector_b(vec_length_sq);
  for (int i=0; i<vec_length_sq; i++) { expand_vector_a[i] = a[i / vec_length]; };
  for (int i=0; i<vec_length_sq; i++) { expand_vector_b[i] = a[i % vec_length]; };
  
  List my_df = List::create(Named("v_1") = expand_vector_a,Named("v_2") = expand_vector_b);
  
  
  my_df.attr("class") = "data.frame";
  my_df.attr("row.names") = Rcpp::seq(1,vec_length_sq);
  
  return my_df;
}
,

如另一个答案所述,GenericVector是一个列表,并且不能使用Rcpp DataFrame构造函数创建具有List列的DataFrame。但是,您可以创建一个List并将其手动转换为data.frame,并将其返回为SEXP:

#include <Rcpp.h>
using namespace Rcpp;

// [[Rcpp::export]]
SEXP C_all_combos(GenericVector a) {
  int vec_length = a.size();
  int vec_length_sq = vec_length*vec_length;
  GenericVector expand_vector_a(vec_length_sq);
  GenericVector expand_vector_b(vec_length_sq);
  for (int i=0; i<vec_length_sq; i++) { expand_vector_a[i] = a[i / vec_length]; };
  for (int i=0; i<vec_length_sq; i++) { expand_vector_b[i] = a[i % vec_length]; };

  List my_df = List::create(Named("v_1") = expand_vector_a,Named("v_2") = expand_vector_b);


  my_df.attr("class") = "data.frame";
  my_df.attr("row.names") = Rcpp::seq(1,vec_length_sq);

  return my_df;
}

/*** R
C_all_combos(c(1,2.3))
*/

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。

相关推荐


依赖报错 idea导入项目后依赖报错,解决方案:https://blog.csdn.net/weixin_42420249/article/details/81191861 依赖版本报错:更换其他版本 无法下载依赖可参考:https://blog.csdn.net/weixin_42628809/a
错误1:代码生成器依赖和mybatis依赖冲突 启动项目时报错如下 2021-12-03 13:33:33.927 ERROR 7228 [ main] o.s.b.d.LoggingFailureAnalysisReporter : *************************** APPL
错误1:gradle项目控制台输出为乱码 # 解决方案:https://blog.csdn.net/weixin_43501566/article/details/112482302 # 在gradle-wrapper.properties 添加以下内容 org.gradle.jvmargs=-Df
错误还原:在查询的过程中,传入的workType为0时,该条件不起作用 &lt;select id=&quot;xxx&quot;&gt; SELECT di.id, di.name, di.work_type, di.updated... &lt;where&gt; &lt;if test=&qu
报错如下,gcc版本太低 ^ server.c:5346:31: 错误:‘struct redisServer’没有名为‘server_cpulist’的成员 redisSetCpuAffinity(server.server_cpulist); ^ server.c: 在函数‘hasActiveC
解决方案1 1、改项目中.idea/workspace.xml配置文件,增加dynamic.classpath参数 2、搜索PropertiesComponent,添加如下 &lt;property name=&quot;dynamic.classpath&quot; value=&quot;tru
删除根组件app.vue中的默认代码后报错:Module Error (from ./node_modules/eslint-loader/index.js): 解决方案:关闭ESlint代码检测,在项目根目录创建vue.config.js,在文件中添加 module.exports = { lin
查看spark默认的python版本 [root@master day27]# pyspark /home/software/spark-2.3.4-bin-hadoop2.7/conf/spark-env.sh: line 2: /usr/local/hadoop/bin/hadoop: No s
使用本地python环境可以成功执行 import pandas as pd import matplotlib.pyplot as plt # 设置字体 plt.rcParams[&#39;font.sans-serif&#39;] = [&#39;SimHei&#39;] # 能正确显示负号 p
错误1:Request method ‘DELETE‘ not supported 错误还原:controller层有一个接口,访问该接口时报错:Request method ‘DELETE‘ not supported 错误原因:没有接收到前端传入的参数,修改为如下 参考 错误2:cannot r
错误1:启动docker镜像时报错:Error response from daemon: driver failed programming external connectivity on endpoint quirky_allen 解决方法:重启docker -&gt; systemctl r
错误1:private field ‘xxx‘ is never assigned 按Altʾnter快捷键,选择第2项 参考:https://blog.csdn.net/shi_hong_fei_hei/article/details/88814070 错误2:启动时报错,不能找到主启动类 #
报错如下,通过源不能下载,最后警告pip需升级版本 Requirement already satisfied: pip in c:\users\ychen\appdata\local\programs\python\python310\lib\site-packages (22.0.4) Coll
错误1:maven打包报错 错误还原:使用maven打包项目时报错如下 [ERROR] Failed to execute goal org.apache.maven.plugins:maven-resources-plugin:3.2.0:resources (default-resources)
错误1:服务调用时报错 服务消费者模块assess通过openFeign调用服务提供者模块hires 如下为服务提供者模块hires的控制层接口 @RestController @RequestMapping(&quot;/hires&quot;) public class FeignControl
错误1:运行项目后报如下错误 解决方案 报错2:Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.8.1:compile (default-compile) on project sb 解决方案:在pom.
参考 错误原因 过滤器或拦截器在生效时,redisTemplate还没有注入 解决方案:在注入容器时就生效 @Component //项目运行时就注入Spring容器 public class RedisBean { @Resource private RedisTemplate&lt;String
使用vite构建项目报错 C:\Users\ychen\work&gt;npm init @vitejs/app @vitejs/create-app is deprecated, use npm init vite instead C:\Users\ychen\AppData\Local\npm-