Getting Started with Kettle: Code 007 (unauthorized)

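/*
processRow(..) is the main method of the step.
It processes one row at a time, for example to transform the data or add fields.
Variables can also be declared outside the method.
*/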

public boolean processRow(StepMetaInterface smi, StepDataInterface sdi) throws KettleException {
  if (first) {
    first = false;

    /* TODO: Your code here. (Using info fields)

    FieldHelper infoField = get(Fields.Info, "info_field_name");

    RowSet infoStream = findInfoRowSet("info_stream_tag");

    Object[] infoRow = null;

    int infoRowCount = 0;

    // Read all rows from info step before calling getRow() method, which returns first row from any
    // input rowset. As rowMeta for info and input steps varies getRow() can lead to errors.
    while((infoRow = getRowFrom(infoStream)) != null){

      // do something with info data
      infoRowCount++;
    }
    */
  }

  Object[] r = getRow();

  if (r == null) {
    setOutputDone();
    return false;
  }

  // It is always safest to call createOutputRow() to ensure that your output row's Object[] is large
  // enough to handle any new fields you are creating in this step.
  r = createOutputRow(r, data.outputRowMeta.size());

  /* TODO: Your code here. (See Sample)

  // Read the field's value from the input row; an exception is thrown if the field name does not exist.
  String foobar = get(Fields.In, "a_fieldname").getString(r);

  foobar += "bar";
    
  // Write the value to the output field. If output_fieldname does not exist yet,
  // define it in the Fields table below the code editor.
  get(Fields.Out, "output_fieldname").setValue(r, foobar);

  */
  // Send the row on to the next step.
  putRow(data.outputRowMeta, r);

  return true;
}

/*
Called once when the step starts; typically used to initialize resources.
*/
public boolean init(StepMetaInterface stepMetaInterface, StepDataInterface stepDataInterface) {
  return parent.initImpl(stepMetaInterface, stepDataInterface);
}

/*
Called once when the stream ends; typically used to release resources or to do
post-processing such as sending notifications.
*/
public void dispose(StepMetaInterface smi, StepDataInterface sdi) {
  parent.disposeImpl(smi, sdi);
}
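The three methods above form a lifecycle: init once, processRow repeatedly until it returns false, then dispose once. The following is a minimal plain-Java sketch of that contract, not Kettle code. The class StepLifecycleSketch, its fields, and the run helper are hypothetical stand-ins that run without Kettle on the classpath; an in-memory queue plays the role of getRow(), and Arrays.copyOf stands in for createOutputRow().

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.List;

// Hypothetical stand-in for a step; none of these names are Kettle classes.
class StepLifecycleSketch {
    private final Deque<Object[]> input = new ArrayDeque<>(); // plays the role of the input row set
    final List<Object[]> output = new ArrayList<>();          // collects what putRow() would forward
    private int rowCount;                                     // state declared outside processRow()
    boolean disposed;

    StepLifecycleSketch(List<Object[]> rows) {
        input.addAll(rows);
    }

    // Called once before any row is processed.
    boolean init() {
        rowCount = 0;
        return true;
    }

    // Handles exactly one row per call; returning false ends the loop.
    boolean processRow() {
        Object[] r = input.poll();          // like getRow(): null means no more input
        if (r == null) {
            return false;
        }
        // Like createOutputRow(): copy into an array with room for one new field.
        Object[] out = Arrays.copyOf(r, r.length + 1);
        out[r.length] = out[0] + "bar";     // new field derived from the first input field
        rowCount++;
        output.add(out);                    // like putRow()
        return true;
    }

    // Called once after the last row.
    void dispose() {
        disposed = true;
    }

    int getRowCount() {
        return rowCount;
    }

    // Drives the lifecycle in the order the comments above describe.
    static StepLifecycleSketch run(List<Object[]> rows) {
        StepLifecycleSketch step = new StepLifecycleSketch(rows);
        if (step.init()) {
            while (step.processRow()) {
                // keep going until processRow() reports that the input is exhausted
            }
        }
        step.dispose();
        return step;
    }

    public static void main(String[] args) {
        List<Object[]> rows = new ArrayList<>();
        rows.add(new Object[] {"foo"});
        rows.add(new Object[] {"baz"});
        StepLifecycleSketch step = run(rows);
        for (Object[] row : step.output) {
            System.out.println(Arrays.toString(row));
        }
        System.out.println("rows processed: " + step.getRowCount());
    }
}
```

The counter field shows why declaring variables outside processRow is useful: the method is invoked once per row, so anything that must survive between rows has to live at class level.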

// Gets the value of a variable; defaultValue is returned when the variable is not set.
String variableValue = getVariable(variableName, defaultValue);
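The fallback behaviour of a lookup with a default value can be sketched in plain Java as follows. This is an illustration only: VariableLookupSketch and its map are hypothetical, and the variable names in main are made up. It assumes that a variable which was never set yields the supplied default.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a variable space; not a Kettle class.
class VariableLookupSketch {
    private final Map<String, String> variables = new HashMap<>();

    void setVariable(String name, String value) {
        variables.put(name, value);
    }

    // Returns the stored value, or defaultValue when the variable was never set.
    String getVariable(String name, String defaultValue) {
        String value = variables.get(name);
        return value != null ? value : defaultValue;
    }

    public static void main(String[] args) {
        VariableLookupSketch space = new VariableLookupSketch();
        space.setVariable("TARGET_SCHEMA", "staging");   // made-up variable name
        System.out.println(space.getVariable("TARGET_SCHEMA", "public"));
        System.out.println(space.getVariable("BATCH_SIZE", "1000"));
    }
}
```

Passing a default keeps the step usable when a transformation is run without the variable being defined, instead of having to null-check every lookup.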



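Kettle module

1. What is Kettle

2. Installing Kettle and basic usage

2.1 Installing Kettle

2.2 Basic usage of Kettle

2.2.1 Building a simple transformation example

2.2.2 Tips

3. Common transformation components

3.1 Input module

3.1.1 Generate rows

3.1.2 Table input

3.2 Output module

3.2.1 Table output

3.2.2 Insert/Update

3.2.3 Update

3.3 Transform module

3.3.1 String operations

3.3.2 String replace

3.3.3 Select fields

3.3.4 Set field value

3.3.5 Calculator

3.3.6 Remove duplicate rows

3.3.7 Value mapping

3.4 Utility module

3.4.1 Replace NULL values

3.5 Flow module

3.6 Scripting module

3.6.1 Java code

3.7 Joins module

3.7.1 Merge join

3.7.2 Merge rows

3.8 Statistics module

3.8.1 Aggregation

3.9 Job module

3.9.1 Set variables

3.10 General tips for transformations

3.10.1 Placeholders

3.10.2 Viewing input fields and output fields

3.10.3 Sending data

3.10.4 Changing the number of copies to start

3.10.5 Hops (connection lines)

4. Common components in jobs

4.1 General module

4.1.1 START

4.1.2 Transformation

4.1.3 Job

4.1.4 Set variables

1. Purpose:

4.1.5 Success

4.2 Scripting module

4.2.1 SQL

4.3 File management module

4.3.1 Wait for file

4.4 General tips for jobs

4.4.1 Hops (connection lines)

4.4.2 Parallel execution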
