前端

发布日期: 2025-09-25

文章字数: 1.7k

阅读时长: 7 分

基于 TensorFlow.js 的前端分类 AI 系统

背景

最近工作中实现了一个需求：Scratch 积木编程实现分类 AI 功能，包括数据处理、神经网络训练、根据模型预测结果和可视化功能。记录一下前端基于 TensorFlow.js 的实现思路。

详细步骤说明

1. 数据准备阶段

flowchart LR
    A[原始数据] --> B[数据清洗]
    B --> C[格式转换]
    C --> D[特征提取]
    D --> E[标签编码]

    B1[去除缺失值] --> B
    B2[去除异常值] --> B
    B3[数据去重] --> B

    C1[数组格式化] --> C
    C2[数据类型转换] --> C

    D1[特征选择] --> D
    D2[特征缩放] --> D
    D3[特征组合] --> D

    E1[One-Hot编码] --> E
    E2[标签映射] --> E

2. 模型构建阶段

flowchart TD
    A[确定问题类型] --> B{分类还是回归?}
    B -->|分类| C[选择分类模型]
    B -->|回归| D[选择回归模型]

    C --> C1[神经网络]
    C --> C2[决策树]
    C --> C3[SVM]
    C --> C4[随机森林]

    D --> D1[线性回归]
    D --> D2[神经网络]
    D --> D3[决策树]

    C1 --> E[设计网络架构]
    C2 --> F[设置参数]
    C3 --> F
    C4 --> F
    D1 --> F
    D2 --> E
    D3 --> F

    E --> G[输入层设计]
    E --> H[隐藏层设计]
    E --> I[输出层设计]

    G --> J[编译模型]
    H --> J
    I --> J
    F --> J

3. 训练优化循环

flowchart TD
    A[初始化模型] --> B[设置训练参数]
    B --> C[开始训练]
    C --> D[前向传播]
    D --> E[计算损失]
    E --> F[反向传播]
    F --> G[更新权重]
    G --> H[记录指标]
    H --> I{达到停止条件?}
    I -->|否| J[下一个批次]
    J --> D
    I -->|是| K[训练完成]

    B --> B1[学习率]
    B --> B2[批次大小]
    B --> B3[训练轮数]
    B --> B4[优化器]

    I --> I1[最大轮数]
    I --> I2[早停条件]
    I --> I3[收敛判断]

    style C fill:#f3e5f5
    style K fill:#e8f5e8

项目的具体实现流程

flowchart TD
    A[用户输入特征数据excel] --> B[setTrainingFeatures
数据格式转换]
    B --> C[用户输入分类标签数据] --> D[setTrainingLabels
标签处理]
    D --> E[用户配置网络结构] --> F[initializeModel
模型初始化]
    F --> G[trainModel
开始训练]

    G --> G1[创建Sequential模型]
    G1 --> G2[添加Dense层]
    G2 --> G3[模型编译
Adam + CrossEntropy]
    G3 --> G4[数据张量化]
    G4 --> G5[One-Hot编码]
    G5 --> G6[model.fit训练]

    G6 --> G7[训练回调]
    G7 --> G8[更新可视化]
    G8 --> G9[训练完成]

    G9 --> H[predictWithModel
模型预测]
    H --> I[输出预测结果]

    style G fill:#f3e5f5
    style I fill:#e8f5e8

系统架构设计

核心技术栈

TensorFlow.js: 提供深度学习能力
D3.js: 实现数据可视化
Canvas API: 渲染决策边界

系统组件架构

type Data = {
  currentGenerationIndex?: number; // 当前训练次数
  features?: number[][]; // 训练特征矩阵
  labels?: string[]; // 训练标签
  classLabels?: string[]; // 分类枚举
  tempModelMap?: {
    // 模型存储
    [modelId: string]: tf.Sequential;
  };
  probabilities?: number[]; // 预测概率
  classification?: string; // 预测结果
};

核心功能实现

1. 智能数据预处理

系统支持灵活的特征数据输入，自动处理列式到行式的数据转换：

setTrainingFeatures(args: BlockArgument) {
  const dynamicArgs = getDynamicArgs<typeof args>(args, 'FEATURES_');

  const featuresList = Object.values(dynamicArgs).map(_features =>
    Array.isArray(_features) ? _features : [Number(_features)]
  );

  // 将列式存储的特征转换为行式存储 [feature1_array, feature2_array] → [[sample1_f1, sample1_f2], ...]
  // [5.1, 4.9, 5.9], [3.5, 3, 3] => [[5.1, 3.5],[4.9, 3.0],[5.9, 3.0]]
  this.blockData.features = featuresList[0].map((_, i) =>
    featuresList.map(arr => Number(arr[i]))
  );
}

2. 可配置神经网络架构

系统允许用户自定义网络结构，支持多隐藏层配置：

async trainModel(args: { MODEL: string }) {
  const model = tf.sequential();

  // 输入层 - 自适应特征维度
  model.add(tf.layers.dense({
    units: features[0].length * 2, // 神经元为特征数量的两倍
    inputShape: [features[0].length],
    activation: 'relu'
  }));

  // 隐藏层 - 用户可配置
  for (let i = 1; i < hiddenSizes.length; i++) {
    model.add(tf.layers.dense({
      units: hiddenSizes[i],
      activation: 'relu'
    }));
  }

  // 输出层 - 自适应类别数
  model.add(tf.layers.dense({
    units: classLabels.length,
    activation: 'softmax'
  }));
}

关键特性：

动态网络结构：根据输入特征和输出类别自适应
ReLU 激活函数：解决梯度消失问题
Softmax 输出：多分类概率分布

3. 高效训练流程

采用现代深度学习最佳实践：

// 模型编译
model.compile({
  optimizer: tf.train.adam(), // Adam优化器
  loss: "categoricalCrossentropy", // 多分类损失函数
  metrics: ["accuracy"], // 准确率监控
});

// One-Hot编码
const label2Index = Object.fromEntries(classLabels.map((v, i) => [v, i]));
const ys = tf.oneHot(
  tf.tensor1d(
    labels.map((l) => label2Index[l]),
    "int32"
  ),
  classLabels.length
);

// 训练执行
await model.fit(xs, ys, {
  epochs: 20,
  verbose: 0,
  shuffle: true, // 样本随机打乱，提高收敛
  validationSplit: 0.2,
  callbacks: {
    onEpochEnd: (epoch, logs) => {
      // 实时可视化更新
      this.updateVisualization(model, epoch);
    },
  },
});

4. 智能预测系统

结果预测

predictWithModel(args: BlockArgument) {
  const inputFeatures = Object.values(dynamicArgs).map(feature => Number(feature));

  const predictions = this.blockData.tempModelMap[args.MODEL].predict(
    tf.tensor2d([inputFeatures])
  ) as tf.Tensor;

  const probabilities = predictions.dataSync(); // 每个结果的概率
  const maxIndex = probabilities.indexOf(Math.max(...probabilities));
  const classification = this.blockData.classLabels[maxIndex];

  this.blockData.probabilities = Array.from(probabilities);
  this.blockData.classification = classification;

  predictions.dispose(); // 内存管理
}

可视化系统

alt text

1. 训练数据散点图

使用 D3.js 创建交互式散点图：

drawScatterPlot(options: {
  features: number[][];
  labels: string[];
  classLabels: string[];
}) {
  const svg = d3.select(container).append('svg');
  const color = d3.scaleOrdinal().domain(classLabels).range(d3.schemeCategory10);

  svg.selectAll('circle')
    .data(features.map((f, i) => ({ x: f[0], y: f[1], label: labels[i] })))
    .enter().append('circle')
    .attr('cx', d => xScale(d.x))
    .attr('cy', d => yScale(d.y))
    .attr('fill', d => color(d.label));
}

2. 实时决策边界渲染

通过 Canvas API 实现高性能决策边界可视化：

drawDecisionBoundary(options: { model: tf.Sequential }) {
  const canvas = document.createElement('canvas');
  const ctx = canvas.getContext('2d');

  // 批量预测网格点
  const allInputs = [];
  for (let py = 0; py < plotHeight; py++) {
    for (let px = 0; px < plotWidth; px++) {
      const fx = xScale.invert(px + margin.left);
      const fy = yScale.invert(py + margin.top);
      allInputs.push([fx, fy, ...featureMeans.slice(2)]);
    }
  }

  // 一次性推理优化性能
  const inputTensor = tf.tensor2d(allInputs);
  const probsTensor = model.predict(inputTensor) as tf.Tensor;
  const probsArr = probsTensor.dataSync();

  // 渲染像素
  this.renderDecisionBoundaryImage(ctx, classIndexMatrix, color);
}

系统优势

直观可视化: 实时展示训练过程和决策边界
模块化设计: 每个功能独立，便于理解
即时反馈: 训练和预测结果立即可见
前端 AI: 完全在浏览器中运行，无需服务器

性能优化策略

1. 内存管理

// 及时释放张量资源
xs.dispose();
ys.dispose();
predictions.dispose();

2. 批量处理

// 批量预测替代逐点预测
const inputTensor = tf.tensor2d(allInputs);
const probsTensor = model.predict(inputTensor) as tf.Tensor;

3. Canvas 优化

// 预缓存颜色避免重复计算
const colorCache = {};
for (let i = 0; i < 20; ++i) {
  const c = d3.color(color(i));
  if (c) colorCache[i] = [c.r, c.g, c.b, 180];
}

.Paly

https://mrw1108.github.io/2025/09/25/ji-yu-tensorflow.js-de-qian-duan-fen-lei-ai-xi-tong/

本博客所有文章除特別声明外，均采用 CC BY 4.0 许可协议。转载请注明来源 .Paly !

WebWorker原理详解

原理到实践，全面解析WebWorker的核心概念与应用场景

2025-12-01 前端

JavaScript WebWorker 多线程

Elpis 学习心得

基于 node + vue3 + webpack5 ，完成一个名为 elpis 的全栈开发框架。Elpis 思想：通过 80% 的抽离和重复工作支持配置化，20% 的定制化工作，减少 CRUD 体力活的比例，让程序员更专注于创造

2025-06-19 前端

实践 node koa vue3 webpack5

基于 TensorFlow.js 的前端分类AI系统

基于 TensorFlow.js 的前端分类 AI 系统

背景

详细步骤说明

1. 数据准备阶段

2. 模型构建阶段

3. 训练优化循环

项目的具体实现流程

系统架构设计

核心技术栈

系统组件架构

核心功能实现

1. 智能数据预处理

2. 可配置神经网络架构

3. 高效训练流程

4. 智能预测系统

可视化系统

1. 训练数据散点图

2. 实时决策边界渲染

系统优势

性能优化策略

1. 内存管理

2. 批量处理

3. Canvas 优化