用筛选功能准备更干净的数据集

12月 9, 2024
DataSpell 2024.3 首次推出 Data Wrangler,这是一款无代码工具,可简化数据清理和准备,从而节省时间并提高工作效率。

继续用英语阅读:

DataSpell by JetBrains is an Integrated Development Environment (IDE) specifically designed for data analysts and engineers. It allows you to write Python scripts, run SQL queries, analyze data in Jupyter notebooks, manage dbt workflows, and connect to databases, all within one seamless interface. DataSpell empowers you to stay focused on exploring insights and delivering high-quality results by combining rich data analysis tools with features like real-time error checking, code quality analysis, and built-in support for best practices like testing and documentation.

The DataSpell 2024.3 update introduces Data Wrangler, a no-code tool designed to streamline data cleaning and preparation, addressing the significant time investment often required in these processes. Initially focused on tabular data, Data Wrangler offers intuitive actions such as filtering, cleaning, and replacing data, alongside advanced statistical functions like min-max scaling, Z-score normalization, and various methods for outlier detection and skewness reduction. Enhancing productivity, the tool generates code for transformations, allowing users to seamlessly incorporate changes into their workflows or export results in versatile formats. With its guided approach and user-friendly interface, Data Wrangler demonstrates JetBrains' commitment to automate and optimize data manipulation tasks for data professionals.

To see a full list of what's new in version 2024.3, see our release notes.

DataSpell is licensed per user as a commercial annual subscription. See our DataSpell licensing page for full details.

DataSpell is available individually or as part of JetBrains All Products Pack.