Frosts

pandas-inspired Excel scripting for Office Scripts

Project maintained by JoeyRussoniello Hosted on GitHub Pages — Theme by mattgraham

🔗 Merging

Now that you’ve learned how to aggregate and summarize data within a DataFrame, you may often need to combine multiple DataFrames to enrich your analysis. Whether you are combining datasets based on shared keys, appending rows, or joining columns, the ability to merge DataFrames is essential in data processing.

frosts provides two powerful methods for combining DataFrames: .merge() for key-based joins, and .concat() for stacking data row-wise.

.merge()
.validate_keys()
.concat()

`.merge(other: DataFrame, on: string[], how: "inner" | "left" | "outer" = "inner")`

Merges the current DataFrame with another one based on key columns, similar to SQL joins.

Parameters

other: Another DataFrame to merge with
on: Column name(s) used as the join key(s).
how: Type of join to perform.
- "inner" (default): Only keeps rows with matches in both DataFrames.
- "left": Keeps all rows from the current DataFrame, matching where possible from other.
- "outer": Keeps all rows from both DataFrames, filling in null for missing values.

Examples

Let’s say we have two DataFrames

employees

EmployeeID	Name	Department
1	Alice	HR
2	Bob	Engineering
3	Charlie	Marketing
4	Diana	Sales

salaries

EmployeeID	Salary
2	90000
3	75000
4	68000
5	72000

Inner Join

employees.merge(salaries,["Employee ID"],"inner");

EmployeeID	Name	Department	Salary
2	Bob	Engineering	90000
3	Charlie	Marketing	75000
4	Diana	Sales	68000

Only rows where EmployeeID exists in both DataFrames will appear.

Left Join

employees.merge(salaries,["Employee ID"], "left");

EmployeeID	Name	Department	Salary
1	Alice	HR	null
2	Bob	Engineering	90000
3	Charlie	Marketing	75000
4	Diana	Sales	68000

Keeps all rows from employees, adds data from salaries when possible

Outer Join

EmployeeID	Name	Department	Salary
1	Alice	HR	null
2	Bob	Engineering	90000
3	Charlie	Marketing	75000
4	Diana	Sales	68000
5	null	null	72000

Keep all rows from both DataFrames, filling missing values with null.

Joining with Multiple Columns

You can also join on multiple shared keys, for example the following join

const df1 = new DataFrame([
  ["EmployeeID", "Date",     "HoursWorked"],
  [101,          "2024-01-01", 8],
  [101,          "2024-01-02", 7],
  [102,          "2024-01-01", 6]
]);

const df2 = new DataFrame([
  ["EmployeeID", "Date",     "Project"],
  [101,          "2024-01-01", "Alpha"],
  [101,          "2024-01-02", "Beta"],
  [103,          "2024-01-01", "Gamma"]
]);

const result = df1.merge(df2, ["EmployeeID", "Date"], "inner");

Would result in this table

EmployeeID	Date	HoursWorked	Project
101	2024-01-01	8	Alpha
101	2024-01-02	7	Beta

This type of join is especially useful when working with time series data or logs and need to match both an ID and a timestamp, as shown here.

`.validate_key(key: DataFrame, on: [string, string] | string, errors: "raise" | "return" = "raise")`

Checks whether all join key values in the current DataFrame exist in the corresponding column of another DataFrame.

Parameters

key: A reference DataFrame (typically the lookup table or foreign key source).
on: Column(s) to match.
- If a string, the same column name is used in both DataFrames.
- If a tuple [leftCol, rightCol], matches this[leftCol] to key[rightCol].
errors: What to do if mismatches are found.
- “raise” (default): Throws an error with all unmatched values.
- “return”: Returns an array of unmatched values (allows for graceful handling).

Returns

If errors = "return": An array of missing values.
If errors = "raise": Throws an error and stops execution if mismatches are found.
If all keys are valid: Returns void.

✅ Use When:

You’re about to perform a .merge() and want to verify key consistency.
You’re validating referential integrity between two datasets.
You need to detect and handle unexpected join mismatches (e.g., early fail or alert system).

Examples

1) Checking keys with the same column names

df.check_key(referenceTable, "ProjectID");

Validates that all ProjectIDs in df exist in referenceTable.

2) Checking keys with different column names

df.check_key(referenceTable, ["UserID", "StaffID"]);

Checks if every UserID in df has a match in the StaffID column of referenceTable.

3) Failing fast on missing keys

df.check_key(referenceTable, "ProjectID", "raise");

If any ProjectID is not found, would throw

KeyIncompleteError: The following values were not found in the selected key
[1234, 4567, 8910]

4) Graceful fallback

const missing = df.check_key(referenceTable, "ProjectID", "return");
if (missing?.length) {
  // Return here instead of crashing main
  // possibly send to PowerAutomate/logic app
  return JSON.stringify(missing)  
  //Output: "[1234,4567,8910]"
}

`.concat(other:DataFrame, columnSelection: ("inner"|"outer"|"left") = "outer")`

The .concat() method appends the rows of the other DataFrame to the current one. It aligns columns based on the columnSelection mode

"outer" includes all columns from both DataFrames, filling empties with null (default)
"inner" includes only shared columns
'left' includes all columns from the first DataFrame, filling missing values in the second with null

Concatenation Exampels

For the inputs tables

df1

Name	Age	Department
Alice	30	Sales
Bob	25	Marketing

df2

Name	Age	Location
Carol	28	New York
Dave	35	Chicago

Outer Concatenation (default)

df1.concat(df2)

Name	Age	Department	Location
Alice	30	Sales	null
Bob	25	Marketing	null
Carol	28	null	New York
Dave	35	null	Chicago

Inner Concatenation

df1.concat(df2, "inner");

Name	Age
Alice	30
Bob	25
Carol	28
Dave	35

Left Concatenation

df1.concat(df2, "left");

Name	Age	Department
Alice	30	Sales
Bob	25	Marketing
Carol	28	null
Dave	35	null

✅ With your datasets successfully combined, the final step is often saving or sharing your results—let’s look at how to export and import DataFrames using Excel, CSV, and JSON.

Continue to Input/Output

Return to API Reference

Frosts

🔗 Merging

Table of Contents

.merge(other: DataFrame, on: string[], how: "inner" | "left" | "outer" = "inner")

Parameters

Examples

Inner Join

Left Join

Outer Join

Joining with Multiple Columns

.validate_key(key: DataFrame, on: [string, string] | string, errors: "raise" | "return" = "raise")

Parameters

Returns

Examples

.concat(other:DataFrame, columnSelection: ("inner"|"outer"|"left") = "outer")

Concatenation Exampels

Outer Concatenation (default)

Inner Concatenation

Left Concatenation

`.merge(other: DataFrame, on: string[], how: "inner" | "left" | "outer" = "inner")`

`.validate_key(key: DataFrame, on: [string, string] | string, errors: "raise" | "return" = "raise")`

`.concat(other:DataFrame, columnSelection: ("inner"|"outer"|"left") = "outer")`