Join-Object
Join-Object copied to clipboard
Combines two objects lists based on a related property between them.
Join-Object
Combines two object lists based on a related property between them.
Description
Combines properties from one or more objects. It creates a set that can be saved as a new object or used as it is. An object join is a means for combining properties from one (self-join) or more object lists by using values common to each.
Main features:
- Intuitive (SQL like) syntax
- Smart property merging
- Predefined join commands for updating, merging and specific join types
- Well defined pipeline for the (left) input objects and output objects (preserves memory when correctly used)
- Performs about 40% faster than Compare-Object on large object lists
- Supports a list of (custom) objects, strings or primitives and dictionaries (e.g. hash tables) and data tables for input
- Smart properties and calculated property expressions
- Custom relation expressions
- Easy installation (dot-sourcing)
- Supports PowerShell for Windows (5.1) and PowerShell Core
The Join-Object cmdlet reveals the following proxy commands with their own (-JoinType and -Property) defaults:
InnerJoin-Object(AliasInnerJoinorJoin), combines the related objectsLeftJoin-Object(AliasLeftJoin), combines the related objects and adds the rest of the left objectsRightJoin-Object(AliasRightJoin), combines the related objects and adds the rest of the right objectsFullJoin-Object(AliasFullJoin), combines the related objects and adds the rest of the left and right objectsOuterJoin-Object(AliasOuterJoin), returns the symmetric difference of the unrelated objectsCrossJoin-Object(AliasCrossJoin), combines each left object with each right objectUpdate-Object(AliasUpdate), updates the left object with the related right objectMerge-Object(AliasMerge), updates the left object with the related right object and adds the rest of the new (unrelated) right objectsGet-Difference(AliasDiffers), returns the symmetric different objects and their properties
Installation
There are two versions of this Join-Object cmdlet (both versions supply the same functionality):
Install-Module -Name JoinModule
Install-Script -Name Join
(Or rename the Join.psm1 module to a Join.ps1 script file) and invoked the script by dot sourcing:
. .\Join.ps1
Examples
Example 1
PS C:\> $Employee
Id Name Country Department Age ReportsTo
-- ---- ------- ---------- --- ---------
1 Aerts Belgium Sales 40 5
2 Bauer Germany Engineering 31 4
3 Cook England Sales 69 1
4 Duval France Engineering 21 5
5 Evans England Marketing 35
6 Fischer Germany Engineering 29 4
PS C:\> $Department
Name Country
---- -------
Engineering Germany
Marketing England
Sales France
Purchase France
PS C:\> # Join the employees with the departments based on the country
PS C:\> $Employee | InnerJoin $Department -On Country | Format-Table
Id Name Country Department Age ReportsTo
-- ---- ------- ---------- --- ---------
2 {Bauer, Engineering} Germany Engineering 31 4
3 {Cook, Marketing} England Sales 69 1
4 {Duval, Sales} France Engineering 21 5
4 {Duval, Purchase} France Engineering 21 5
5 {Evans, Marketing} England Marketing 35
6 {Fischer, Engineering} Germany Engineering 29 4
Example 2
PS C:\> # Full join the employees with the departments based on the department name
PS C:\> # and Split the names over differend properties
PS C:\> $Employee | InnerJoin $Department -On Department -Equals Name -Discern Employee, Department | Format-Table
Id Name EmployeeCountry DepartmentCountry Department Age ReportsTo
-- ---- --------------- ----------------- ---------- --- ---------
1 Aerts Belgium France Sales 40 5
2 Bauer Germany Germany Engineering 31 4
3 Cook England France Sales 69 1
4 Duval France Germany Engineering 21 5
5 Evans England England Marketing 35
6 Fischer Germany Germany Engineering 29 4
Example 3
PS C:\> $Changes
Id Name Country Department Age ReportsTo
-- ---- ------- ---------- --- ---------
3 Cook England Sales 69 5
6 Fischer France Engineering 29 4
7 Geralds Belgium Sales 71 1
PS C:\> # Apply the changes to the employees
PS C:\> $Employee | Merge $Changes -On Id | Format-Table
Id Name Country Department Age ReportsTo
-- ---- ------- ---------- --- ---------
1 Aerts Belgium Sales 40 5
2 Bauer Germany Engineering 31 4
3 Cook England Sales 69 5
4 Duval France Engineering 21 5
5 Evans England Marketing 35
6 Fischer France Engineering 29 4
7 Geralds Belgium Sales 71 1
Example 4
PS C:\> # (Self) join each employee with its each manager
PS C:\> LeftJoin $Employee -On ReportsTo -Equals Id -Property @{ Name = 'Left.Name' }, @{ Manager = 'Right.Name' }
Name Manager
---- -------
Aerts Evans
Bauer Duval
Cook Aerts
Duval Evans
Evans
Fischer Duval
Example 5
PS C:\> # Add an Id to the department list
PS C:\> 1..9 |Join $Department -ValueName Id
Id Name Country
-- ---- -------
1 Engineering Germany
2 Marketing England
3 Sales France
4 Purchase France
Example 6
PS C:\> $a = 'a1', 'a2', 'a3', 'a4'
PS C:\> $b = 'b1', 'b2', 'b3', 'b4'
PS C:\> $c = 'c1', 'c2', 'c3', 'c4'
PS C:\> $d = 'd1', 'd2', 'd3', 'd4'
PS C:\> # Join (transpose) multiple arrays to a collection array
PS C:\> $a |Join $b |Join $c |Join $d |% { "$_" }
a1 b1 c1 d1
a2 b2 c2 d2
a3 b3 c3 d3
a4 b4 c4 d4
Example 7
PS C:\> # Create objects with named properties from multiple arrays
PS C:\> $a |Join $b |Join $c |Join $d -Name a, b, c, d
a b c d
- - - -
a1 b1 c1 d1
a2 b2 c2 d2
a3 b3 c3 d3
a4 b4 c4 d4
Parameters
-LeftObject <object list, data table or list of hash tables>
The left object list, usually provided through the pipeline, to be joined.
Note: a self-join on the LeftObject list will be performed if the RightObject is omitted.
-RightObject <object list, data table or list of hash tables>
The right object list, provided by the first argument, to be joined.
Note: a self-join on the RightObject list will be performed if the LeftObject is omitted.
-On <String[]>
The -On parameter (alias -Using) defines which objects should be joined together.
If the -Equals parameter is omitted, the value(s) of the properties listed by the -On parameter should be equal at both sides in order to join the left object with the right object.
If the -On parameter contains an expression, the expression will be evaluted where $_, $PSItem and $Left contains the currect object. The result of the expression will be compared to right object property defined by the -Equals parameter.
Note 1: The list of properties defined by the -On parameter will be complemented with the list of
properties defined by the -Equals parameter and vice versa.
Note 2: Related properties will be merged to a single property by default (see also the -Property parameter).
Note 3: If the -On and the -Using parameter are omitted, a side-by-side join is returned.
-Equals <String[]>
If the -Equals parameter is supplied, the value(s) of the left object properties listed by the -On
parameter should be equal to the value(s)of the right object listed by the -Equals parameter in order to join the left object with the right object.
If the -Equals parameter contains an expression, the expression will be evaluted where $_, $PSItem and $Right contains the currect object. The result of the expression will be compared to left object property defined by the -On parameter.
Note 1: The list of properties defined by the -Equal parameter will be complemented with the list of properties defined by the -On parameter and vice versa.
Note 2: A property will be omitted if it exists on both sides and if the property at the other side is related to another property.
Note 3: The -Equals parameter can only be used with the -On parameter.
-Strict
If the -Strict switch is set, the comparison between the related properties defined by the -On Parameter (and the -Equals parameter) is based on a strict equality (both type and value need to be equal).
-MatchCase
If the -MatchCase (alias -CaseSensitive) switch is set, the comparison between the related properties defined by the -On Parameter (and the -Equals parameter) will case sensitive.
-Using <ScriptBlock>
Any conditional expression that requires to evaluate to true in order to join the left object with the right object.
The following variables are exposed for a (ScriptBlock) expression:
$_: iterates each property name$Left: a hash table representing the current left object (each self-contained-LeftObject). The hash table will be empty (@{}) in the outer part of a left join or full join.$LeftIndex: the index of the left object ($Nullin the outer part of a right- or full join)$Right: a hash table representing the current right object (each self-contained-RightObject) The hash table will be empty (@{}) in the outer part of a right join or full join.$RightIndex: the index of the right object ($Nullin the outer part of a left- or full join)
Note 1: The -Using parameter has the most complex comparison possibilities but is considerable slower than the -On parameter.
Note 2: The -Using parameter cannot be used with the -On parameter.
-Where <ScriptBlock>
An expression that defines the condition to be met for the objects to be returned. See the Using parameter for available expression variables.
-Discern <String[]>
By default unrelated properties with the same name will be collected in a single object property.
The -Discern parameter (alias -NameItems) defines how to rename the object properties and divide them over multiple properties. If a given name pattern contains an asterisks (*), the asterisks will be replaced with the original property name. Otherwise, the property name for each property item will be prefixed with the given name pattern.
The property collection of multiple (chained) join commands can be divided in once from the last join command in the change. The rename patterns are right aligned, meaning that the last renamed pattern will be applied to the last object joined. If there are less rename patterns than property items, the rest of the (left most) property items will be put in a fixed array under the original property name.
Note 1: Only properties with the same name on both sides will not be renamed.
Note 2: Related properties (with an equal value defined by the -On parameter) will be merged to a signle item.
-Property <(HashTable or String)[]>
A hash table or list of property names (strings) and/or hash tables that define a new selection of
property names and values
Hash tables should be in the format @{<PropertyName> = <Expression>} where the <Expression> is a ScriptBlock or a smart property (string) and defines how the specific left and right properties should be merged.
The following smart properties are available:
-
A general property:
'<Property Name>', where<Property Name>represents the property name of the left and/or right property, e.g.@{ MyProperty = 'Name' }. If the property exists on both sides, an array holding both values will be returned. In the outer join, the value of the property will be$Null. This smart property is similar to the expression:@{ MyProperty = { @($Left['Name'], $Right['Name']) } } -
A general wildcard property:
'*', where*represents the property name of the current property, e.g.'MyProperty' in @{ MyProperty = '*' }. If the property exists on both sides:- and the properties are unrelated, an array holding both values will be returned
- and the properties are related to each other, the (equal) values will be merged in one property value
- and the property at the other side is related to an different property, the property is omitted
The argument:
-Property *, will apply a general wildcard on all left and right properties. -
A left property:
'Left.<Property Name>', or right property:'Right.<Property Name>', where<Property Name>represents the property name of the either the left or right property. If the property doesn't exist, the value of the property will be$Null. -
A left wildcard property:
'Left.*', or right wildcard property:'Right.*', where*represents the property name of the current the left or right property, e.g.'MyProperty' in @{ MyProperty = 'Left.*' }. If the property doesn't exist (in an outer join), the property with the same name at the other side will be taken. If the property doesn't exist on either side, the value of the property will be$Null.The argument:
-Property 'Left.*', will apply a left wildcard property on all the left object properties.
If the -Property parameter and the -Discern parameter are omitted, a general wildcard property is applied on all the left and right properties.
The last defined expression or smart property will overrule any previous defined properties.
-ValueName <String>
Defines the default name for the property name in case a scalar array is joined with an object array.
-JoinType <'Inner'|'Left'|'Right'|'Full'|'Cross'>
Defines which unrelated objects should be included (see: Description). The default is 'Inner'.
Note: It is recommended to use the related proxy commands (... | <JoinType>-Object ...) instead.
Please give a 👍 if you support the proposal to Add a Join-Object cmdlet to the standard PowerShell equipment (#14994)