String Compression

Medium

Solution

Naive Solution

A naive solution would involve creating a new string to store the compressed characters. We iterate through the input chars array, count consecutive repeating characters, and then append the character and its count (if the count is greater than 1) to the new string. Finally, copy the compressed string back to the chars array.

Code (Python):

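def compress_naive(chars):
    if not chars:
        return 0

    compressed = ""
    count = 1  # length of the current run of equal characters
    for i in range(len(chars)):
        if i + 1 < len(chars) and chars[i] == chars[i + 1]:
            # The run continues into the next position.
            count += 1
        else:
            # The run ends at i: emit the character, then its count if > 1.
            compressed += chars[i]
            if count > 1:
                compressed += str(count)
            count = 1

    # Overwrite the array's contents in place with the compressed characters.
    chars[:] = list(compressed)
    return len(chars)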

Big(O) Analysis:

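Time Complexity: O(n), where n is the length of the input array. We iterate through the array once; the bound assumes each string append takes amortized constant time, which repeated += on a Python string does not strictly guarantee.
Space Complexity: O(n). In the worst case (no character repeats), the compressed string is as long as the input array.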
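
The linear time bound for the naive approach depends on cheap appends. A minimal sketch of one way to make that explicit is to collect the pieces in a list and join them once at the end. The name compress_naive_join and the parts list are assumptions introduced here for illustration; the logic otherwise mirrors compress_naive.

```python
def compress_naive_join(chars):
    """Hypothetical variant of compress_naive that collects pieces in a list."""
    if not chars:
        return 0

    parts = []  # pieces of the compressed output, joined once at the end
    count = 1
    for i in range(len(chars)):
        if i + 1 < len(chars) and chars[i] == chars[i + 1]:
            count += 1
        else:
            parts.append(chars[i])
            if count > 1:
                parts.append(str(count))
            count = 1

    # Join once, then split back into single characters for the array.
    chars[:] = list("".join(parts))
    return len(chars)


example = ["a", "a", "b", "b", "c", "c", "c"]
new_length = compress_naive_join(example)
print(new_length, example)  # 6 ['a', '2', 'b', '2', 'c', '3']
```

This still uses O(n) extra space, so it changes nothing about the comparison with the optimal solution.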

Optimal Solution

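The optimal solution uses a two-pointer approach with constant extra space. One pointer (write_ptr) tracks the position where the next compressed character is written, and another (read_ptr) iterates through the input array. Because the compressed form of a group is never longer than the group itself, write_ptr never overtakes read_ptr, so no unread character is overwritten.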

Algorithm:

Initialize write_ptr and read_ptr to 0.
Iterate through the chars array using read_ptr.
For each group of consecutive repeating characters, count the occurrences.
Write the character to chars[write_ptr] and increment write_ptr.
If the count is greater than 1, convert the count to a string and write each digit to chars starting from write_ptr.
Return write_ptr, the length of the compressed prefix of chars. The array is not truncated, so any elements at index write_ptr or later are leftovers and should be ignored.

Code (Python):

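def compress(chars):
    write_ptr = 0  # next position to write compressed output
    read_ptr = 0   # next position to read from the input

    while read_ptr < len(chars):
        # Scan one group of consecutive equal characters.
        char = chars[read_ptr]
        count = 0
        while read_ptr < len(chars) and chars[read_ptr] == char:
            read_ptr += 1
            count += 1

        # Write the group's character.
        chars[write_ptr] = char
        write_ptr += 1

        # Write the count, one digit per array slot, only for runs longer than 1.
        if count > 1:
            for digit in str(count):
                chars[write_ptr] = digit
                write_ptr += 1

    # Length of the compressed prefix; the array itself is not truncated.
    return write_ptr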
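
To see how the two pointers move, the following is an instrumented sketch of the same loop. The function compress_with_trace, the trace list, and the group_start variable are hypothetical debugging aids added for illustration and are not part of the solution.

```python
def compress_with_trace(chars):
    """Same logic as compress, but also records one trace entry per group."""
    trace = []
    write_ptr = 0
    read_ptr = 0

    while read_ptr < len(chars):
        char = chars[read_ptr]
        group_start = read_ptr
        while read_ptr < len(chars) and chars[read_ptr] == char:
            read_ptr += 1
        count = read_ptr - group_start

        chars[write_ptr] = char
        write_ptr += 1
        if count > 1:
            for digit in str(count):
                chars[write_ptr] = digit
                write_ptr += 1

        # (character, run length, read_ptr after the run, write_ptr after writing)
        trace.append((char, count, read_ptr, write_ptr))

    return write_ptr, trace


chars = ["a", "a", "b", "b", "c", "c", "c"]
length, trace = compress_with_trace(chars)
for entry in trace:
    print(entry)
# ('a', 2, 2, 2)
# ('b', 2, 4, 4)
# ('c', 3, 7, 6)
print(length, chars[:length])  # 6 ['a', '2', 'b', '2', 'c', '3']
```

In every entry the write position is at most the read position, and the last element of chars (index 6) is a leftover 'c' outside the returned length.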

Big(O) Analysis:

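Time Complexity: O(n), where n is the length of the input array. read_ptr visits each element exactly once, and the total number of writes is at most n.
Space Complexity: O(1) extra space beyond the input array. The temporary string str(count) holds at most O(log n) digits, which is usually treated as constant.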
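
If the temporary string produced by str(count) is a concern, the digits can be written directly into the array. The sketch below is one possible variant, not the solution given above: compress_no_str and digits_start are names introduced here. It writes the digits least-significant first and then reverses that slice in place.

```python
def compress_no_str(chars):
    """Hypothetical variant of compress that avoids str(count)."""
    write_ptr = 0
    read_ptr = 0

    while read_ptr < len(chars):
        char = chars[read_ptr]
        count = 0
        while read_ptr < len(chars) and chars[read_ptr] == char:
            read_ptr += 1
            count += 1

        chars[write_ptr] = char
        write_ptr += 1

        if count > 1:
            digits_start = write_ptr
            # Emit digits least-significant first...
            while count > 0:
                chars[write_ptr] = chr(ord("0") + count % 10)
                write_ptr += 1
                count //= 10
            # ...then reverse that slice in place to get the usual order.
            left, right = digits_start, write_ptr - 1
            while left < right:
                chars[left], chars[right] = chars[right], chars[left]
                left += 1
                right -= 1

    return write_ptr


chars = ["a"] * 12 + ["b"]
length = compress_no_str(chars)
print(length, chars[:length])  # 4 ['a', '1', '2', 'b']
```

The trade-off is a few more lines of code in exchange for using only integer variables as working storage.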

Edge Cases:

Empty input array: The outer loop never runs, so compress returns 0.
Single character input: The function returns 1 and leaves the array unchanged, since a count of 1 is never written.
All same characters: The array compresses to the character followed by its count, for example ["a", "a", "a"] becomes ["a", "3"].
Characters with counts >= 10: Multi-digit counts are converted to a string and written one digit per array slot, so a run of 12 "b" characters becomes ["b", "1", "2"].
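
The edge cases above can be checked with a few assertions. This is a minimal sketch: compress is repeated so the block runs on its own, and compressed_prefix is a hypothetical helper added here to compare only the valid prefix of the array.

```python
def compress(chars):
    # Two-pointer solution, repeated so this block runs on its own.
    write_ptr = 0
    read_ptr = 0
    while read_ptr < len(chars):
        char = chars[read_ptr]
        count = 0
        while read_ptr < len(chars) and chars[read_ptr] == char:
            read_ptr += 1
            count += 1
        chars[write_ptr] = char
        write_ptr += 1
        if count > 1:
            for digit in str(count):
                chars[write_ptr] = digit
                write_ptr += 1
    return write_ptr


def compressed_prefix(chars):
    """Hypothetical helper: run compress on a copy and return the valid prefix."""
    work = list(chars)
    length = compress(work)
    return work[:length]


assert compressed_prefix([]) == []                                     # empty input
assert compressed_prefix(["a"]) == ["a"]                               # single character
assert compressed_prefix(["a"] * 3) == ["a", "3"]                      # all same characters
assert compressed_prefix(["a"] + ["b"] * 12) == ["a", "b", "1", "2"]   # count >= 10
print("all edge cases pass")
```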

Summary

The optimal solution uses a two-pointer approach to compress the character array in place. Both solutions run in O(n) time, but the two-pointer version needs only constant extra space, whereas the naive solution builds an O(n) intermediate string.
