Geek Slack

Course Content
Working with Pandas DataFrame
A DataFrame is the core data structure in Pandas, providing a two-dimensional table for storing data. It is similar to a spreadsheet or SQL table and consists of rows and columns, where each column can be of a different data type (e.g., integer, float, string).DataFrames are extremely versatile and essential for data manipulation and analysis tasks. You can create, manipulate, and analyze data stored in DataFrames with a variety of operations that are both powerful and easy to use.
0/7
Working with Date and Time in Pandas
Handling date and time data is an essential part of data analysis. Pandas provides robust functionality for working with time series data, allowing you to parse, manipulate, and perform various operations on date and time information. This chapter will guide you through the essential tools and methods provided by Pandas to handle date and time data effectively.Pandas offers the datetime module along with its own Timestamp and DatetimeIndex classes to work with time data. It also provides a set of methods for parsing, converting, and manipulating time-related data.
0/5
Working with Text Data in Pandas
Text data, also known as string data, is one of the most common data types encountered in data analysis. In Pandas, strings are represented as object dtype, which is typically used for text or mixed types. Pandas provides a variety of functions to efficiently handle and manipulate text data, whether you're cleaning, transforming, or extracting meaningful information.This chapter will guide you through the various tools and techniques that Pandas provides for working with text data. We’ll explore string manipulation methods, cleaning operations, and more advanced use cases like extracting specific patterns using regular expressions.
0/4
Pandas Tutorial
    About Lesson

    When working with text data in Pandas, string operations can be performed on entire columns using the .str accessor, which provides vectorized string functions.

    a. Creating a DataFrame with Text Data

    Let’s start by creating a DataFrame that contains text data to work with.

     

    import pandas as pd
    
    data = {
        'Name': ['Alice', 'Bob', 'Charlie', 'David'],
        'City': ['New York', 'Los Angeles', 'San Francisco', 'Chicago'],
        'Occupation': ['Engineer', 'Artist', 'Scientist', 'Chef']
    }
    
    df = pd.DataFrame(data)
    print(df)
    

    Output:

          Name           City   Occupation
    0    Alice       New York     Engineer
    1      Bob    Los Angeles       Artist
    2  Charlie  San Francisco    Scientist
    3    David        Chicago         Chef
    
    b. Accessing Text Data with .str

    To access the string methods, use the .str accessor, followed by the string function you want to use.

    Example 1: Converting to Uppercase

     

    df['City_upper'] = df['City'].str.upper()
    print(df)
    

    Output:

          Name           City   Occupation     City_upper
    0    Alice       New York     Engineer      NEW YORK
    1      Bob    Los Angeles       Artist    LOS ANGELES
    2  Charlie  San Francisco    Scientist  SAN FRANCISCO
    3    David        Chicago         Chef        CHICAGO
    

    Example 2: Finding the Length of Strings

    You can get the length of each string in a column using the .str.len() method.

    df['City_length'] = df['City'].str.len()
    print(df)
    

    Output:

          Name           City   Occupation     City_upper  City_length
    0    Alice       New York     Engineer      NEW YORK           8
    1      Bob    Los Angeles       Artist    LOS ANGELES          11
    2  Charlie  San Francisco    Scientist  SAN FRANCISCO          13
    3    David        Chicago         Chef        CHICAGO           7