Fix typos (#878)

treysp · web-flow · commit bb528e9130a0 · 2023-05-22T12:30:31.000-05:00
diff --git a/docs/comparisons.md b/docs/comparisons.md
@@ -5,7 +5,7 @@
 There are many tools and frameworks in the data ecosystem. This page tries to make sense of it all.
 
 ## dbt
-[dbt](https://www.getdbt.com/) is a tool for data transformations. It is a pioneer in this space and has shown how valuable transformation frameworks can be. Although dbt is a fanstastic tool, it has trouble scaling with data and organizational size.
+[dbt](https://www.getdbt.com/) is a tool for data transformations. It is a pioneer in this space and has shown how valuable transformation frameworks can be. Although dbt is a fantastic tool, it has trouble scaling with data and organizational size.
 
 dbt built their product focused on simple data transformations. By default, it fully refreshes data warehouses by executing templated SQL in the correct order.
 
@@ -107,7 +107,7 @@ WHERE d.ds BETWEEN @start_ds AND @end_ds
 #### Data leakage
 dbt does not check whether the data inserted into an incremental table should be there or not. This can lead to problems and consistency issues, such as late-arriving data overriding past partitions. These problems are called "data leakage."
 
-SQLMesh wraps all queries in a subquery with a time filter under the hood to enforce that the data inserted for a particular batch is as expected and reproducible everytime.
+SQLMesh wraps all queries in a subquery with a time filter under the hood to enforce that the data inserted for a particular batch is as expected and reproducible every time.
 
 In addition, dbt only supports the 'insert/overwrite' incremental load pattern for systems that natively support it. SQLMesh enables 'insert/overwrite' on any system, because it is the most robust approach to incremental loading, while 'Append' pipelines risk data inaccuracy in the variety of scenarios where your pipelines may run more than once for a given date.
 
diff --git a/docs/concepts/models/overview.md b/docs/concepts/models/overview.md
@@ -33,7 +33,7 @@ The `SELECT` expression of a model must follow certain conventions for SQLMesh t
 ### Unique column names
 The final `SELECT` of a model's query must contain unique column names.
 
-### Explict types
+### Explicit types
 SQLMesh encourages explicit type casting in the final `SELECT` of a model's query. It is considered a best practice to prevent unexpected types in the schema of a model's table.
 
 SQLMesh uses the postgres `x::int` syntax for casting; the casts are automatically transpiled to the appropriate format for the execution engine.
@@ -55,11 +55,11 @@ This example demonstrates non-inferrable, inferrable, and explicit aliases:
 ```sql linenums="1"
 SELECT
   1, -- not inferrable
-  x + 1, -- not infererrable
-  SUM(x), -- not infererrable
+  x + 1, -- not inferrable
+  SUM(x), -- not inferrable
   x, -- inferrable as x
   x::int, -- inferrable as x
-  x + 1 AS x, -- explictly x
+  x + 1 AS x, -- explicitly x
   SUM(x) as x, -- explicitly x
 ```
 
@@ -87,7 +87,7 @@ Name is ***required*** and must be ***unique***.
 - Start is used to determine the earliest time needed to process the model. It can be an absolute date/time (`2022-01-01`), or a relative one (`1 year ago`).
 
 ### cron
-- Cron is used to schedule your model to process or refresh at a certain interval. It uses [croniter](https://github.com/kiorky/croniter) under the hood, so expressions such as `@daily` can be used. A model's `IntervalUnit` is determined implicity by the cron expression.
+- Cron is used to schedule your model to process or refresh at a certain interval. It uses [croniter](https://github.com/kiorky/croniter) under the hood, so expressions such as `@daily` can be used. A model's `IntervalUnit` is determined implicitly by the cron expression.
 
 ### storage_format
 - Storage format is a property for engines such as Spark or Hive that support storage formats such as  `parquet` and `orc`.
@@ -112,7 +112,7 @@ For models that are incremental, the following parameters can be specified in th
 - Batch size is used to optimize backfilling incremental data. It determines the maximum number of intervals to run in a single job. For example, if a model specifies a cron of `@hourly` and a batch_size of `12`, when backfilling 3 days of data, the scheduler will spawn 6 jobs. (3 days * 24 hours/day = 72 hour intervals to fill. 72 intervals / 12 intervals per job = 6 jobs.)
 
 ## Macros
-Macros can be used for passing in paramaterized arguments such as dates, as well as for making SQL less repetitive. By default, SQLMesh provides several predefined macro variables that can be used. Macros are used by prefixing with the `@` symbol. For more information, refer to [macros](../macros.md).
+Macros can be used for passing in parameterized arguments such as dates, as well as for making SQL less repetitive. By default, SQLMesh provides several predefined macro variables that can be used. Macros are used by prefixing with the `@` symbol. For more information, refer to [macros](../macros.md).
 
 ## Statements
 Models can have additional statements that run before the main query. This can be useful for loading things such as [UDFs](../glossary.md#user-defined-function-udf).
diff --git a/docs/concepts/models/python_models.md b/docs/concepts/models/python_models.md
@@ -1,6 +1,6 @@
 # Python models
 
-Although SQL is a powerful tool, some use cases are better handled by Python. For example, Pyton may be a better option in pipelines that involve machine learning, interacting with external APIs, or complex business logic that cannot be expressed in SQL. 
+Although SQL is a powerful tool, some use cases are better handled by Python. For example, Python may be a better option in pipelines that involve machine learning, interacting with external APIs, or complex business logic that cannot be expressed in SQL. 
 
 SQLMesh has first-class support for models defined in Python; there are no restrictions on what can be done in the Python model as long as it returns a Pandas or Spark DataFrame instance.
 
diff --git a/docs/concepts/overview.md b/docs/concepts/overview.md
@@ -3,7 +3,7 @@
 This page provides a conceptual overview of what SQLMesh does and how its components fit together.
 
 ## What SQLMesh is
-SQLMesh is a Python framework that automates everything needed to run a scaleable data transformation platform. SQLMesh works with a variety of [engines and orchestrators](../integrations/overview.md). 
+SQLMesh is a Python framework that automates everything needed to run a scalable data transformation platform. SQLMesh works with a variety of [engines and orchestrators](../integrations/overview.md). 
 
 It was created with a focus on both data and organizational scale and works regardless of your data warehouse or SQL engine's capabilities.
 
diff --git a/docs/guides/projects.md b/docs/guides/projects.md
@@ -4,7 +4,7 @@
 
 ---
 
-Before getting started, ensure that you meet the [prerequsities](../prerequisites.md) for using SQLMesh.
+Before getting started, ensure that you meet the [prerequisites](../prerequisites.md) for using SQLMesh.
 
 ---
 
@@ -58,7 +58,7 @@ To create a project from the command line, follow these steps:
 
 To edit an existing project, open the project file you wish to edit in your preferred editor.
 
-If using CLI or Notebook, you can open a file in your project for editing by using the `sqlmesh` command with the `-p` varaible, and pointing to your project's path as follows:
+If using CLI or Notebook, you can open a file in your project for editing by using the `sqlmesh` command with the `-p` variable, and pointing to your project's path as follows:
 
 ```bash
 sqlmesh -p <your-project-path>
diff --git a/docs/index.md b/docs/index.md
@@ -18,7 +18,7 @@ Here are some challenges that data teams run into, especially when data sizes in
     * Validating changes to data pipelines before deploying to production is an uncertain and sometimes expensive process. Although branches can be deployed to environments, when merged to production, the code is re-run. This is wasteful and generates uncertainty because the data is regenerated.
 
 1. Silos transform data lakes to data swamps
-    * The difficulty and cost of making changes to core pipelines can lead to duplicate pipelines with minor customizations. The inability to easily make and validate changes causes contributors to follow the "path of least resistence". The proliferation of similar tables leads to additional costs, inconsistencies, and maintenance burden.
+    * The difficulty and cost of making changes to core pipelines can lead to duplicate pipelines with minor customizations. The inability to easily make and validate changes causes contributors to follow the "path of least resistance". The proliferation of similar tables leads to additional costs, inconsistencies, and maintenance burden.
 
 ## What is SQLMesh?
 SQLMesh consists of a CLI, a Python API, and a Web UI to make data pipeline development and deployment easy, efficient, and safe.