Metadata-Version: 2.1
Name: spark_gaps_date_rorc_tools
Version: 0.0.14
Summary: spark_gaps_date_rorc_tools
Home-page: https://github.com/jonaqp/spark_gaps_date_rorc_tools/
Author: Jonathan Quiza
Author-email: jony327@gmail.com
License: UNKNOWN
Download-URL: https://github.com/jonaqp/spark_gaps_date_rorct_tools/archive/main.zip
Description: # spark_gaps_date_rorc_tools
        
        
        [![Github License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
        [![Updates](https://pyup.io/repos/github/woctezuma/google-colab-transfer/shield.svg)](pyup)
        [![Python 3](https://pyup.io/repos/github/woctezuma/google-colab-transfer/python-3-shield.svg)](pyup)
        [![Code coverage](https://codecov.io/gh/woctezuma/google-colab-transfer/branch/master/graph/badge.svg)](codecov)
        
        
        
        
        spark_gaps_date_rorc_tools is a Python library that implements get gaps dates
        ## Installation
        
        The code is packaged for PyPI, so that the installation consists in running:
        ```sh
        pip install spark-gaps-date-rorc-tools 
        ```
        
        
        ## Usage
        
        wrapper take gaps dates
        
        ```sh
        config.yaml
        ===========
          conf-rorc:
            t_psan_test:
              table_path: "/data/master/psan/data/t_psan_test/"
              supplies : [
                  "/data/master/psan/data/t_ksag_test/",
                  "/data/master/psan/data/t_psan_test/"
              ]
            t_kctk_cust_rating_atrb:
              table_path: ""
               supplies : []
        
        
        
        example1: file.py
        =================
        from spark_gaps_date_rorc_tools import show_gaps_date
        df_pivot = show_gaps_date(spark=spark,
                                  config_path_name="config.yaml",
                                  hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
                                  filter_date_initial="202101",
                                  filter_date_final="202112")
        df_pivot.head()
        
        
        
        example2: file.py
        =================
        from spark_gaps_date_rorc_tools import show_gaps_date
        df_pivot = show_gaps_date(spark=spark,
                                  config_path_name="config.yaml",
                                  table_rorc=["t_psan_xxx"]
                                  hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
                                  filter_date_initial="202101",
                                  filter_date_final="202112")
        
        df_pivot.head()
        
        
        Style Dataframe (Spark): file.py
        =========================
        import pyspark
        from spark_gaps_date_rorc_tools import show_spark_df
        pyspark.sql.dataframe.DataFrame.show2 = show_spark_df
        
        df_pivot = show_gaps_date(spark=spark,
                                  config_path_name="config.yaml",
                                  table_rorc=["t_psan_xxx"]
                                  hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
                                  filter_date_initial="202101",
                                  filter_date_final="202112")
                                  
        df_pivot.show2()
        
        
        
        Style Dataframe (Pandas): file.py
        =========================
        import pandas as pd
        from spark_gaps_date_rorc_tools import show_pd_df
        pd.DataFrame.show2 = show_pd_df
        
        df_pivot = show_gaps_date(spark=spark,
                                  config_path_name="config.yaml",
                                  table_rorc=["t_psan_xxx"]
                                  hdfs_uri="hdfs://pedaaslive.scmx2p100.isi",
                                  filter_date_initial="202101",
                                  filter_date_final="202112")
                                  
        df_pivot2 = df_pivot.toPandas()                      
        df_pivot2.show2()
        
        ```
        
        ## License
        
        [Apache License 2.0](https://www.dropbox.com/s/8t6xtgk06o3ij61/LICENSE?dl=0).
        
        
        ## New features v1.0
        
         
        ## BugFix
        - choco install visualcpp-build-tools
        
        
        
        ## Reference
        
         - Jonathan Quiza [github](https://github.com/jonaqp).
         - Jonathan Quiza [RumiMLSpark](http://rumi-ml.herokuapp.com/).
         - Jonathan Quiza [linkedin](https://www.linkedin.com/in/jonaqp/).
        
Keywords: spark,gaps,date
Platform: UNKNOWN
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.7
Description-Content-Type: text/markdown
