Содержание
- 4. RDD Basics
- 5. RDD Basics
- 6. RDD Basics
- 7. RDD Basics
- 8. DataFrames
- 9. Datasets
- 10. SQL vs. DataFrame vs. Dataset
- 11. Spark ML Pipelines
- 12. Spark ML Pipelines Transformer
- 13. Spark ML Pipelines Transformer Estimator
- 14. Spark ML Pipelines
- 15. Spark ML Pipelines
- 16. Spark ML Pipelines
- 17. Spark ML Core
- 18. Field Metadata and Attributes
- 19. Prediction Model
- 20. “My Spark ML Model”
- 21. Spark ML Features ETL SQLTransformer SqlFilter, ColumnsExtractor Numerization OneHotEncoder StringIndexer MultinomialExtractor Vectorization VectorAssembler FeatureHasher AutoAssembler Feature
- 22. Spark ML Features Feature Engineering DCT ElementwiseProduct Interaction VectorIndexer PolynomialExpansion Feature Selection ChiSqSelector FoldedFeaturesSelector Dimension reduction
- 23. Spark ML Features Texts extraction Tokenizer RegexTokenizer Ngram StopWordsRemover NLP in Pravada-ML LanguageDetectorTransformer LanguageAwareAnalyzer NGramExtractor URLElimminator
- 24. Spark ML Features Regression Classification
- 25. Spark ML Features Recommendations ALS FPGrowth Evaluation BinaryClassificationEvaluator ClusteringEvaluator MulticlassClassificationEvaluator RegressionEvaluator Tuning ParamGridBuilder CrossValidator More from
- 27. Скачать презентацию