[SPARK-6747] [SQL] Support List<> as a return type in Hive UDF #6179
Closed
Commits
ba8a453
[SPARK-8482] Added M4 instances to the list.
pradeepchhetri 5d89d9f
[SPARK-8511] [PYSPARK] Modify a test to remove a saved model in `regr…
yu-iskw da7bbb9
[SPARK-8104] [SQL] auto alias expressions in analyzer
cloud-fan 5ab9fcf
[SPARK-8532] [SQL] In Python's DataFrameWriter, save/saveAsTable/json…
yhuai afe35f0
[SPARK-8455] [ML] Implement n-gram feature transformer
b1f3a48
[SPARK-8537] [SPARKR] Add a validation rule about the curly braces in…
yu-iskw 50d3242
[SPARK-8356] [SQL] Reconcile callUDF and callUdf
BenFradet 96aa013
[SPARK-8492] [SQL] support binaryType in UnsafeRow
1dfb0f7
[HOTFIX] [TESTS] Typo mqqt -> mqtt
860a49e
[SPARK-7153] [SQL] support all integral type ordinal in GetArrayItem
cloud-fan 6b7f2ce
[SPARK-8307] [SQL] improve timestamp from parquet
13321e6
[SPARK-7859] [SQL] Collect_set() behavior differences which fails the…
chenghao-intel c4d2343
MAINTENANCE: Automated closing of pull requests.
pwendell 44fa7df
[SPARK-8548] [SPARKR] Remove the trailing whitespaces from the SparkR…
yu-iskw 164fe2a
[SPARK-7781] [MLLIB] gradient boosted trees.train regressor missing m…
holdenk d4f6335
[SPARK-8431] [SPARKR] Add in operator to DataFrame Column in SparkR
yu-iskw 31bd306
[SPARK-8359] [SQL] Fix incorrect decimal precision after multiplication
viirya 9b618fb
[SPARK-8483] [STREAMING] Remove commons-lang3 dependency from Flume Si…
harishreedharan f0dcbe8
[SPARK-8541] [PYSPARK] test the absolute error in approx doctests
megatron-me-uk 6ceb169
[SPARK-8300] DataFrame hint for broadcast join.
rxin 0f92be5
[SPARK-8498] [TUNGSTEN] fix npe in errorhandling path in unsafeshuffl…
holdenk 4f7fbef
[SQL] [DOCS] updated the documentation for explode
lockwobr 7b1450b
[SPARK-7235] [SQL] Refactor the grouping sets
chenghao-intel 6f4cadf
[SPARK-8432] [SQL] fix hashCode() and equals() of BinaryType in Row
2b1111d
[SPARK-7888] Be able to disable intercept in linear regression in ml …
holdenk f2022fa
[SPARK-8265] [MLLIB] [PYSPARK] Add LinearDataGenerator to pyspark.mll…
MechCoder f2fb028
[SPARK-8111] [SPARKR] SparkR shell should display Spark logo and vers…
a803118
[SPARK-8525] [MLLIB] fix LabeledPoint parser when there is a whitespa…
fe2s d96d7b5
[DOC] [SQL] Addes Hive metastore Parquet table conversion section
liancheng 7fb5ae5
[SPARK-8573] [SPARK-8568] [SQL] [PYSPARK] raise Exception if column i…
111d6b9
[SPARK-8139] [SQL] Updates docs and comments of data sources and Parq…
liancheng 0401cba
[SPARK-7157][SQL] add sampleBy to DataFrame
mengxr a458efc
Revert "[SPARK-7157][SQL] add sampleBy to DataFrame"
rxin 50c3a86
[SPARK-6749] [SQL] Make metastore client robust to underlying socket …
ericl 13ae806
[HOTFIX] [BUILD] Fix MiMa checks in master branch; enable MiMa for la…
JoshRosen 09fcf96
[SPARK-8371] [SQL] improve unit test for MaxOf and MinOf and fix bugs
cloud-fan cc465fd
[SPARK-8138] [SQL] Improves error message when conflicting partition …
liancheng 9d36ec2
[SPARK-8567] [SQL] Debugging flaky HiveSparkSubmitSuite
liancheng bba6699
[SPARK-8578] [SQL] Should ignore user defined output committer when a…
yhuai 31f48e5
[SPARK-8576] Add spark-ec2 options to set IAM roles and instance-init…
nchammas 1173483
[SPARK-8399] [STREAMING] [WEB UI] Overlap between histograms and axis…
BenFradet 43e6619
[SPARK-8506] Add pakages to R context created through init.
holdenk b84d4b4
[SPARK-7088] [SQL] Fix analysis for 3rd party logical plan.
smola f04b567
[SPARK-7289] handle project -> limit -> sort efficiently
cloud-fan fb32c38
[SPARK-7633] [MLLIB] [PYSPARK] Python bindings for StreamingLogisticR…
MechCoder 8ab5076
[SPARK-6777] [SQL] Implements backwards compatibility rules in Cataly…
liancheng dca21a8
[SPARK-8558] [BUILD] Script /dev/run-tests fails when _JAVA_OPTIONS e…
fe2s 7daa702
[SPARK-8567] [SQL] Increase the timeout of HiveSparkSubmitSuite
yhuai b71d325
[SPARK-8075] [SQL] apply type check interface to more expressions
cloud-fan 82f80c1
Two minor SQL cleanup (compiler warning & indent).
rxin 7bac2fe
[SPARK-7884] Move block deserialization from BlockStoreShuffleFetcher…
massie c337844
[SPARK-8604] [SQL] HadoopFsRelation subclasses should set their outpu…
liancheng 085a721
[SPARK-5768] [WEB UI] Fix for incorrect memory in Spark UI
rekhajoshm e988adb
[SPARK-8574] org/apache/spark/unsafe doesn't honor the java source/ta…
f9b397f
[SPARK-8567] [SQL] Add logs to record the progress of HiveSparkSubmit…
yhuai 2519dcc
[MINOR] [MLLIB] rename some functions of PythonMLLibAPI
yanboliang c392a9e
[SPARK-8637] [SPARKR] [HOTFIX] Fix packages argument, sparkSubmitBinName
shivaram 47c874b
[SPARK-8237] [SQL] Add misc function sha2
viirya 4036011
[SPARK-8620] [SQL] cleanup CodeGenContext
cloud-fan 1a79f0e
[SPARK-8635] [SQL] improve performance of CatalystTypeConverters
cloud-fan 9fed6ab
[SPARK-8344] Add message processing time metric to DAGScheduler
JoshRosen c9e05a3
[SPARK-8613] [ML] [TRIVIAL] add param to disable linear feature scaling
holdenk 37bf76a
[SPARK-8302] Support heterogeneous cluster install paths on YARN.
41afa16
[SPARK-8652] [PYSPARK] Check return value for all uses of doctest.tes…
JoshRosen a56516f
[SPARK-8662] SparkR Update SparkSQL Test
9d11817
[SPARK-8607] SparkR -- jars not being added to application classpath …
b5a6663
[SPARK-8639] [DOCS] Fixed Minor Typos in Documentation
Rosstin d48e789
[SPARK-3629] [YARN] [DOCS]: Improvement of the "Running Spark on YARN…
4153776
[SPARK-8623] Hadoop RDDs fail to properly serialize configuration
sryza 0b5abbf
[SPARK-8606] Prevent exceptions in RDD.getPreferredLocations() from c…
JoshRosen 40648c5
[SPARK-8583] [SPARK-5482] [BUILD] Refactor python/run-tests to integr…
JoshRosen 42db3a1
[HOTFIX] Fix pull request builder bug in #6967
JoshRosen f510045
[SPARK-8683] [BUILD] Depend on mockito-core instead of mockito-all
JoshRosen 52d1281
[SPARK-8649] [BUILD] Mapr repository is not defined properly
tszym 77da5be
[SPARK-8610] [SQL] Separate Row and InternalRow (part 2)
ec78438
[SPARK-8686] [SQL] DataFrame should support `where` with expression r…
sarutak 9ce78b4
[SPARK-8596] [EC2] Added port for Rstudio
koaning 24fda73
[SPARK-8677] [SQL] Fix non-terminating decimal expansion for decimal …
viirya 00a9d22
[SPARK-7845] [BUILD] Bumping default Hadoop version used in profile h…
liancheng 25f574e
[SPARK-7212] [MLLIB] Add sequence learning flag
dfde31d
[SPARK-5962] [MLLIB] Python support for Power Iteration Clustering
yanboliang 0b10662
[SPARK-8575] [SQL] Deprecate callUDF in favor of udf
BenFradet ac2e17b
[SPARK-8355] [SQL] Python DataFrameReader/Writer should mirror Scala
660c6ce
[SPARK-8698] partitionBy in Python DataFrame reader/writer interface …
rxin 630bd5f
[SPARK-8702] [WEBUI] Avoid massive concating strings in Javascript
zsxwing 5c796d5
[SPARK-8693] [PROJECT INFRA] profiles and goals are not printed in a …
715f084
[SPARK-8554] Add the SparkR document files to `.rat-excludes` for `./…
yu-iskw ea88b1a
Revert "[SPARK-8372] History server shows incorrect information for a…
ed413bc
[SPARK-8692] [SQL] re-order the case statements that handling catalys…
cloud-fan 3664ee2
[SPARK-8066, SPARK-8067] [hive] Add support for Hive 1.0, 1.1 and 1.2.
a5c2961
[SPARK-8235] [SQL] misc function sha / sha1
tarekbecker 492dca3
[SPARK-8528] Expose SparkContext.applicationId in PySpark
94e040d
[SQL][DOCS] Remove wrong example from DataFrame.scala
sarutak 637b4ee
[SPARK-8214] [SQL] Add function hex
zhichao-li c6ba2ea
[SPARK-7862] [SQL] Disable the error message redirect to stderr
chenghao-intel be7ef06
[SPARK-8681] fixed wrong ordering of columns in crosstab
brkyvz afae976
[SPARK-8070] [SQL] [PYSPARK] avoid spark jobs in createDataFrame
27ef854
[SPARK-8709] Exclude hadoop-client's mockito-all dependency
JoshRosen f6fc254
[SPARK-8056][SQL] Design an easier way to construct schema for both S…
ecd3aac
[SPARK-7810] [PYSPARK] solve python rdd socket connection problem
c8ae887
[SPARK-8660][ML] Convert JavaDoc style comments inLogisticRegressionS…
Rosstin 931da5c
[SPARK-8478] [SQL] Harmonize UDF-related code to use uniformly UDF in…
BenFradet ed359de
[SPARK-8579] [SQL] support arbitrary object in UnsafeRow
4e880cf
[SPARK-8661][ML] for LinearRegressionSuite.scala, changed javadoc-sty…
Rosstin 4b497a7
[SPARK-8710] [SQL] Change ScalaReflection.mirror from a val to a def.
yhuai 881662e
[SPARK-8589] [SQL] cleanup DateTimeUtils
cloud-fan cec9852
[SPARK-8634] [STREAMING] [TESTS] Fix flaky test StreamingListenerSuit…
zsxwing fbf7573
[SPARK-7287] [SPARK-8567] [TEST] Add sc.stop to applications in Spark…
yhuai 5d30eae
[SPARK-8437] [DOCS] Using directory path without wildcard for filenam…
srowen d7f796d
[SPARK-8410] [SPARK-8475] remove previous ivy resolution when using s…
brkyvz 4a9e03f
[SPARK-8019] [SPARKR] Support SparkR spawning worker R processes with…
msannell 4c1808b
Revert "[SPARK-8437] [DOCS] Using directory path without wildcard for…
620605a
[SPARK-8456] [ML] Ngram featurizer python
ecacb1e
[SPARK-8715] ArrayOutOfBoundsException fixed for DataFrameStatSuite.c…
brkyvz 4915e9e
[SPARK-8669] [SQL] Fix crash with BINARY (ENUM) fields with Parquet 1.7
stshe f9b6bf2
[SPARK-7667] [MLLIB] MLlib Python API consistency check
yanboliang 7bbbe38
[SPARK-5161] Parallelize Python test execution
JoshRosen ea775b0
MAINTENANCE: Automated closing of pull requests.
pwendell f79410c
[SPARK-8721][SQL] Rename ExpectsInputTypes => AutoCastInputTypes.
rxin e6c3f74
[SPARK-8650] [SQL] Use the user-specified app name priority in SparkS…
watermen 6c5a6db
[SPARK-5161] [HOTFIX] Fix bug in Python test failure reporting
JoshRosen 12671dd
[SPARK-8434][SQL]Add a "pretty" parameter to the "show" method to dis…
zsxwing 5452457
[SPARK-8551] [ML] Elastic net python code example
coderxiang 2ed0c0a
[SPARK-7756] [CORE] More robust SSL options processing.
tellison 08fab48
[SPARK-8590] [SQL] add code gen for ExtractValue
cloud-fan 865a834
[SPARK-8723] [SQL] improve divide and remainder code gen
cloud-fan a48e619
[SPARK-8680] [SQL] Slightly improve PropagateTypes
viirya 722aa5f
[SPARK-8236] [SQL] misc functions: crc32
qiansl127 689da28
[SPARK-8592] [CORE] CoarseGrainedExecutorBackend: Cannot register wit…
xuchenCN ada384b
[SPARK-8437] [DOCS] Corrected: Using directory path without wildcard …
srowen 4528166
[SPARK-4127] [MLLIB] [PYSPARK] Python bindings for StreamingLinearReg…
MechCoder 5fa0863
[SPARK-8679] [PYSPARK] [MLLIB] Default values in Pipeline API should …
MechCoder fbb267e
[SPARK-8713] Make codegen thread safe
9213f73
[SPARK-8615] [DOCUMENTATION] Fixed Sample deprecated code
ca7e460
[SPARK-7988] [STREAMING] Round-robin scheduling of receivers by default
nishkamravi2 5726440
[SPARK-8630] [STREAMING] Prevent from checkpointing QueueInputDStream
zsxwing d16a944
[SPARK-8619] [STREAMING] Don't recover keytab and principal configura…
SaintBacchus 1e1f339
[SPARK-6785] [SQL] fix DateTimeUtils for dates before 1970
ckadner c1befd7
[SPARK-8664] [ML] Add PCA transformer
yanboliang b8e5bb6
[SPARK-8628] [SQL] Race condition in AbstractSparkSQLParser.parse
74cc16d
[SPARK-8471] [ML] Discrete Cosine Transform Feature Transformer
61d7b53
[SPARK-7514] [MLLIB] Add MinMaxScaler to feature transformation
hhbyyh 79f0b37
[SPARK-8560] [UI] The Executors page will have negative if having res…
XuTingjun 7dda084
[SPARK-2645] [CORE] Allow SparkEnv.stop() to be called multiple times…
rekhajoshm 4bb8375
[SPARK-8372] Do not show applications that haven't recorded their app…
3ba23ff
[SPARK-8736] [ML] GBTRegressor should not threshold prediction
jkbradley 8c89896
[SPARK-8705] [WEBUI] Don't display rects when totalExecutionTime is 0
zsxwing e725262
[SPARK-8563] [MLLIB] Fixed a bug so that IndexedRowMatrix.computeSVD(…
lee19 d2495f7
[SPARK-8739] [WEB UI] [WINDOWS] A illegal character `\r` can be conta…
sarutak 58ee2a2
[SPARK-8738] [SQL] [PYSPARK] capture SQL AnalysisException in Python API
8d23587
[SPARK-7739] [MLLIB] Improve ChiSqSelector example code in user guide
sethah 8133125
[SPARK-8741] [SQL] Remove e and pi from DataFrame functions.
rxin ccdb052
[SPARK-8727] [SQL] Missing python api; md5, log2
tarekbecker 3bee0f1
[SPARK-6602][Core] Update Master, Worker, Client, AppClient and relat…
zsxwing f457569
[SPARK-8471] [ML] Rename DiscreteCosineTransformer to DCT
b6e76ed
[SPARK-8535] [PYSPARK] PySpark : Can't create DataFrame from Pandas d…
x1- 64c1461
[SPARK-6602][Core]Remove unnecessary synchronized
zsxwing 365c140
[SPARK-8748][SQL] Move castability test out from Cast case class into…
rxin fc3a6fe
[SPARK-8749][SQL] Remove HiveTypeCoercion trait.
rxin 0eee061
[SQL] [MINOR] remove internalRowRDD in DataFrame
cloud-fan 9765241
[SPARK-8750][SQL] Remove the closure in functions.callUdf.
rxin fdcad6e
[SPARK-8763] [PYSPARK] executing run-tests.py with Python 2.6 fails w…
cocoatomo 69c5dee
[SPARK-7714] [SPARKR] SparkR tests should use more specific expectati…
4137f76
[SPARK-8752][SQL] Add ExpectsInputTypes trait for defining expected i…
rxin 31b4a3d
[SPARK-8621] [SQL] support empty string as column name
cloud-fan 184de91
[SPARK-6263] [MLLIB] Python MLlib API missing items: Utils
Lewuathe 2012913
[SPARK-8308] [MLLIB] add missing save load for python example
hhbyyh b8faa32
[SPARK-8765] [MLLIB] [PYTHON] removed flaky python PIC test
jkbradley 75b9fe4
[SPARK-8378] [STREAMING] Add the Python API for Flume
zsxwing 9f7db34
[SPARK-7820] [BUILD] Fix Java8-tests suite compile and test error und…
jerryshao 3083e17
[QUICKFIX] [SQL] fix copy of generated row
1ce6428
[SPARK-3444] [CORE] Restore INFO level after log4j test.
f958f27
[SPARK-8766] support non-ascii character in column names
2727789
[SPARK-8770][SQL] Create BinaryOperator abstract class.
rxin 3a342de
Revert "[SPARK-8770][SQL] Create BinaryOperator abstract class."
rxin 9fd13d5
[SPARK-8770][SQL] Create BinaryOperator abstract class.
rxin 4e4f74b
[SPARK-8660] [MLLIB] removed > symbols from comments in LogisticRegre…
Rosstin b285ac5
[SPARK-8227] [SQL] Add function unhex
zhichao-li 792fcd8
[SPARK-8754] [YARN] YarnClientSchedulerBackend doesn't stop gracefull…
646366b
[SPARK-8688] [YARN] Bug fix: disable the cache fs to gain the HDFS co…
SaintBacchus d14338e
[SPARK-8771] [TRIVIAL] Add a version to the deprecated annotation for…
holdenk 15d41cc
[SPARK-8769] [TRIVIAL] [DOCS] toLocalIterator should mention it resul…
holdenk 377ff4c
[SPARK-8740] [PROJECT INFRA] Support GitHub OAuth tokens in dev/merge…
JoshRosen 3697232
[SPARK-3071] Increase default driver memory
1b0c8e6
[SPARK-8687] [YARN] Fix bug: Executor can't fetch the new set configu…
SaintBacchus 4158836
[DOCS] Fix minor wrong lambda expression example.
sarutak c572e25
[SPARK-8787] [SQL] Changed parameter order of @deprecated in package …
1bbdf9e
[SPARK-8746] [SQL] update download link for Hive 0.13.1
ckadner 246265f
[SPARK-8690] [SQL] Add a setting to disable SparkSQL parquet schema m…
99c40cd
[SPARK-8647] [MLLIB] Potential issue with constant hashCode
0a468a4
[SPARK-8758] [MLLIB] Add Python user guide for PowerIterationClustering
yanboliang 5b33381
[SPARK-8223] [SPARK-8224] [SQL] shift left and shift right
tarekbecker afa021e
[SPARK-8747] [SQL] fix EqualNullSafe for binary type
cloud-fan 52302a8
[SPARK-8407] [SQL] complex type constructors: struct and named_struct
yjshen 0e553a3
[SPARK-8708] [MLLIB] Paritition ALS ratings based on both users and p…
viirya 2e2f326
[SPARK-8581] [SPARK-8584] Simplify checkpointing code + better error …
34d448d
[SPARK-8479] [MLLIB] Add numNonzeros and numActives to linalg.Matrices
MechCoder 82cf331
[SPARK-8781] Fix variables in published pom.xml are not resolved
fcbcba6
[SPARK-1564] [DOCS] Added Javascript to Javadocs to create badges for…
deroneriksson cd20355
[SPARK-7835] Refactor HeartbeatReceiverSuite for coverage + cleanup
52508be
[SPARK-8772][SQL] Implement implicit type cast for expressions that d…
rxin 7d9cc96
[SPARK-3382] [MLLIB] GradientDescent convergence tolerance
Lewuathe fc7aebd
[SPARK-8784] [SQL] Add Python API for hex and unhex
488bad3
[SPARK-7104] [MLLIB] Support model save/load in Python's Word2Vec
yu-iskw e589e71
Revert "[SPARK-8784] [SQL] Add Python API for hex and unhex"
rxin d983819
[SPARK-8782] [SQL] Fix code generation for ORDER BY NULL
JoshRosen aa7bbc1
[SPARK-6980] [CORE] Akka timeout exceptions indicate which conf contr…
BryanCutler 1a7a7d7
[SPARK-8213][SQL]Add function factorial
zhichao-li dfd8bac
Minor style fix for the previous commit.
rxin 20a4d7d
[SPARK-8501] [SQL] Avoids reading schema from empty ORC files
liancheng a59d14f
[SPARK-8801][SQL] Support TypeCollection in ExpectsInputTypes
rxin f743c79
[SPARK-8776] Increase the default MaxPermSize
yhuai 9b23e92
[SPARK-8803] handle special characters in elements in crosstab
brkyvz 2848f4d
[SPARK-8809][SQL] Remove ConvertNaNs analyzer rule.
rxin ab535b9
[SPARK-8226] [SQL] Add function shiftrightunsigned
zhichao-li f0fac2a
[SPARK-7401] [MLLIB] [PYSPARK] Vectorize dot product and sq_dist betw…
MechCoder e92c24d
[SPARK-8810] [SQL] Added several UDF unit tests for Spark SQL
spirom 4a22bce
[SPARK-8572] [SQL] Type coercion for ScalaUDFs
9fb6b83
[SPARK-8192] [SPARK-8193] [SQL] udf current_date, current_timestamp
adrian-wang f32487b
[SPARK-8777] [SQL] Add random data generator test utilities to Spark SQL
JoshRosen f35b0c3
[SPARK-8238][SPARK-8239][SPARK-8242][SPARK-8243][SPARK-8268][SQL]Add …
chenghao-intel 6b3574e
[SPARK-8270][SQL] levenshtein distance
tarekbecker 48f7aed
Fixed minor style issue with the previous merge.
rxin 347cab8
[SQL] More unit tests for implicit type cast & add simpleString to Ab…
rxin c991ef5
[SPARK-8822][SQL] clean up type checking in math.scala.
rxin 2b820f2
[MINOR] [SQL] Minor fix for CatalystSchemaConverter
viirya f9c448d
[SPARK-7137] [ML] Update SchemaUtils checkInputColumn to print more i…
rekhajoshm a0cb111
[SPARK-8549] [SPARKR] Fix the line length of SparkR
yu-iskw 6d0411b
[SQL][Minor] Update the DataFrame API for encode/decode
chenghao-intel 86768b7
[SPARK-8831][SQL] Support AbstractDataType in TypeCollection.
rxin 39e4e7e
[SPARK-8841] [SQL] Fix partition pruning percentage log message
eglp-slindemann 293225e
[SPARK-8124] [SPARKR] Created more examples on SparkR DataFrames
Emaasit 0e19464
[SPARK-8837][SPARK-7114][SQL] support using keyword in column name
cloud-fan 57c72fc
Small update in the readme file
37e4d92
[SPARK-8784] [SQL] Add Python API for hex and unhex
2471c0b
[SPARK-4485] [SQL] 1) Add broadcast hash outer join, (2) Fix SparkPla…
132e7fc
[MINOR] [SQL] remove unused code in Exchange
adrian-wang 9ff2033
[SPARK-8656] [WEBUI] Fix the webUI and JSON API number is not synced
1165b17
[SPARK-6707] [CORE] [MESOS] Mesos Scheduler should allow the user to …
96c5eee
Revert "[SPARK-7212] [MLLIB] Add sequence learning flag"
mengxr ee232db
Support List as a return type in Hive UDF
maropu 93e3d4e
Add a blank line at the end of UDFToListString
maropu 6984bf4
Apply review comments
maropu 7f812fd
Fix code-style errors
maropu af61f2e
Remove a new type
maropu fdb2ae4
Add StringToUtf8 to comvert String into UTF8String
maropu 7114a47
Add TODO comments in UDFToListString of HiveUdfSuite
maropu 2844a8e
Apply comments
maropu 92ed7a6
Throw an exception when java list type used
maropu feb1129
Fix conflicts
maropu
29 changes: 29 additions & 0 deletions
sql/hive/src/test/java/org/apache/spark/sql/hive/execution/UDFToListInt.java
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | @@ -0,0 +1,29 @@ |
| | | /* |
| | | * Licensed to the Apache Software Foundation (ASF) under one or more |
| | | * contributor license agreements. See the NOTICE file distributed with |
| | | * this work for additional information regarding copyright ownership. |
| | | * The ASF licenses this file to You under the Apache License, Version 2.0 |
| | | * (the "License"); you may not use this file except in compliance with |
| | | * the License. You may obtain a copy of the License at |
| | | * |
| | | * http://www.apache.org/licenses/LICENSE-2.0 |
| | | * |
| | | * Unless required by applicable law or agreed to in writing, software |
| | | * distributed under the License is distributed on an "AS IS" BASIS, |
| | | * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
| | | * See the License for the specific language governing permissions and |
| | | * limitations under the License. |
| | | */ |
| | | |
| | | package org.apache.spark.sql.hive.execution; |
| | | |
| | | import org.apache.hadoop.hive.ql.exec.UDF; |
| | | |
| | | import java.util.Arrays; |
| | | import java.util.List; |
| | | |
| | | public class UDFToListInt extends UDF { |
| | | public List<Integer> evaluate(Object o) { |
| | | return Arrays.asList(1, 2, 3); |
| | | } |
| | | } |
29 changes: 29 additions & 0 deletions
sql/hive/src/test/java/org/apache/spark/sql/hive/execution/UDFToListString.java
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | @@ -0,0 +1,29 @@ |
| | | /* |
| | | * Licensed to the Apache Software Foundation (ASF) under one or more |
| | | * contributor license agreements. See the NOTICE file distributed with |
| | | * this work for additional information regarding copyright ownership. |
| | | * The ASF licenses this file to You under the Apache License, Version 2.0 |
| | | * (the "License"); you may not use this file except in compliance with |
| | | * the License. You may obtain a copy of the License at |
| | | * |
| | | * http://www.apache.org/licenses/LICENSE-2.0 |
| | | * |
| | | * Unless required by applicable law or agreed to in writing, software |
| | | * distributed under the License is distributed on an "AS IS" BASIS, |
| | | * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. |
| | | * See the License for the specific language governing permissions and |
| | | * limitations under the License. |
| | | */ |
| | | |
| | | package org.apache.spark.sql.hive.execution; |
| | | |
| | | import org.apache.hadoop.hive.ql.exec.UDF; |
| | | |
| | | import java.util.Arrays; |
| | | import java.util.List; |
| | | |
| | | public class UDFToListString extends UDF { |
| | | public List<String> evaluate(Object o) { |
| | | return Arrays.asList("data1", "data2", "data3"); |
| | | } |
| | | } |
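To support a `List<>` return type, the engine has to work out the element type from the UDF's `evaluate` signature. The following is a minimal sketch of one way to do that with plain Java reflection; it is not taken from this pull request's implementation. `ListReturnTypeSketch`, `ToListInt`, and `listElementType` are hypothetical names, and `ToListInt` deliberately does not extend Hive's `UDF` so that the example compiles without Hive on the classpath.

```java
import java.lang.reflect.Method;
import java.lang.reflect.ParameterizedType;
import java.lang.reflect.Type;
import java.util.Arrays;
import java.util.List;

public class ListReturnTypeSketch {

    // Hypothetical stand-in for the test UDF above (no Hive dependency).
    public static class ToListInt {
        public List<Integer> evaluate(Object o) {
            return Arrays.asList(1, 2, 3);
        }
    }

    // Returns the element class of a method declared to return List<E>,
    // or null when the return type is not a List parameterized with a
    // concrete class (for example a raw List, List<?>, or a non-List type).
    public static Class<?> listElementType(Method m) {
        Type t = m.getGenericReturnType();
        if (!(t instanceof ParameterizedType)) {
            return null; // raw or non-generic return type
        }
        ParameterizedType p = (ParameterizedType) t;
        if (p.getRawType() != List.class) {
            return null; // some other generic type
        }
        Type arg = p.getActualTypeArguments()[0];
        return (arg instanceof Class) ? (Class<?>) arg : null;
    }

    public static void main(String[] args) throws Exception {
        Method m = ToListInt.class.getMethod("evaluate", Object.class);
        // Prints the fully qualified name of the element class.
        System.out.println(listElementType(m).getName());
    }
}
```

A caller could then map the recovered element class to the corresponding SQL array element type. The sketch only handles a `List` declared directly as the return type with a concrete class argument; nested or wildcard type arguments yield `null` here and would need separate handling.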
I am pretty concerned about internal types leaking out of the execution engine into user code here. Are there real UDFs that we are trying to support here?
Yes, some libraries already depend on this type of UDF:
https://github.com/myui/hivemall/blob/master/src/main/java/hivemall/ftvec/AddBiasUDF.java#L37
Any ideas on how to avoid this leak?