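I have an AWS Glue job that uses Spark and Scala, with a JDBC connection specified in the script, for custom ETL and data decryption. When the job runs in an environment where the database is not publicly accessible, it fails with a communications link failure. I defined Glue and S3 endpoints in the same VPC as the database, but that did not help.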

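I also tried creating a dummy connection and a network connection, but once a connection is added to the job, the job never finishes and hangs. Log with the connection attached: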

Jun 22, 2021, 7:46:47 AM 21/06/22 14:46:47 WARN ApacheUtils: NoSuchMethodException was thrown when disabling normalizeUri. This indicates you are using an old version (< 4.5.8) of Apache http client. It is recommended to use http client version >= 4.5.9 to avoid the breaking change introduced in apache client 4.5.7 and the latency in exception handling. See https://github.com/aws/aws-sdk-java/issues/1919 for more information
Jun 22, 2021, 7:46:48 AM 21/06/22 14:46:48 INFO Utils: Successfully started service 'sparkDriver' on port 37917.
Jun 22, 2021, 7:46:50 AM 21/06/22 14:46:50 INFO GlueContext: GlueMetrics configured and enabled
Jun 22, 2021, 7:47:48 AM 21/06/22 14:47:48 WARN EC2MetadataUtils: Unable to retrieve the requested metadata (/latest/user-data/). The requested metadata is not found at http://169.254.169.254/latest/user-data/
Jun 22, 2021, 7:47:48 AM 21/06/22 14:47:48 ERROR UserData: Error encountered while try to get user data
Jun 22, 2021, 7:47:48 AM 21/06/22 14:47:48 INFO MultipartUploadOutputStream: close closed:false s3://{{bucket}}/spark/spark-application-1624373208897.inprogress
Jun 22, 2021, 7:48:47 AM 21/06/22 14:48:47 INFO MultipartUploadOutputStream: close closed:false s3://{{bucket}}/spark/spark-application-1624373208897.inprogress
Jun 22, 2021, 7:49:47 AM 21/06/22 14:49:47 INFO MultipartUploadOutputStream: close closed:false s3://{{bucket}}/spark/spark-application-1624373208897.inprogress
Jun 22, 2021, 7:50:47 AM 21/06/22 14:50:47 INFO MultipartUploadOutputStream: close closed:false s3://{{bucket}}/spark/spark-application-1624373208897.inprogress
Jun 22, 2021, 7:51:47 AM 21/06/22 14:51:47 INFO MultipartUploadOutputStream: close closed:false s3://{{bucket}}/spark/spark-application-1624373208897.inprogress
Jun 22, 2021, 7:52:47 AM 21/06/22 14:52:47 INFO MultipartUploadOutputStream: close closed:false s3://{{bucket}}/spark/spark-application-1624373208897.inprogress

Without a connection:

Jun 21, 2021, 8:29:12 PM 21/06/22 03:29:12 ERROR ProcessLauncher: InvocationTargetException java.lang.reflect.InvocationTargetException
Jun 21, 2021, 8:29:12 PM 21/06/22 03:29:12 ERROR ProcessLauncher: Exception in User Class
Jun 21, 2021, 8:29:12 PM 21/06/22 03:29:12 ERROR ProcessLauncher: Exception in User Class: com.mysql.cj.jdbc.exceptions.CommunicationsException : Communications link failure
    com.mysql.cj.jdbc.exceptions.SQLError.createCommunicationsException(SQLError.java:174)
    com.mysql.cj.jdbc.exceptions.SQLExceptionsMapping.translateException(SQLExceptionsMapping.java:64)
    com.mysql.cj.jdbc.ConnectionImpl.createNewIO(ConnectionImpl.java:836)
    com.mysql.cj.jdbc.ConnectionImpl.<init>(ConnectionImpl.java:456)
    com.mysql.cj.jdbc.ConnectionImpl.getInstance(ConnectionImpl.java:246)
    com.mysql.cj.jdbc.NonRegisteringDriver.connect(NonRegisteringDriver.java:198)
    org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$$anonfun$createConnectionFactory$1.apply(JdbcUtils.scala:63)
    org.apache.spark.sql.execution.datasources.jdbc.JdbcUtils$$anonfun$createConnectionFactory$1.apply(JdbcUtils.scala:54)
    org.apache.spark.sql.execution.datasources.jdbc.JDBCRDD$.resolveTable(JDBCRDD.scala:56)
    org.apache.spark.sql.execution.datasources.jdbc.JDBCRelation$.getSchema(JDBCRelation.scala:210)
    org.apache.spark.sql.execution.datasources.jdbc.JdbcRelationProvider.createRelation(JdbcRelationProvider.scala:35)
    org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:318)
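
Since the trace above fails while opening the connection (`ConnectionImpl.createNewIO`), it may help to separate a network-path problem from a JDBC-level one. Below is a minimal sketch of a plain TCP probe that could be called at the top of the job script before any JDBC read. The object name `DbReachability`, the `probe` helper, and the host `mydb.example.internal` are hypothetical placeholders, not part of the actual job; the real host and port would come from the JDBC URL.

```scala
import java.net.{InetSocketAddress, Socket}
import scala.util.{Failure, Success, Try}

object DbReachability {
  /** Attempts a plain TCP connect to host:port.
    * Returns Right(()) on success, or Left with the exception class and message.
    */
  def probe(host: String, port: Int, timeoutMs: Int = 5000): Either[String, Unit] = {
    val socket = new Socket()
    // connect throws on timeout, refusal, or an unresolvable host name
    val result = Try(socket.connect(new InetSocketAddress(host, port), timeoutMs))
    Try(socket.close()) // always release the socket, ignoring close errors
    result match {
      case Success(_) => Right(())
      case Failure(e) => Left(s"${e.getClass.getSimpleName}: ${e.getMessage}")
    }
  }

  def main(args: Array[String]): Unit = {
    // Hypothetical endpoint; replace with the host and port from the JDBC URL.
    val host = "mydb.example.internal"
    val port = 3306
    probe(host, port) match {
      case Right(_)  => println(s"TCP connect to $host:$port succeeded")
      case Left(err) => println(s"TCP connect to $host:$port failed: $err")
    }
  }
}
```

As a rough guide, a `SocketTimeoutException` here would usually point at routing or security group rules, a `ConnectException` with "Connection refused" at a reachable host with nothing listening on that port, and an `UnknownHostException` at DNS resolution. If the probe succeeds and the JDBC read still fails, the problem is more likely in the driver settings or credentials than in the network path.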