python呼び出しboto3.client.create_data_source_from_s3

私は、AWS Machine LearningのバッチプロセスをPythonプロジェクトから使用しようとしています。私はboto3を使用しています。私は応答でこの失敗メッセージを受け取ります。python呼び出しboto3.client.create_data_source_from_s3

は、スキーマを解析しようエラーが発生しました：\ 'START_ARRAY \ nは[ソースのトークンのうち、ブールのインスタンスをデシリアライズできません： [email protected]。行：1、列：2]（参照を通じてチェーン： com.amazon.eml.dp.recordset.SchemaPojo [ "dataFileContainsHeader"]）

\私は作品を使用しています.csvファイル。私はこれがコンソールプロセスを通して働いたのでこれを知っています。

ここは私のコードです。それが処理されるファイルへのURLを保持しているDjangoのモデル（INPUT_FILE）内の関数である：

def create_data_source_from_s3(self): 
     attributes = [] 
     attribute = { "fieldName": "Var1", "fieldType": "CATEGORICAL" } 
     attributes.append(attribute) 
     attribute = { "fieldName": "Var2", "fieldType": "CATEGORICAL" } 
     attributes.append(attribute) 
     attribute = { "fieldName": "Var3", "fieldType": "NUMERIC" } 
     attributes.append(attribute) 
     attribute = { "fieldName": "Var4", "fieldType": "CATEGORICAL" } 
     attributes.append(attribute) 
     attribute = { "fieldName": "Var5", "fieldType": "CATEGORICAL" } 
     attributes.append(attribute) 
     attribute = { "fieldName": "Var6", "fieldType": "CATEGORICAL" } 
     attributes.append(attribute) 

     dataSchema = {} 
     dataSchema['version'] = '1.0' 
     dataSchema['dataFormat'] = 'CSV' 
     dataSchema['attributes'] = attributes 
     dataSchema["targetFieldName"] = "Var6" 
     dataSchema["dataFileContainsHeader"] = True, 
     json_data = json.dumps(dataSchema) 

     client = boto3.client('machinelearning', region_name=settings.region, aws_access_key_id=settings.aws_access_key_id, aws_secret_access_key=settings.aws_secret_access_key) 
     #create a datasource 
     return client.create_data_source_from_s3(
      DataSourceId=self.input_file.name, 
      DataSourceName=self.input_file.name, 
      DataSpec={ 
       'DataLocationS3': 's3://' + settings.AWS_S3_BUCKET_NAME + '/' + self.input_file.name, 
       'DataSchema': json_data, 
      }, 
      ComputeStatistics=True 
      )

私が間違ってやっている任意のアイデア？

出典

2017-02-28 HenryM

はコンマを削除

dataSchema["dataFileContainsHeader"] = True,

をこれは、タプルを追加していると考えるのはPythonを引き起こしています。だからあなたのdataSchemaは実際（真）

含まれており、出力は、代わりにこの

"dataFileContainsHeader": true

のようなものを期待している。この

{"dataFileContainsHeader": [true], "attributes": [{"fieldName": "Var1", "fieldType": "CATEGORICAL"}, {"fieldName": "Var2", "fieldType": "CATEGORICAL"}, {"fieldName": "Var3", "fieldType": "NUMERIC"}, {"fieldName": "Var4", "fieldType": "CATEGORICAL"}, {"fieldName": "Var5", "fieldType": "CATEGORICAL"}, {"fieldName": "Var6", "fieldType": "CATEGORICAL"}], "version": "1.0", "dataFormat": "CSV", "targetFieldName": "Var6"}

AWSのように見えます

出典

2017-02-28 13:14:58

python呼び出しboto3.client.create_data_source_from_s3

答えて

関連する問題