etl

orientdb load graph csv of nodes and edges

醉酒当歌 提交于 2019-11-29 17:30:02
I'm a newbie in Orientdb . I have a csv file which has both the nodes and the edge and I need to create a graph out of that csv file . csv file "p1","p2","score" "LGG_00001","LGG_01682",282 "LGG_00001",".LGG_01831",183 "LGG_00001","LGG_01491",238 The edge is IsActingWith which had the score attribute { "source": { "file": { "path": "C:/Users/sams/Desktop/OrientDB2/lac2.csv" } }, "extractor": { "csv": {} }, "transformers": [ { "vertex": { "class": "lac2" } }, { "vertex": { "class": "lac2" } }, { "edge": { "class": "IsActingWith", "joinFieldName": "score_p", "lookup": "acore", "direction": "out"

SSIS process for saving .xlsx file as .csv file

社会主义新天地 提交于 2019-11-29 16:56:57
I am trying to download a .xlsx excel file from FTP server and save it in a .csv file format. I was able to download a file from server using ftp task in SSIS and save it in a local folder now I want to save that file as csv file format for import process. I could not find a conversion method or task from .xlxs to csv. i tried script task but it didn't work. can someone please help. You can add a Script task to achieve this, and inside the script you can use Interop Library: Converting XLSX file using to a CSV file Convert .xlsx & .xls to .csv How to Convert Excel to CSV using Interop The

Unzip .tar.gz files in SSIS

喜夏-厌秋 提交于 2019-11-29 16:39:16
I have a .tar.gz file. Now i need to unpack these files with SSIS package. Previously did unzip and delete for .zip files with the help of For each container and script task. Not sure how to do it for .tar.gz files. Any help? You can use an execute process task to achieve this (or using process from Script task) , but you have to install a ZIP application like 7Zip or Winzip or else. And use command line to Zip or Unzip archives. Follow one of these links for more details: Zip a folder using SSIS 7Zip command line examples What command line parameters does WinZip support? 来源: https:/

Oracle provider for Oledb missing in VS 2015 Shell

岁酱吖の 提交于 2019-11-29 16:13:49
I am migrating to SSIS 2016 version. I am trying to use the Oracle provider for Oledb in connections. However, this option does not show. I have installed the Oracle client 12.2 and I am able to do UDL file testing, where in I can see the Oracle provider and able to test connection. But when I try in VS 2015, the option is not shown. The issue is described here - https://jorgklein.com/2011/06/02/ssis-connect-to-oracle-on-a-64-bit-machine-updated-for-ssis-2008-r2/ and based on this I have installed the 64 bit client . But when trying to use the connection manager, the oracle provider is not

Format excel destination column in ssis script task

夙愿已清 提交于 2019-11-29 16:10:13
Is it possible to format a column in an excel destination in ssis before generating it? I'm thinking a script task? I want to format a column to be date/time format within the excel spreadsheet You can use Microsoft.Interop.Excel library and use NumberFormat property to change EntireColumn format to datetime. Note: you have to add Microsoft.Office.Interop.Excel.dll file to the following directories (.Net Framework dll directory) C:\Windows\Microsoft.NET\Framework\v2.0.50727 and (sql server data tools dll directory) C:\Program Files\Microsoft SQL Server\100\DTS\Binn (if using vs 2005 and sql

SSIS failing to save packages and reboots Visual Studio

偶尔善良 提交于 2019-11-29 15:33:33
This is my first experience with SSIS so bear with me... I am using SSIS to migrate tables from Oracle to SSMS, there are some very large tables I am trying to transfer (50 million rows +). SSIS is now completely freezing up and rebooting VS when I am just trying to save the package (not even running it). It keeps returning errors of insufficient memory, however, I am working on a remote server that has well over the RAM it takes to run this package. Error Message when trying to save The only thing I can think of is when this package is attempting to run, my Ethernet Kbps are through the roof

How to join multiple azure databases without rights to configure external tables?

和自甴很熟 提交于 2019-11-29 12:30:07
In my current setup I connect to an Azure SQL Server using Authentication=Active Directory - Integrated. This method of access only allows access to a single database at a time. The architecture was migrated from an on premises SQL server environment with changes to make cloud development feasible, but still analytics and debugging must occur across databases. Typically one would simply do a cross database join with a legacy SQL Server configuration, possibly involving link servers if the databases were on separate servers. In the cloud or azure you are supposed to use Elastic search or avoid

SQL Server Destination vs OLE DB Destination

人走茶凉 提交于 2019-11-29 11:57:01
I was using OLE Db destination for Bulk import of multiple Flat Files. After some tuning I ended up with SQL Server Destination to be 25 - 50 % faster. Though I am confused about this destination as there are contradictory information on the web, some are against it, some are suggesting using it. I would like to know, are there any serious pitfalls before I deploy it to production? Thanks In this answer, I will try to provide information from official documentation of SSIS and I will mention my personal experience with SQL Server destination. 1. SQL Server Destination According to the official

What is a staging table?

喜夏-厌秋 提交于 2019-11-29 11:49:13
问题 Are staging tables used only in Data warehouse project or in any SSIS Project? I would like to know what is a staging table? Can anyone give me some examples on how to use it and in what circumstances it is implemented? Also, may I please know the best practices while using it? 回答1: staging tables are just database tables containing your business data in some form or other. Staging is the process of preparing your business data, usually taken from some business application. For your average

How to join two CSVs with Apache Nifi

时光怂恿深爱的人放手 提交于 2019-11-29 07:51:09
I'm looking into ETL tools (like Talend) and investigating whether Apache Nifi could be used. Could Nifi be used to perform the following: Pick up two CSV files that are placed on local disk Join the CSVs on a common column Write the joined CSV to disk I've tried setting up a job in Nifi, but couldn't see how to perform the join of two separate CSV files. Is this task possible in Apache Nifi? It looks like the QueryDNS processor could be used to perform enrichment of one CSV file using the other, but that seems to be over-complicated for this use case. Here's an example of the input CSVs,