{"id":61964,"date":"2024-05-30T19:29:18","date_gmt":"2024-05-30T13:59:18","guid":{"rendered":"https:\/\/www.tothenew.com\/blog\/?p=61964"},"modified":"2024-05-31T09:50:12","modified_gmt":"2024-05-31T04:20:12","slug":"comprehensive-guide-to-ua-data-backup","status":"publish","type":"post","link":"https:\/\/www.tothenew.com\/blog\/comprehensive-guide-to-ua-data-backup\/","title":{"rendered":"Comprehensive Guide to UA Data Backup"},"content":{"rendered":"<h2><b>Backup of Data<\/b><\/h2>\n<p><span style=\"font-weight: 400;\">As of <\/span><b>July 1, 2024<\/b><span style=\"font-weight: 400;\">, the storage of Universal Analytics (GA3) data will cease, representing a pivotal moment in data management practices. It&#8217;s crucial for organisations to comprehend the importance of preserving this data for future analysis and decision-making processes. Historical trends within UA data offer invaluable insights into user behaviours, preferences, and patterns over time.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Therefore, it&#8217;s essential for businesses to proactively store or download UA data before the cutoff date and continue leveraging it for informed decision-making and strategic planning endeavours.<\/span><\/p>\n<h2><b>Steps to backfill the data<\/b><\/h2>\n<p><span style=\"font-weight: 400;\">Here\u2019s a step-by-step guide to backfill the UA data to ensure no data is lost:<\/span><\/p>\n<ul>\n<li>\n<h3><b>Identify Data to Backup<\/b><\/h3>\n<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">The initial and most crucial step involves determining which data to preserve, especially given the significant volume of data within UA. 
To streamline this process, we&#8217;ve categorised the data into two distinct parts, facilitating the development of your personalised blueprint.<\/span><\/p>\n<ul>\n<li>\n<h4><b>Primary level data:\u00a0<\/b><\/h4>\n<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">The primary data encompasses information of utmost importance for the organization. Therefore, it&#8217;s imperative to securely store this data<\/span><span style=\"font-weight: 400;\">.<\/span><span style=\"font-weight: 400;\"> This includes:<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Conversion events, alongside metrics such as Total Events and Total Users.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Traffic-related insights detailing various channels through which users access the site. This can be further segmented into specific source\/medium categories.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Campaigns that attract users, including metrics like total users, total events, and conversions.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Key funnels such as the purchase funnel, checkout funnel, and other journey-related funnels. Analysing these helps in understanding historical trends over time.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Key Performance Indicators (KPIs) essential for the organization, comprising custom dimensions and metrics developed over time to gather organization-specific data.<\/span><\/p>\n<ul>\n<li>\n<h4><b>Secondary level data:\u00a0<\/b><\/h4>\n<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Secondary level data comprises important information, though it may not be immediately necessary unless the site relies heavily on content. 
This includes:<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Analysis of most-read blogs\/newsletters on the site, including metrics such as total users, average session duration, sessions, and bounce rate.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Assessment of average page load time to gauge site performance, alongside pageviews.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Demographic data such as age and gender, providing insights into user-specific demographics for further evaluation.<\/span><\/p>\n<ul>\n<li>\n<h3><b>Evaluate the time frame<\/b><\/h3>\n<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">This step is crucial as it determines the volume of data to be stored. A shorter time frame results in less data and consequently requires less storage space, while a longer timeframe necessitates more storage space.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Typically, a <\/span><b>two-year<\/b><span style=\"font-weight: 400;\"> data retention period is considered adequate for analyzing historical trends and assessing significance. This timeframe is widely accepted by most organizations.\u00a0<\/span><\/p>\n<ul>\n<li>\n<h3><b>Select the method for data backfilling<\/b><\/h3>\n<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Select the method for data backfilling wisely to ensure comprehensive data recovery and continuity in analysis.\u00a0<\/span><\/p>\n<h4><span style=\"text-decoration: underline;\"><span style=\"font-weight: 400;\">1<\/span><b>. Manually export data<\/b><\/span><\/h4>\n<p><span style=\"font-weight: 400;\">Google Analytics allows you to download the data you need for further work using an EXPORT button above the date range in each report.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Open the Google Analytics report you want to save. For example, Acquisition &gt; All Traffic &gt; Source\/Medium. 
You can then apply additional settings: add a segment, a filter, or another parameter for the report.<\/span><\/p>\n<div id=\"attachment_61963\" style=\"width: 889px\" class=\"wp-caption alignnone\"><img aria-describedby=\"caption-attachment-61963\" decoding=\"async\" loading=\"lazy\" class=\"wp-image-61963 size-full\" src=\"https:\/\/www.tothenew.com\/blog\/wp-ttn-blog\/uploads\/2024\/05\/blog1.png\" alt=\"Backfilling data - Manual approach\" width=\"879\" height=\"252\" srcset=\"\/blog\/wp-ttn-blog\/uploads\/2024\/05\/blog1.png 879w, \/blog\/wp-ttn-blog\/uploads\/2024\/05\/blog1-300x86.png 300w, \/blog\/wp-ttn-blog\/uploads\/2024\/05\/blog1-768x220.png 768w, \/blog\/wp-ttn-blog\/uploads\/2024\/05\/blog1-624x179.png 624w\" sizes=\"(max-width: 879px) 100vw, 879px\" \/><p id=\"caption-attachment-61963\" class=\"wp-caption-text\">UA backfilling<\/p><\/div>\n<p><span style=\"font-weight: 400;\">Next, in the upper right corner, click the <\/span><b>EXPORT<\/b><span style=\"font-weight: 400;\"> button. Select the file format from the drop-down menu: PDF, Google Sheets, Excel (XLSX), or CSV.<\/span><\/p>\n<p><b>Limitation<\/b><span style=\"font-weight: 400;\">\u00a0<\/span><\/p>\n<ol>\n<li><span style=\"font-weight: 400;\"> A maximum of<\/span><b> 5,000 <\/b><span style=\"font-weight: 400;\">rows can be downloaded, aligning with the display limit in the Google Analytics interface. Additional data will be aggregated as &#8220;(other)&#8221;, necessitating alternative export methods.<\/span><\/li>\n<li><span style=\"font-weight: 400;\"> High daily visit volumes may result in data sampling, impacting the accuracy of exported data.<\/span><\/li>\n<\/ol>\n<h4><span style=\"text-decoration: underline;\"><b>2. 
Export data using the Google Analytics Spreadsheet Add-on (API)<\/b><\/span><\/h4>\n<p><span style=\"font-weight: 400;\">For Universal Analytics (UA), Google offers its own Google Analytics add-on, enabling access to data directly within Google Sheets.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">With the Google Analytics Spreadsheet Add-on, you can:<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Retrieve data from multiple views<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Perform custom calculations based on report data<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Generate visualizations using built-in tools and embed them on third-party websites<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This is a viable alternative to manual export, especially when the data to be backfilled is clearly specified. However, note that creating a report may take some time, particularly when narrowing down the data.<\/span><\/p>\n<p><b>Limitations<\/b><\/p>\n<p><span style=\"font-weight: 400;\">The limitations of exporting data using the Google Analytics Spreadsheet Add-on (API) include:<\/span><\/p>\n<p><b>1<\/b><span style=\"font-weight: 400;\">. <\/span><b>Data Volume<\/b><span style=\"font-weight: 400;\">: The add-on may limit the amount of data that can be exported in a single request. Large datasets may need to be split into multiple requests, which can increase processing time and complexity.<\/span><\/p>\n<p><b>2<\/b><span style=\"font-weight: 400;\">. <\/span><b>API Quotas<\/b><span style=\"font-weight: 400;\">: The Google Analytics API has usage quotas, including limits on the number of requests per day and per user. Exceeding these quotas may result in data export failures or temporary restrictions on API access.<\/span><\/p>\n<p><b>3<\/b><span style=\"font-weight: 400;\">. 
<\/span><b>Sampling<\/b><span style=\"font-weight: 400;\">: When exporting large datasets, Google Analytics may apply data sampling to speed up processing.<\/span><\/p>\n<h4><span style=\"text-decoration: underline;\"><b>3. Exporting the Google Analytics data through Python<\/b><\/span><\/h4>\n<p><span style=\"font-weight: 400;\">To extract Universal Analytics (UA) data via API and export it to a CSV file using Python, certain prerequisites need to be met:<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; A code editor such as Visual Studio Code for writing and running the script.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Proficiency in Python programming.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Note:<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When retrieving data for extended periods, such as<\/span><b> 3-4 years<\/b><span style=\"font-weight: 400;\">, heavier sampling is applied, though still less than in the GA portal. For greater data accuracy, it&#8217;s advisable to fetch the data one quarter at a time, which reduces sampling.<\/span><\/p>\n<p><b>Limitations<\/b><\/p>\n<p><span style=\"font-weight: 400;\">When backfilling Universal Analytics (UA) data via Python, there are several limitations to consider:<\/span><\/p>\n<ol>\n<li><b> API Quotas:<\/b><span style=\"font-weight: 400;\"> If you&#8217;re using APIs to fetch UA data, there are often limits on the number of requests you can make within a given time period.<\/span><\/li>\n<li><b> Rate Limits:<\/b><span style=\"font-weight: 400;\"> Similar to API quotas, there may be rate limits imposed by the data source or API provider. 
Exceeding these limits can result in throttling or temporary bans on API access.<\/span><\/li>\n<li><b> Data Processing Time:<\/b><span style=\"font-weight: 400;\"> Processing time can vary depending on factors such as the size of the dataset, the complexity of the queries, and the performance of the underlying infrastructure.<\/span><\/li>\n<\/ol>\n<h4><span style=\"text-decoration: underline;\"><b>4. Exporting the Google Analytics data to BigQuery\u00a0<\/b><\/span><\/h4>\n<p><span style=\"font-weight: 400;\">Exporting Universal Analytics data to BigQuery requires the following prerequisites:<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; A Google Cloud Platform account with billing enabled.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; A BigQuery project with billing enabled to store the UA data.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Access to Supermetrics, a third-party connector solution facilitating the transfer of UA data to BigQuery, as the native BigQuery export is restricted to UA 360 users and not available to the general user base.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">&#8211; Proficiency in SQL to execute queries on UA data within BigQuery.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">\u00a0<\/span><b>Limitations<\/b><\/p>\n<ol>\n<li><b> Data Transfer Costs<\/b><span style=\"font-weight: 400;\">: Exporting UA data to BigQuery can incur data transfer costs, especially if you&#8217;re dealing with large datasets or frequent data updates.\u00a0<\/span><\/li>\n<\/ol>\n<ol start=\"2\">\n<li><b> Data Volume and Storage:<\/b><span style=\"font-weight: 400;\"> Large datasets may require additional storage capacity and processing resources, which can increase costs and impact performance.<\/span><\/li>\n<\/ol>\n<ul>\n<li>\n<h3><b>Choose Storage Option<\/b><\/h3>\n<\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Once you&#8217;ve chosen the method, the next step is determining the storage 
location for the data, ensuring convenient visualization and analysis whenever required<\/span><span style=\"font-weight: 400;\">.<\/span><\/p>\n<h4><span style=\"font-weight: 400;\">\u00a0<\/span><b>\u00a0\u00a0a. Cloud Storage Solutions:<\/b><\/h4>\n<p><span style=\"font-weight: 400;\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0&#8211; Google Cloud Storage offers a free tier with limited storage and access to various features. Users can store UA data securely and scale storage as needed.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0&#8211; Amazon S3 also provides a free tier with limited storage for new users. It&#8217;s a reliable option for storing backups of UA data with options for data retrieval and management.<\/span><\/p>\n<h4><b>\u00a0\u00a0\u00a0b. On-Premises Storage Systems:<\/b><\/h4>\n<p><span style=\"font-weight: 400;\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0&#8211; Utilizing existing on-premises storage infrastructure incurs no additional cost apart from maintenance and operational expenses. This option provides full control over data storage and security, ideal for organizations with stringent compliance requirements.<\/span><\/p>\n<h4><b>\u00a0\u00a0\u00a0c. Cloud Data Warehouses:<\/b><\/h4>\n<p><span style=\"font-weight: 400;\">\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0&#8211; Cloud data warehouses like Google BigQuery, Amazon Redshift, or Snowflake offer scalable and high-performance storage solutions for the analytical processing of UA data.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In summary, backing up UA data preserves valuable historical information for future use. 
Choosing the right method ensures data integrity and accessibility, enabling organizations to navigate challenges, make informed decisions, and drive success with confidence.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Backup of Data As of July 1, 2024, the storage of Universal Analytics (GA3) data will cease, representing a pivotal moment in data management practices. It&#8217;s crucial for organisations to comprehend the importance of preserving this data for future analysis and decision-making processes. Historical trends within UA data offer invaluable insights into user behaviours, preferences, [&hellip;]<\/p>\n","protected":false},"author":1831,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"iawp_total_views":5},"categories":[5873],"tags":[5932,5934,5933],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/posts\/61964"}],"collection":[{"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/users\/1831"}],"replies":[{"embeddable":true,"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/comments?post=61964"}],"version-history":[{"count":4,"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/posts\/61964\/revisions"}],"predecessor-version":[{"id":62108,"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/posts\/61964\/revisions\/62108"}],"wp:attachment":[{"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/media?parent=61964"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/categories?post=61964"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.tothenew.com\/blog\/wp-json\/wp\/v2\/tags?post=61964"}],"curies":[{"name":"wp","href":"https:\/\/a
pi.w.org\/{rel}","templated":true}]}}