Solved

use google spreadsheet to analyze # of stops each day

Posted on 2013-11-13
7
374 Views
Last Modified: 2015-03-02
I have a google spreadsheet that contains a list of id#s and date/times.  For each id#, for each date, Id like to count the # of entries.  For example, id 3496978724, starting in row 2, has 4 entries for 10/1/2013.  After we count the # of entries an ID has for the day, the following rules should be applied.  If the # of entries is:

1
This should be considered "One Way."  All records for that id for the day should be classified as "One Way"

2
If the difference between the two times is less than 2.5 hours, all records for that id for the day should be classified as "One Way"

If the difference between the two times is greater than 2.5 hours, all records for that id for the day should be classified as "Round Trip"

3
If the difference between any two times is more than 2.5 hours apart, all records for that id for the day should be classified as "Round Trip"

Otherwise, all records for that id for the day should be classified as "One Way"

4
If the difference between any two times is more than 2.5 hours apart, all records for that id for the day should be classified as "Round Trip"

Otherwise, all records for that id for the day should be classified as "One Way"

5 or greater
If the difference between any two times is more than 2.5 hours apart, all records for that id for the day should be classified as "Round Trip"

Otherwise, all records for that id for the day should be classified as "One Way"

In the end
Id like a list of each id/date combination, along with whether that corresponding date was roundtrip or one way.


https://docs.google.com/spreadsheet/ccc?key=0Am5eEooQEL63dGc3NksyM3JhRTl0QVQwclAtSXdoUmc&usp=sharing
0
Comment
Question by:colonialiu20
  • 3
  • 3
7 Comments
 
LVL 16

Expert Comment

by:Peter Kwan
ID: 39650792
I guess 2, 3, 4 and >=5 are all the same logic, so they can be combined as following:

1: Always output "One Way"
>=2: If the difference of any two times are larger than 2.5 hours, then output "Round Trip", otherwise "One Way"

I assumed you already grouped the student ID together and sort the data in the order of date within the same student ID. In this case, you may try to create a script (Tools -> Script Editor) and add a function like this:

function setOneWayOrRoundTrip() {
  var sheet = SpreadsheetApp.getActiveSpreadsheet();
  var rows = sheet.getDataRange();
  var numRows = rows.getNumRows();
  var values = rows.getValues();
  var count = 0;
  
  for (var i = 1; i <= numRows - 1; i++) {
    var row = values[i];
    var studentID = row[0];
    var time = row[1];
    var date = row[2];
    var prevRow = values[i-1];
    var prevStudentID = prevRow[0];
    var prevDate = prevRow[2];
    
    if (studentID != prevStudentID || date != prevDate) {
      if (i > 1) {
        if (count == 1) {
          sheet.getRange("D" + i).setValue("One Way");
        }
        else {
          var isOneWay = true;
          for (var j = i; j > i-count; j--) {
            var time1 = sheet.getRange("B" + j).getValue();
            var time2 = sheet.getRange("B" + (j-1)).getValue();
            if (time1 - time2 > 60*1000*60*2.5) {
              isOneWay = false;
              break;
            }
          }
          for (var j = i; j>i-count; j--)          
            sheet.getRange("D" + j).setValue(isOneWay?"One Way":"Round Trip");
        }
      }
      count = 1;
    }
    else 
      count++;
  }
  if (count == 1) {
    sheet.getRange("D" + numRows).setValue("One Way");
  }
  else {
     var isOneWay = true;
     for (var j = numRows; j > numRows-count; j--) {
       var time1 = sheet.getRange("B" + j).getValue();
       var time2 = sheet.getRange("B" + (j-1)).getValue();
       if (time1 - time2 > 2.5/24.0) {
         isOneWay = false;
         break;
       }
     }
     for (var j = numRows; j>numRows-count; j--)          
        sheet.getRange("D" + j).setValue(isOneWay?"One Way":"Round Trip");
  }
};

Open in new window


Since I found my timezone is different from the data, so I added a column C to get the date as shown in column B by a formula "=TEXT(B?, "YYYY/MM/DD")" for each row.

A sample is referenced below:
https://docs.google.com/spreadsheet/ccc?key=0AoYcUhd_FqORdEZDLWcwZmVWeUFRVEdQODBsUE52RlE&usp=sharing

Please try. Thanks.
0
 

Author Comment

by:colonialiu20
ID: 39651647
Thank you!!!  Is there a way to now pull a record of every unique id #, date, combination, and pull wether it was a round trip or one way.  Similar to what is in f2:h7.

Thank you again
0
 
LVL 16

Expert Comment

by:Peter Kwan
ID: 39652758
A simple way, I dragged column B to column D such that column B is now the date only and C is the result.

Then at cell F2, I add the following formula "=UNIQUE(A2:C)". After entering, it will show the unique results in F:H automatically.

The reference spreadsheet is modified for your reference.
0
Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

 

Author Comment

by:colonialiu20
ID: 39656867
Thank you.  There appears to be a problem with the script that i cant figure out.  Take a look at my spreadsheet, https://docs.google.com/spreadsheet/ccc?key=0Am5eEooQEL63dGdTT0pmclZ0QjVkeVZhWFFvZER6cXc&usp=sharing

If the last two records are:

456      11/19/2013            11/19/2013 8:18:00
456      11/19/2013            11/19/2013 8:20:00

The script will calculate it as a round trip.  This only happens on the last records.

If i put this as the last record:
999      11/20/2013            11/20/2013 8:00:00

it will calculate the student 456 record above correctly as one way as well as calculate 999 as a one way trip.  There just seems to be a problem when the last records are for round trip.  Any thoughts?
0
 

Author Comment

by:colonialiu20
ID: 39657749
In addition to the above problem, the script keeps on timing out after a few hundred rows.  is there any way to extend the timeout time.  If not, is there a way to modify that script for excel?
0
 
LVL 16

Accepted Solution

by:
Peter Kwan earned 500 total points
ID: 39658713
1) Sorry I made a mistake.
2) I have revised and split the function into separate pieces, but I still have no idea how I can  run all using trigger to prevent timeout. The timeout default is ~5 minutes which cannot be modified.
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article will guide you to convert a grid from a picture into Excel format using Microsoft OneNote and no other 3rd party application.
Some code to ensure data integrity when using macros within Excel. Also included code that helps secure your data within an Excel workbook.
This Micro Tutorial demonstrates using Microsoft Excel pivot tables, how to reverse engineer competitors' marketing strategies through backlinks.
This Micro Tutorial will demonstrate in Google Sheets how to use the HYPERLINK function to create live links inside your spreadsheet.

920 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

14 Experts available now in Live!

Get 1:1 Help Now