Compile a list of counties using Tyler Technologies based systems #6

Closed
OscarVanL opened this issue May 27, 2020 · 9 comments

Comments

@OscarVanL
Contributor

OscarVanL commented May 27, 2020

To build a generalised scraper that works on Tyler Technologies outsourced court record portals (or other widely used portal systems), it would be a good first step to know which counties use which portal system.

  • What are the obvious indicators of a Tyler Technologies portal?
    So far I've observed Calaveras Superior's domain (cacalaverasportal.tylerhost.net) has tylerhost in its name, though this is not the case for all Tyler Technologies portals.
    The bottom right of the page has an "Empowered by Tyler Technologies" logo.
    The bottom left of the page says '© 2020 Tyler Technologies, Inc.'.

  • Where should this list be compiled?
    I propose adding an extra column to Privacy_Public Access to Court Records State Links.csv detailing each portal's vendor.
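A rough sketch of how the indicators above could feed the proposed vendor column. It only inspects a URL and HTML that has already been saved, so it does not fetch anything itself. The `url` column name and the `pages` mapping are hypothetical, since the CSV's real headers are not shown here, and the footer check is only a heuristic (the "Empowered by" logo may be an image with no matching text).

```python
import csv
import io
from urllib.parse import urlparse

# Footer indicators from the observations above. The year is left out of the
# copyright marker because it changes over time. This is a heuristic only.
FOOTER_MARKERS = ("empowered by tyler technologies", "tyler technologies, inc.")


def guess_vendor(url, html=""):
    """Return "Tyler Technologies" if a known indicator is present, else ""."""
    if "://" not in url:
        url = "//" + url  # let urlparse treat a bare host as a network location
    host = (urlparse(url).hostname or "").lower()
    if host == "tylerhost.net" or host.endswith(".tylerhost.net"):
        return "Tyler Technologies"
    text = html.lower()
    if any(marker in text for marker in FOOTER_MARKERS):
        return "Tyler Technologies"
    return ""


def add_vendor_column(csv_text, url_field="url", pages=None):
    """Return csv_text with a "vendor" column appended.

    url_field is a hypothetical column name; pages maps URL -> saved HTML.
    """
    pages = pages or {}
    reader = csv.DictReader(io.StringIO(csv_text))
    fieldnames = list(reader.fieldnames or []) + ["vendor"]
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        url = row.get(url_field) or ""
        row["vendor"] = guess_vendor(url, pages.get(url, ""))
        writer.writerow(row)
    return out.getvalue()
```

Rows with no match get an empty vendor cell, which leaves them easy to spot for manual review.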

@ktynski
Collaborator

ktynski commented May 27, 2020

I like the idea of updating that CSV. Let's do that. Alternatively, make it a Google Doc for easier collaboration?

@OscarVanL
Contributor Author

A Google Doc would be good as it's probably going to take a lot of collaborative work to compile a list of every state and police county, as well as their court portals.

@ktynski
Collaborator

ktynski commented May 27, 2020

OK cool, would you be able to move it to a Google Doc and update the repo?

@OscarVanL
Contributor Author

OscarVanL commented May 27, 2020

I've created a Google Sheet. I set it up with 'Can Comment' permissions so it does not get vandalised. In its current state I have to approve each edit, so full-edit permissions will need to be given to appropriate people.

We can update the repository readme with a link to this Google Sheet, then periodically copy the content of the sheet to the CSV in the repo.
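The periodic copy could be scripted along these lines. This is a sketch under assumptions: it relies on Google Sheets' CSV export URL pattern and on the sheet being viewable by anyone with the link, and `sheet_id` and `dest_path` are placeholders rather than the real sheet or repo path.

```python
import urllib.request


def sheet_csv_url(sheet_id, gid=0):
    """Build the CSV export URL for one tab (gid) of a Google Sheet."""
    return f"https://docs.google.com/spreadsheets/d/{sheet_id}/export?format=csv&gid={gid}"


def sync_sheet_to_csv(sheet_id, dest_path, gid=0):
    """Download the sheet as CSV and overwrite dest_path; return bytes written."""
    with urllib.request.urlopen(sheet_csv_url(sheet_id, gid)) as resp:
        data = resp.read()
    with open(dest_path, "wb") as f:
        f.write(data)
    return len(data)
```

Running `sync_sheet_to_csv` and committing the resulting file would keep the repo copy in step with the sheet.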

@sid137

sid137 commented May 30, 2020

For identifying the sites, one simple approach is Google. TylerTech is a branded application, so they include their info on every portal: https://share.getcloudapp.com/bLueXZjd

A starting point would be a basic Google search for their footer text, excluding their own site:

"© 2020 Tyler Technologies, Inc." -site:tylertech.com

Then I'd add some filter words such as "court" or "portal" to narrow it down further:

court portal "© 2020 Tyler Technologies, Inc." -site:tylertech.com
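If several variants of these queries are needed (different years, different filter words), they can be generated rather than typed by hand. The helper names below are made up for illustration, and the sketch only builds query strings and search URLs; it does not run any searches.

```python
from urllib.parse import quote_plus


def build_query(phrase, filter_words=(), exclude_site=None):
    """Compose a search query: filter words, an exact phrase, an optional -site: term."""
    parts = list(filter_words)
    parts.append(f'"{phrase}"')
    if exclude_site:
        parts.append(f"-site:{exclude_site}")
    return " ".join(parts)


def search_url(query):
    """URL-encode the query so it can be opened in a browser."""
    return "https://www.google.com/search?q=" + quote_plus(query)


query = build_query(
    "© 2020 Tyler Technologies, Inc.",
    filter_words=("court", "portal"),
    exclude_site="tylertech.com",
)
print(query)
print(search_url(query))
```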

@salter31

salter31 commented Jun 2, 2020

I know that this isn't the place for coordinating tasks, but I have a scraper almost ready for Indiana, which uses Tyler Technologies. I did it in Golang since that's my favorite language and I didn't see the Python decision before I started. The concepts should be easy to translate later when redoing it, though. I will make the PR after the license stuff is sorted out and can put it in the correct place.

@Lokesh1411

Hi everyone, I'm new to this repository. I'd like to know what needs to be done and how I can help. Any suggestions are welcome!

@daiboz84

daiboz84 commented Jun 5, 2020

I'm new to this. How is my phone being cloned?

@salter31

salter31 commented Jun 6, 2020

I realized after some work on the scraper that it is against their ToS to pull data with any form of scraper, so I won't be posting the work until we get licensing/legal clearance that it is OK for whoever wants to actually run it. Specifically, Indiana (IN) prohibits any company contracted to provide the IT service that exposes the data from allowing that data to be scraped.

When retrieving information from MyCase, you are prohibited from (1) using or attempting to use spiders, robots, avatars, intelligent agents, or any other extraction or navigation search except for a normal browser; (2) aggregating, copying or duplicating any of the materials or information available from the site except for the small amount of materials and information temporarily required for an ordinary single use of the site;

Plus, there is no officer information exposed on the site, so the data wouldn't be useful for a per-officer analysis anyway. It seems to me that the best way to get the data is to request it from the state and get the complete records.

7 participants