Create a data model for the info required to collect "metadata" from FOIA offices #16

pkarman · 2017-07-20T20:33:08Z

The starting point is in the schema document.

Examples are available.

The assumption for the data model is that an agency may have multiple components (the decentralized FOIA office model, e.g. USDA) or a single component (the centralized model, e.g. GSA). Either way, it makes some sense for the agency-to-component relationship to be a one-to-many.

Data model includes 90% of the universal FOIA request form fields (not unique to individual office)
All fields from agency YAML are present in the data model
Data model includes submission methods and request form data

Note: Removed the 4th acceptance criteria (the build-out in Drupal), since there are now discrete issues for those (#40 and #41)

ba66e77 · 2017-07-25T11:54:52Z

What we have so far is a FOIA Office content type (screenshot of configured fields is below) which corresponds to what was referred to as a component above. The FOIA Office is associated to a Department/Agency in many-to-one manner. The FOIA Office also has an entity reference field (called the Request Submission Form) to a webform entity. This is a one-to-one relationship.

The Department/Agency is a just a taxonomy term with a name, description, and an image upload for the agency seal. Additional fields can be added if needed. An example of a Department/Agency as rendered without any styling work is shown below.

ba66e77 · 2017-07-25T11:58:36Z

@pkarman @bsweger, could one of your crew compile the field list needed from what's in the schema so we have all the details needed in the ticket?

pkarman · 2017-07-25T17:57:30Z

Here's an example of the USDA OIG component. This is pulled from
https://raw.githubusercontent.com/18F/foia-recommendations/master/prototypes/form-wizard/assets/js/agencies.js

{
"Office of the Inspector General, Department of Agriculture ( USDA )" : {
      "name" : "Office of the Inspector General",
      "fax" : "202-690-6305",
      "emails" : [
         "alison.decker@oig.usda.gov"
      ],
      "summary" : {
         "abbreviation" : "USDA",
         "description" : "The Department of Agriculture provides leadership on food, agriculture, natural resources, and related issues.",
         "name" : "Department of Agriculture",
         "usa_id" : "49015"
      },
      "address" : {
         "street" : "1400 Independence Avenue, SW",
         "city" : "Washington",
         "address_lines" : [
            "Alison Decker",
            "FOIA Officer",
            "Room 441-E Whitten Bldg."
         ],
         "state" : "DC",
         "zip" : "20250"
      },
      "phone" : "202-720-5677",
      "request_time_stats" : {
         "2011" : {
            "complex_highest_days" : "489",
            "complex_average_days" : "47",
            "simple_highest_days" : "305",
            "complex_median_days" : "20",
            "expedited_processing_median_days" : "0",
            "expedited_processing_average_days" : "0",
            "simple_average_days" : "22",
            "expedited_processing_lowest_days" : "0",
            "simple_lowest_days" : "2",
            "complex_lowest_days" : "1",
            "expedited_processing_highest_days" : "0",
            "simple_median_days" : "16"
         },
         "2010" : {
            "complex_median_days" : "100",
            "expedited_processing_median_days" : "0",
            "expedited_processing_average_days" : "0",
            "simple_average_days" : "14",
            "expedited_processing_lowest_days" : "0",
            "expedited_processing_highest_days" : "0",
            "simple_lowest_days" : "2",
            "complex_lowest_days" : "41",
            "simple_median_days" : "8",
            "complex_highest_days" : "778",
            "complex_average_days" : "156",
            "simple_highest_days" : "34"
         },
         "2014" : {
            "simple_median_days" : "61",
            "complex_lowest_days" : "42",
            "simple_lowest_days" : "2",
            "simple_average_days" : "85",
            "complex_median_days" : "141",
            "simple_highest_days" : "413",
            "complex_average_days" : "214",
            "complex_highest_days" : "682"
         },
         "2013" : {
            "simple_median_days" : "22",
            "simple_average_days" : "25",
            "simple_lowest_days" : "6",
            "complex_lowest_days" : "33",
            "complex_median_days" : "42",
            "complex_average_days" : "46",
            "simple_highest_days" : "79",
            "complex_highest_days" : "283"
         },
         "2012" : {
            "complex_average_days" : "68",
            "simple_highest_days" : "30",
            "complex_highest_days" : "133",
            "simple_median_days" : "13",
            "simple_average_days" : "13",
            "complex_lowest_days" : "31",
            "simple_lowest_days" : "1",
            "complex_median_days" : "64"
         }
      },
      "top_level" : false
   }
}

The request_time_stats are tricky. Right now they are culled from the annual report data in order to provide some meaningful contextual information to requesters for setting expectations. We don't want to introduce more data-entry burden on agencies when they are already submitting annual report info to OIP. We should talk with OIP about process and which annual report data to require here and what we might be able to automate from their existing process.

There are new fields for submissions methods and the request form that @bsweger and I will need to articulate here. Look for a few comments on this ticket as we aggregate.

pkarman · 2017-07-25T18:02:24Z

A note on the screenshot you've posted @ba66e77.

The fields for expedited processing and fee waiver instructions feel like they belong on the request form model, not at this office level. I don't have strong feelings about the model per se, but the presentation context of those fields to the agency officers feels like it belongs to the form, not the agency itself.

pkarman · 2017-07-25T18:05:09Z

Here's another example. Note that keys like reading_rooms, request_form, service_center, foia_officer and public_liason are present here and not on the previous example.

{
"National Finance Center, Department of Agriculture ( USDA )" : {
      "name" : "National Finance Center",
      "reading_rooms" : [
         [
            "Click here to access the NFC Electronic Reading Room",
            "https://www.nfc.usda.gov/FOIA/foiaereading.html"
         ]
      ],
      "summary" : {
         "usa_id" : "49015",
         "name" : "Department of Agriculture",
         "description" : "The Department of Agriculture provides leadership on food, agriculture, natural resources, and related issues.",
         "abbreviation" : "USDA"
      },
      "emails" : [
         "cheri.alsobrook@nfc.usda.gov"
      ],
      "fax" : "504-426-9706",
      "service_center" : {
         "phone" : [
            "504-426-0300"
         ],
         "name" : "Sandy Francois"
      },
      "website" : "https://www.nfc.usda.gov/FOIA/FOIA_home.html",
      "address" : {
         "city" : "New Orleans",
         "street" : "P.O. Box 60000",
         "zip" : "70160",
         "state" : "LA",
         "address_lines" : [
            "Cheri Alsobrook",
            "FOIA Officer"
         ]
      },
      "public_liaison" : {
         "name" : "Shri Alsobrook, Sandy Francois",
         "phone" : [
            "504-426-0327"
         ]
      },
      "request_form" : "https://efoia-pal.usda.gov/palMain.aspx",
      "top_level" : false,
      "foia_officer" : {
         "name" : "John Hemstreet",
         "phone" : [
            "504-426-0168"
         ]
      },
      "phone" : "504-426-0374"
   }
}

pkarman · 2017-07-25T19:15:20Z

The things we don't want from https://github.com/18F/2015-foia/blob/master/contacts/data/USDA.yaml are the keywords and (maybe) request_time_stats.

ba66e77 · 2017-07-25T19:48:46Z

@pkarman we debated the placement of the fee waver and processing fields. In the end, we put them on the office because we didn't see any instances where the form was presented without the office context and putting them on the office made the content editing UI a little nicer. If that first observation is wrong or there are other factors we're not aware of then we can change the placement.

bsweger · 2017-07-25T20:26:08Z

@pkarman and @ba66e77 ... am compiling a single field list from the various links here and have a few questions to make sure I'm capturing things correctly.

For decentralized agencies should I assume that we want to capture the hierarchical relationship between a department/sub-agency and it's main agency (e.g., animal and plant inspection --> USDA)?
If yes to above, do we know how many levels deep the hierarchies go?
For decentralized agencies, are FOIA requests sent to the sub-agencies only, or can they be sent to both a top-level agency and a sub-agency depending on the nature of the request?
The 2015 USDA YAML has a misc field that contains the name of the FOIA director, for example. Are we looking for our schema to have that kind of flexibility, or do we want to explicitly model all needed fields? Another example is the notes field.

bsweger · 2017-07-25T20:31:37Z

@ba66e77 Following up on the fee waiver and processing fields, am I thinking correctly that these will vary based on the specific FOIA requests? If that's true (and please let me know if it's not!), these do seem like attributes that should be defined as part of the request schema.

I don't have strong UI feelings, since I haven't participating in any testing to date. But we're just talking logical model/schema here?

pkarman · 2017-07-25T20:35:58Z

For decentralized agencies should I assume that we want to capture the hierarchical relationship between a department/sub-agency and it's main agency (e.g., animal and plant inspection --> USDA)?

yes. IIUC, that's represented in the screenshot by the "entity relationship" to the "Agency/Department". I.e. an office can have a parent office.

If yes to above, do we know how many levels deep the hierarchies go?

It goes one level deep.

For decentralized agencies, are FOIA requests sent to the sub-agencies only, or can they be sent to both a top-level agency and a sub-agency depending on the nature of the request?

A single request goes to a single office (component). If a requester needed to get records from multiple components within a single agency, they would need to file one request for each component.

The 2015 USDA YAML has a misc field that contains the name of the FOIA director, for example. Are we looking for our schema to have that kind of flexibility, or do we want to explicitly model all needed fields? Another example is the notes field.

I like flexibility, but I like to use it to discover patterns that the model doesn't yet have explicit slots to represent. So if folks have been using misc to capture FOIA director, I would prefer to keep the misc field and add a foia_director field.

pkarman · 2017-07-25T20:37:35Z

@bsweger The fee waiver and expedited processing instructions are static text that will be displayed on the request form, to help the requester understand how much a FOIA request might cost. It's help text. So it doesn't change from request to request, but does change from office to office (possibly) and from agency to agency (likely).

bsweger · 2017-07-25T20:42:50Z

@pkarman aaaah---thanks for the info re: fee waiver/processing....missed that they are static instructions.

bsweger · 2017-07-25T21:43:26Z

@pkarman Thanks for the info re: the hierarchies.

A single request goes to a single office (component). If a requester needed to get records from multiple components within a single agency, they would need to file one request for each component.

I phrased my original question poorly...let me try again. For an agency like USDA, would we expect that USDA itself (i.e., the top level of a decentralized agency) receive FOIA requests? Put another way, can a requester choose to route a request to the top level of a decentralized agency? Or is the choice limited only to the 2nd tier departments?

I'm asking because in the sample USDA metadata, there's a series of departments with all of the fields needed to created a request. Would we expect to have this information defined for USDA proper as well?

Does this question make sense?

pkarman · 2017-07-25T21:46:56Z

It does make sense @bsweger thanks for clarifying.

Each FOIA office can receive requests. So the main office in a decentralized agency can receive requests like any of their components.

bsweger · 2017-07-26T13:36:59Z

Ok, here's a first pass at a complete list of fields. The names aren't meant to represent what we should use; I added the prefixes to better show, in this flattened format, the 1:n relationships between a department and some of the descriptive names needed to render the form.

This exercise raised a few questions about data structures/models that we'll use to store the info, but perhaps I'm jumping ahead. Would love to use some of our sync time today to go through this and better understand how schemas are represented in the current Drupal implementation and how we can capture their change history, since we'll no doubt have to iterate as we get feedback.

Also @pkarman great point about the stats; agree that we need some thinking there too.

Field Name	1:n with department?
name
fax
emails	y
summary_abbreviation
summary_description
summary_name
summary_usa_id
address_street
address_city
address_lines
state
zip
phone
top_level_flag
reading_rooms	y
service_center_phone
service_center_name
website
request_form
foia_officer_name
foia_officer_phone
public_liaison_name
public_liaison_phone
submission_format	y
submission_name	y
submission_title	y
submission_address_lines	y
submission_email	y
submission_url	y
submission_fax	y
required_form_fields_name	y
required_form_fields_label	y
required_form_fields_regs_url	y
required_form_fields_help_text	y
required_form_fields_enum	y
additional_form_fields_name	y
additional_form_fields_label	y
additional_form_fields_help_text	y
request_stats_year	y
request_stats_complex_highest_days	y
request_stats_complex_avg_days	y
request_stats_complex_lowest_days	y
request_stats_complex_median_days	y
request_stats_simple_highest_days	y
request_stats_simple_avg_days	y
request_status_simple_lowest_days	y
request_stats_simple_median_days	y
request_stats_expedited_highest_days	y
request_stats_expedited_avg_days	y
request_stats_expedited_lowest_days	y
request_stats_expedited_median_days	y

ba66e77 · 2017-07-26T14:02:19Z

@bsweger, it seems like there are two conceptual entities here that the list is trying to represent as a single entity. To my thinking, Agency is an entity and Office/component is an entity, so we should be developing two schemas here. The Agency would contain an array of Office entities to reflect the one-to-many, has-a relationship between the Agency and its Offices.

The yml files I've seen for agencies reflect the structure you've outlined here, but what do you think of something structured more like the below?

name: Department of Energy
ID: xxxxxxxx
description: Morbi pulvinar ante vitae porttitor.
seal: doe.jpg
offices:
    office
      name: Headquarters
      email: ....
      foia officer: ...
      website: .....
      ....other office properties...
    office
      name: Chicago office
      email: ....
      foia officer: ...
      website: .....
      ....other office properties...

bsweger · 2017-07-26T15:34:14Z

@ba66e77 Yes, for sure. The table above is a flat list of all the fields I think we'll need. Modeling the relationships seems like the next logical step. In addition to the one to many between department and offices, there are also one to many relationships between and office and submission format, etc.

Would be helpful to schedule some time with @pkarman or use a meeting already on the calendar to flush out some acceptance criteria for this story? When we say "create an agency data model" are we talking about the json/yaml representation? A logical relational database model?

ba66e77 · 2017-07-26T16:40:35Z

I think what we're talking about is a pseudo-code representation. YAML, JSON, bullet-points, whatever. So long as we don't end up conflating defining the model with how the model will be communicated between the front end and the back end.

pkarman · 2017-07-26T17:48:16Z

@ba66e77 good point about not conflating how the data is stored vs how it is serialized.

Thanks @bsweger for putting together all the marbles into one list. I agree; some talking time to make sure we're all aligned about how to represent the relationships seems fine. I agree that my mental model so far has been Agency and Offices, where an Agency might have one Office (centralized) or multiple (decentralized).

pkarman · 2017-07-26T17:52:29Z

I also realize that we have used departments in some examples where in others we have used offices or components. Those all refer to the same thing: the place where a FOIA request is submitted. I think we should standardize our language. OIP refers to components and offices most often. My personal preference would be for offices.

bsweger · 2017-07-27T14:42:11Z

As I'm putting these fields into into a more hierarchical format, I still have questions about departments and offices (agreed that office is preferable to component).

I know I'm beating a dead horse here, but if an office is the entity that receives a FOIA request and a FOIA request can also be made at the department/agency level, even in a decentralized agency, would we expect to see main agencies listed as offices?

For example:

Centralized Agency

Department Name: GSA
Department attribute #1
Department attribute #2
Office:
  Name: GSA
  FOIA Officer attributes
  E-mails
  reading rooms
  submission methods
  etc.

Decentralized Agency

Department Name: USDA
Department attribute #1
Department attribute #2
Office:
  Name: USDA <-- note that main USDA is repeated here to be treated as an office that can receive FOIA requests
  FOIA Officer attributes
  E-mails
  reading rooms
  submission methods
  etc.
Office:
  Name: Agricultural Marketing Service
  FOIA Officer attributes
  E-mails
  reading rooms
  submission methods
  etc.

pkarman · 2017-07-27T15:35:20Z

You can't make a request at the agency level. I.e. in a decentralized model, you can't request records from across the entire agency (all offices) by submitting to a single main office. It's really up to the agency how they organize/name themselves, but each office must have a FOIA officer, by law, so it's the individual FOIA office to which you must send a request.

So in your example, yes, USDA might be listed twice, once as the name of the agency and once as the name of the main office. The naming of the office is up to them, and may or not be misleading. ~~You can't submit a request to the agency because the agency itself doesn't have a FOIA office.~~ [that's misleading. the agency's FOIA office is the main office] The main FOIA office is first amongst equals.

Note I have tried to use the word agency not department, since department seems to not be used by OIP at all.

bsweger · 2017-07-27T19:57:53Z

@pkarman you're right--there are way too many words for agencies, bureaus, etc. I'll use agency going forward to represent the top level.

@ba66e77 spoke with Peter b/c I needed a synchronous convo to clarify this point about the main FOIA office in a decentralized agency. It turns out that I was thinking of an agency and it's main FOIA office as one in the same. Once Peter set me straight on that, things became much clearer.

Next stab at the fields forthcoming...

bsweger · 2017-07-28T14:38:55Z

@pkarman @ba66e77 Ok, here's an "unflattened" list of the complete universe of attributes, as I understand them right now. I'm still unable to suss out some of the relationships from reading through the various YAML/JSON examples, so still have a few questions:

Can a FOIA office have more than one FOIA officer at a given time?
Can a FOIA office have more than one public liaison at a given time?
Can a FOIA office have more than one contact e-mail addy at a time?
Can a FOIA office have more than one reading room link?

Wanted to ask the above explicitly because some of the JSON/YAML artifacts indicate that the answer can be yes, but wouldn't that be pretty confusing from a requester perspective?

The format below assumes that required_form_fields and additional_form_fields are applicable across a FOIA office rather than descriptors of a particular submission format (e.g., online or e-mail).

Also pinging @adborden for his 👀 , since he was involved in this during discovery.

Look forward to hearing what I missed, etc.

agency: United States Department of Agriculture
abbreviation: USDA
usa_id: 12345
offices:
  office
    id: 99999
    name: United States Department of Agriculture
    description: USDA Main FOIA office
    email: usda_foia.usda.gov
    website: https://usda_main_foia.gov
    foia_officers:
        - name: Alice Jones
        phone: 202-222-3333
        - name: Pete Metaphor
        phone: 202-888-9999
    service_centers:
        - name: Scott Kirk
        phone: 202-123-4567
        - name: Michele Janeway
        phone: 413-444-5555 
    public_liaisons:
        - name: Jim McCoy
        phone: 202-123-4444
        - name: Sue Simile
        phone: 202-666-7777
    reading_rooms: 
        - url: http://foia_reading.usda.gov
        - url: http://more_foia_reading.usda.gov
    required_form_fields:
        - name: request_origin
          label: Request Origin
          help_text: (for example, New England Region 1A)
          regs_url: null
          enum:
          - Company
          - Individual/Self
          - Organization
    additional_form_fields:
        - name: contract_number
          label: The Contract Number
          help_text: If your request relates to a GSA contract, please provide the contract number
    submission_methods:
        - submission_format: paper
          name: Bob Smith
          title: FOIA officer
          address_lines:
            - Bob Smith
            - Room 123-B
            - A Building Name
          street: 333 Massachusetts Ave
          city: Washington
          state: DC
          zip: 02025
        - submission_format: email
          email: foia@usda.gov
        - submission_format: web
          url: https://submit_foia_form/form.aspx
        - submission_format: fax
          fax: 202-123-9876
    request_stats:
        - year: 2016
          complex_highest_days: 222
          complex_avg_days: 222
          complex_lowest_days: 222
          complex_median_days: 222
          simple_highest_days: 222
          simple_avg_days: 222
          simple_lowest_days: 222
          simple_median_days: 222
          expedited_highest_days: 222
          expedited_avg_days: 222
          expedited_lowest_days: 222
          expedited_median_days: 222

pkarman · 2017-07-28T16:34:41Z

This is great @bsweger thanks.

I think the answer to all your questions is "yes" but this feels like a great topic to talk with OIP about to clarify.

bsweger · 2017-07-28T16:53:02Z

Ok, will ask OIP about those questions.

Had a very high-level convo with Matt to your earlier point, @pkarman:

The request_time_stats are tricky. Right now they are culled from the annual report data in order to provide some meaningful contextual information to requesters for setting expectations. We don't want to introduce more data-entry burden on agencies when they are already submitting annual report info to OIP. We should talk with OIP about process and which annual report data to require here and what we might be able to automate from their existing process.

If I understood OIP correctly:

agencies are responsible for aggregating the annual statistics (via a spreadsheet w/ macros) from all of their individual FOIA offices
those aggregated numbers are sent to OIP for review by compliance folks
once OIP has reviewed the agency data and signed off, they provide a copy of the final, human-readable numbers back to the agencies, who use them to create the annual reports they're required to post online
OIP also generates an XML version of those finalized, agency-level, human-readable numbers. This XML goes to the person who maintains the Tomcat system.

It sounds like previously there wasn't necessarily a use for the more granular office-level stats once they'd been used to derive the agency-level info. But now there are two additional things to consider:

the new statutory requirement that FOIA offices post a machine-readable version of their raw stats data
the discovery story that needs to surface the office-level numbers to requesters

Which is just to say that you're right--we need more thinking here. I'll leave the stats fields at the office level in the list above, but we may also need to capture them at the agency level.

adborden · 2017-08-01T19:21:43Z

@bsweger conversation from today re: foia officer sounds like we should keep it in as is for now. I don't think their role is restricted to paper submissions.

bsweger · 2017-08-02T14:55:25Z

@adborden Since you were involved in creating the sample .yaml files in discovery, was wondering if you have any insight into these fields from the draft above:

required_form_fields:
        - name: request_origin
          label: Request Origin
          help_text: (for example, New England Region 1A)
          regs_url: null
          enum:
          - Company
          - Individual/Self
          - Organization
    additional_form_fields:
        - name: contract_number
          label: The Contract Number
          help_text: If your request relates to a GSA contract, please provide the contract number

Was the intent that required_form_fields and additional_form_fields represent information that an office needs to collect from a FOIA requester? In other words, do we expect that many of these items would be pulled from the "90%" list provided by OIP?

bsweger · 2017-08-03T16:44:59Z

@malikkotob Thanks for the pings that remind me to update this issue.

To be more explicit about where things stand:

The structure in this comment represents our current understand of the agency-->office objects, including office-specific submission methods and office-specific FOIA questions (i.e., those that aren't represented on the "90%" list provided by DOJ. That should satisfy the 2nd and 3rd acceptance criteria.
The "90%" field list is here: https://drive.google.com/open?id=0B4JtVmWTTdQEU0lraVdZNVB2d0k. That should satisfy the first acceptance criteria

In other words, it sounds like there might be 3 over-arching entities: agencies, offices, and the pool of common FOIA questions. Does that sound right?

Looks like you've opened new stories for getting this info into Drupal. Does that mean the fourth acceptance criteria here is no longer applicable?

bsweger · 2017-08-03T17:00:29Z

Oh, I forgot to add the caveat that I'd expect schema changes and updates to be part of the process. Particularly around the 90% of questions, since the research/design folks are actively testing those. If they learn that the current wording needs updated or that the office metadata schema needs to be expanded to cover things like adding an ability to customize the ordering of the common questions, we'll revisit.

But I think we're arrived at a good starting point.

malikkotob · 2017-08-03T17:08:11Z

Hi @bsweger! I was about to comment to say just that. I went ahead and split the implementation of those over-arching entities into their own actionable issues, so yes - we can go ahead and remove that fourth criteria from this ticket.

To split hairs with regards to the three entities, the entirety of the request form that an office uses to accept FOIA requests (the 90% that is common, plus the 10% that is unique) is the third entity we will capture in Drupal, as opposed to only the common 90%.

Also, no issue with with changes being part of the process. I think as those come up we can adjust in Drupal as needed, using issues in Github to track those changes.

I'll look over the 90% and the structure from your comment and let you know if I have any questions/comments. Nice work getting this along!

bsweger · 2017-08-04T13:38:54Z

@malikkotob Great, thanks. I'll remove the fourth acceptance criteria. Also, I moved this story into Review/QA on the board, since you'll be reviewing when you get back to the office.

adborden · 2017-08-04T16:20:04Z

I spoke to @bsweger offline about #16 (comment) but want to make sure to document the answer for everyone else:

Was the intent that required_form_fields and additional_form_fields represent information that an office needs to collect from a FOIA requester?

Yes. required_form_fields being what is required by regulation, addtional_form_fields being any other fields that an agency component would like to capture from the requester.

In other words, do we expect that many of these items would be pulled from the "90%" list provided by OIP?

Yes, but we can negotiate how this is implemented. The intent of the metadata file from the recommendations point of view is that the metadata file contains everything you would need to generate a request form for an agency and component. That implies that the 90% fields would be in the metadata file. It doesn't say anything about whether the 90% fields should be exposed in the backstage to FOIA officers.

From a data modeling perspective, we might want to capture the 90% fields as a separate entity from the request form entity. As mentioned in #16 (comment), there might be some research that would require us to have some customization of the 90% fields and that data would land in the request form entity. As an example, in our experience map/backstage demo, there was a possibility that a FOIA officer could choose one of three options to render the name fields (single field, first/last, or prefix/first/last/suffix). That kind of thing would be captured in the request form entity.

adborden · 2017-08-04T18:43:27Z

@malikkotob assigning to you for acceptance. Do you have what you need to move forward? Can we call this done? (part of our sprint review)

adborden · 2017-08-04T23:28:42Z

@RyanSibley reminded me that we weren't planning on putting the annually reported request processing data in the metadata file and we would not collect it from the agency backstage. Agencies already provide this information through annual and quarterly reporting processes so it does not make sense to burden agencies with that duplication.

While this data is not intended to be provided in the metadata file, the current thinking is that we wanted it to display to requesters in some form on the Portal. It's probably okay to punt on this until we know more about how we want to use that historical data. It makes sense to store that data as we have modeled, but there's a missing piece about how that data is updated in the future and how it would be delivered to the front end. I've opened #46 to capture that conversation.

ba66e77 · 2017-08-07T12:02:27Z

Since @malikkotob is out for a couple days, I'll take review of this one

ba66e77 · 2017-08-07T12:47:14Z

Coalescing from above, the attributes of each entity are:

Data Structure

Agency

agency name
abbreviation
usa_id

example

agency: United States Department of Agriculture
abbreviation: USDA
usa_id: 12345

Agency Component (formerly Office)

id
name
description
email
website
FOIA Officers - array of Person
Service Centers - array of Person
reading rooms - array of URLs
paper submission receiver - Person
paper submission address
- line 1
- line 2
- city
- state
- zip
email submission address - email
web submission url - URL
fax submission number - phone number

    id: 99999
    name: United States Department of Agriculture
    description: USDA Main FOIA office
    email: usda_foia@usda.gov
    website: https://usda_main_foia.gov
    foia_officers:
        - name: Alice Jones
        phone: 202-222-3333
        - name: Pete Metaphor
        phone: 202-888-9999
    service_centers:
        - name: Scott Kirk
        phone: 202-123-4567
        - name: Michele Janeway
        phone: 413-444-5555 
    public_liaisons:
        - name: Jim McCoy
        phone: 202-123-4444
        - name: Sue Simile
        phone: 202-666-7777
    reading_rooms: 
        - url: http://foia_reading.usda.gov
        - url: http://more_foia_reading.usda.gov
    submission_methods:
        - submission_format: paper
          name: Bob Smith
          title: FOIA officer
          address_lines:
            - Bob Smith
            - Room 123-B
            - A Building Name
          street: 333 Massachusetts Ave
          city: Washington
          state: DC
          zip: 02025
        - submission_format: email
          email: foia@usda.gov
        - submission_format: web
          url: https://submit_foia_form/form.aspx
        - submission_format: fax
          fax: 202-123-9876

Person

name
title
email address
phone number

Questions

Agency

FOIA.gov indicates there are additional attributes (below). Is there a reason not to add these to the data structure and metadata file?
- logo/seal
- description/about text
- Chief FOIA Officer

Agency Component

Confirm we are removing:
- "Required fields" and "additional fields" - as these are attributes of the form rather than the Agency Component
- Request stats
I'm suggesting we model FOIA Liaisons etc as a consistent Person type. Dissent?
Do we want to attach a name or label to each reading room?
We'll need to add an API endpoint field for those Agency Components which want us to send them submissions via POST request to their API.
We'll need to add an indicator of how the Agency Component wants us to send data to them. I suggest a preferred submission format field with option for either email or API.

ba66e77 · 2017-08-07T13:56:30Z

Fields for the Request Submission Form Template (aka, the 90%)

Field	field type	required	notes
Prefix/Title	Dropdown Selection	Optional	Usually populated with: Mr, Ms, Mrs, Dr, Miss
First Name	Text	Required	N/A
Middle Initial / Middle Name	Text	Optional - Frequently becomes required for First Party Requests	Some agencies ask for initial and some for name
Last Name	Text	Required	N/A
Suffix	Dropdown Selection	Optional	Usually populated with: Esq, MD, PHD, Jr, Sr, I, II, III, IV
Company / Organization	Text	Optional	Usually referring to an affiliation the requester would like to note, i.e., Law Firm, Media Outlet, or Civil Society Group
Email Address	Text	Varies greatly by Agency Regulations	Sometimes required, sometimes optional, and sometimes conditional
Phone Number	Numeric Text	Varies greatly by Agency Regulations	Do we want a standardized format, i.e., (111)-111-1111
Fax Number	Numeric Text	Optional	Do we want a standardized format, i.e., (111)-111-1111
Mailing Address Line 1	Text	Varies greatly by Agency Regulations	Sometimes required, sometimes optional, and sometimes conditional
Mailing Address Line 2	Text	Optional	N/A
City	Text	Varies greatly by Agency Regulations	Sometimes required, sometimes optional, and sometimes conditional
Country	Dropdown Selection	Varies greatly by Agency Regulations	Sometimes required, sometimes optional, and sometimes conditional
State/Province	Dropdown Selection	Varies greatly by Agency Regulations	Dependent on "Country" selection
Zip/Postal Code	Text	Varies greatly by Agency Regulations	Sometimes required, sometimes optional, and sometimes conditional
Processing Fees	Numeric Text	Varies greatly by Agency Regulations	Some agency regulations require the requester to identify how much they are willing to pay, others are silent Needs Explanation to Educate the Requester
Delivery Method	Dropdown Selection	Varies greatly by Agency Regulations	Usually populated with: Paper or Electronic Needs Explanation to Educate the Requester
Requester Category	Dropdown Selection	Optional	Options include: individual (non commercial), representative of the news media, educational, commercial requester Needs Explanation to Educate the Requester
Request Description	Text	Required	Need to consider character limits Needs Explanation to Educate the Requester
Request Fee Waiver	Dropdown Selection - Followed by Text box for Request	Required	Usually "yes" or "no" dropdown selction box, which is followed by a text box for request if yes is selected Needs Explanation to Educate the Requester
Request Expedited Processing	Dropdown Selection - Followed by Text box for Request	Required	Usually "yes" or "no" dropdown selction box, which is followed by a text box for request if yes is selected Needs Explanation to Educate the Requester
Attachments / Supporting Documentation	File Selection Button	Optional	This is important because it allows for submission of required documentation such as a Cert of ID that is necessary to make a first party "perfected" request - need to identify file types supported - Needs Explanation to Educate the Requester
Electronic Signature / Non SPAM Verification	Varies	Required	Some type of Captcha/signature/or other verification mechanism Needs Explanation to Educate the Requester

Questions

For Middle Initial and Middle Name:
- These are technically two different fields. Can we standardize on one of them?
- The note says they are frequently required for "First Party Requests". How do we identify what is a First Party Request?
phone numbers - if we are enforcing formatting, we need to consider if we need to handle international numbers (with country codes) and extensions
In general - are we supporting conditionally required fields? @adborden, I believe you had said previously you thought these kind of fields were out of scope.
Processing fees - I presume these should be indicated as being USD?
Attachments/Supporting documentation - is there any guidance on number of files that can be uploaded, their types (PDF, Word, Excel, .txt), or their maximum file size?

bsweger · 2017-08-07T14:48:14Z

@ba66e77 Thanks for the thoughtful questions. Adding my perspective to the your feedback in this comment re: agency and agency component.

My understanding is that we want a user, be it a developer working on the portal front-end or a 3rd party, to be able to:

get information about a FOIA agency component
get information about the mechanics of submitting a FOIA request to an specific agency component: e.g., what information is required, what the available submission methods are

The sample data we're going back and forth on in this thread represents what a person requesting agency component metadata would see. Thus, I see things like request form fields being part of the agency component metadata.

Yes, the form fields are part of a specific FOIA request, but a list of form fields--required and otherwise--is also descriptive information about an agency component's FOIA process and belongs in the metadata.

How this is all modeled in the backend is an implementation detail in the purview of you and the Drupal team. But I would expect an API call to say, https://api.foia.gov/doj/metdata/ (or whatever), to return something closer to what we have in the earlier comment (i.e., listing the form fields).

I think part of the back and forth here is that the data model part of the issue title means different things to different people, so we're all coming in from a different perspective.

To that end, I've added a story to create a higher-level dataflow diagram of this process. I don't think it needs to block this, but I would find it personally helpful to look at the metadata/form process in the context of something like the journey map we saw last week.

bsweger · 2017-08-07T15:00:25Z

Trying to articulate more of what I think we're after here. We're not really in the business of building a FOIA request form, as I understand it.

For this piece of the project, we're building a service that describes how to a build a FOIA request form. Presumably, we ourselves will be users of this service as we build the portal.

So if there's some kind of question list that's a separate database entity, that's a backend design call. But from the perspective of using this agency component metadata service, there's not separate "form" per se, if I understand the discovery team's suggested approach.

adborden · 2017-08-07T23:02:57Z

Right, @bsweger, I'm hearing three areas of discussion in this issue:

Model of the data we're collecting from FOIA Officers
Model of data we (maybe) need but we aren't collecting from agencies (annual reported stats, usa_id, maybe agency seal/logo)
Model for the metadata API which is generated from the 1 and 2 but might not include everything (and maybe should be modeled separately, and maybe not implemented as a schema instead of a full model)

My understanding is this issue is really about number 1 only, although we've touched on all the other aspects too. The other points might be covered in other issues already but it might be helpful to break them out into explicit stories if we think that is needed. I think the diagram of the data flow will also help to clarify the big picture.

adborden · 2017-08-07T23:04:06Z

Quoting Barrett Smith (2017-08-07 05:47:14)

## Questions ### Agency 1. FOIA.gov indicates there are additional attributes (below). Is there a reason not to add these to the data structure and metadata file? * logo/seal * description/about text * Chief FOIA Officer

These all seem reasonable to add. I would prefer FOIA officers **not** control the upload of a logo/seal for quality control, but it makes sense to store it (we might be able to fetch that from an API, I'm not sure).

### Agency Component 1. Confirm we are removing: * "Required fields" and "additional fields" - as these are attributes of the form rather than the Agency Component * Request stats

I spoke with @bsweger on this, and we are OK to move the "required fields" and "additional fields" to the request form. Also okay to drop Request stats for now.

2. I'm suggesting we model FOIA Liaisons etc as a consistent Person type. Dissent?

+1, Person is just a name, title, phone, and email?

3. Do we want to attach a name or label to each reading room?

I don't think a name or label is useful (again quality control). I think if we show a list of raw URLs to a requester, that will be OK.

4. We'll need to add an API endpoint field for those Agency Components which want us to send them submissions via POST request to their API.

Yes, we have an `api` type captured in submission methods[1].

4. We'll need to add an indicator of how the Agency Component wants us to send data to them. I suggest a `preferred submission format` field with option for either email or API.

Mind if we leave this out for now? Email is going to be the first to be implemented, so I'd rather punt on this for now. [1]: https://github.com/18F/foia-recommendations/blob/master/schemas.md#submission-methods

…

-- Aaron D Borden Human and Hacker

adborden · 2017-08-07T23:42:08Z

Quoting Barrett Smith (2017-08-07 06:56:30)

## Questions 1. For Middle Initial and Middle Name: * These are technically two different fields. Can we standardize on one of them? * The note says they are frequently required for "First Party Requests". How do we identify what is a First Party Request?

Yeah, I think we will standardize on one based on the research coming up. All these fields will go through some research validation as to whether they should really be in the 90% bucket. A first-party request is simply a request about yourself, e.g. requesting your file from the FBI. I don't think there's any programmatic way to determine it. It's just the nature of the request and different agencies get different types of requests. Some may get no first-party requests. My hunch (and the research will inform us better) is that the 90% fields won't be represented simply as fields when presented to FOIA officers. There might be some way customize these fields at the agency component's discretion e.g. if the agency component receives a lot of first-party requests, they would choose to include the full prefix/first/middle/last/suffix fields, but agencies that don't receive first-party requests might choose a simple single full name field. But you wouldn't get a granular menu of each individual field. In the case of supporting documents, we might just leave these in the 10% bucket. Agencies could specify they'd like a generic Attachment field for file upload (FOIA Online includes this by default for every agency), or maybe several file upload fields for specific kinds of documents. /end speculation

2. phone numbers - if we are enforcing formatting, we need to consider if we need to handle international numbers (with country codes) and extensions

+1 to storing international numbers, unless it adds complexity. Would Drupal return the numbers via API un-formatted?

3. In general - are we supporting conditionally required fields? @adborden, I believe you had said previously you thought these kind of fields were out of scope.

I don't think we understand the use case for conditionally required fields yet. If we allow them, I don't think we would expose it for everything. For example we might give some pre-canned options to select from regarding collecting requester contact information conditionally.

4. Processing fees - I presume these should be indicated as being USD?

I was hoping bitcoin ;)

5. Attachments/Supporting documentation - is there any guidance on number of files that can be uploaded, their types (PDF, Word, Excel, .txt), or their maximum file size?

I don't think we know enough about this yet. I suspect we want some reasonable limit on file size.

…

-- Aaron D Borden Human and Hacker

RyanSibley · 2017-08-08T14:03:33Z

Responding to @ba66e77's questions:

For Middle Initial and Middle Name:
These are technically two different fields. Can we standardize on one of them?

Let's standardize to Middle name. See this google sheet for a list of edited fields that we're pretty sure will be included: https://docs.google.com/a/gsa.gov/spreadsheets/d/1eJ1Bzug6Q_jtXdHtzpStIcEHva9-QWCbvhw0GjOrAw8/edit?usp=sharing (Please note the sentence case as well)

The note says they are frequently required for "First Party Requests". How do we identify what is a First Party Request?

@adborden answer ^^ that above

phone numbers - if we are enforcing formatting, we need to consider if we need to handle international numbers (with country codes) and extensions
login.gov deals with this.

Do these issues help: 18F/identity-idp#1568 and https://github.com/18F/identity-private/issues/2252?

In general - are we supporting conditionally required fields? @adborden, I believe you had said previously you thought these kind of fields were out of scope.

Are you referring to the 10 percent of fields required by some agencies? Then yes, I believe that is the goal. See issue #51

Processing fees - I presume these should be indicated as being USD?

Bitcoin would be cool.

Attachments/Supportingdocumentation - is there any guidance on number of files that can be uploaded, their types (PDF, Word, Excel, .txt), or their maximum file size?

I'm not sure about this one. I'll look around for guidance.

adborden · 2017-08-09T15:39:58Z

@RyanSibley I think by conditionally required fields, I think we mean "requesters must provide at least one of email address, mailing address".

So email address is required only when requesters do not provide a mailing address. Mailing address is required only when requesters do not provide an email address.

* FOIA-270: Adds Modal styling and functionality. * FOIA-270: Undoing autoformatting changes. * FOIA-270: Adding modal props.

[FOIA22-127] Style agency links in summary HTML (wip)

pkarman mentioned this issue Jul 20, 2017

Create authorization rules for agency data #18

Closed

leahbannon assigned bsweger, pkarman and ba66e77 Jul 25, 2017

bsweger changed the title ~~Create an agency data model~~ Create a data model for the info required to collect "metadata" from FOIA offices Aug 1, 2017

This was referenced Aug 3, 2017

Build Agency in Drupal #40

Closed

Build Agency Component in Drupal #41

Closed

Update Request Form template to include universal form fields #42

Closed

adborden assigned malikkotob Aug 4, 2017

ba66e77 unassigned malikkotob Aug 7, 2017

malikkotob closed this as completed Aug 15, 2017

malikkotob mentioned this issue Aug 29, 2017

Migrate phone and abbreviation fields for agency components #78

Closed

3 tasks

leahbannon unassigned bsweger Nov 14, 2017

brockfanning pushed a commit that referenced this issue Mar 5, 2020

FOIA-270: Modal styling and component. (#16)

9c1d209

* FOIA-270: Adds Modal styling and functionality. * FOIA-270: Undoing autoformatting changes. * FOIA-270: Adding modal props.

brockfanning pushed a commit that referenced this issue Aug 28, 2023

Merge pull request #16 from forumone/FOIA22-127

e207bfc

[FOIA22-127] Style agency links in summary HTML (wip)

Create a data model for the info required to collect "metadata" from FOIA offices #16

Create a data model for the info required to collect "metadata" from FOIA offices #16

Comments

pkarman commented Jul 20, 2017 • edited by bsweger Loading

ba66e77 commented Jul 25, 2017

ba66e77 commented Jul 25, 2017

pkarman commented Jul 25, 2017 • edited Loading

pkarman commented Jul 25, 2017

pkarman commented Jul 25, 2017

pkarman commented Jul 25, 2017

ba66e77 commented Jul 25, 2017

bsweger commented Jul 25, 2017

bsweger commented Jul 25, 2017

pkarman commented Jul 25, 2017

pkarman commented Jul 25, 2017

bsweger commented Jul 25, 2017

bsweger commented Jul 25, 2017

pkarman commented Jul 25, 2017

bsweger commented Jul 26, 2017

ba66e77 commented Jul 26, 2017

bsweger commented Jul 26, 2017

ba66e77 commented Jul 26, 2017

pkarman commented Jul 26, 2017

pkarman commented Jul 26, 2017

bsweger commented Jul 27, 2017

pkarman commented Jul 27, 2017 • edited Loading

bsweger commented Jul 27, 2017

bsweger commented Jul 28, 2017 • edited Loading

pkarman commented Jul 28, 2017

bsweger commented Jul 28, 2017

adborden commented Aug 1, 2017

bsweger commented Aug 2, 2017

bsweger commented Aug 3, 2017

bsweger commented Aug 3, 2017

malikkotob commented Aug 3, 2017 • edited Loading

bsweger commented Aug 4, 2017

adborden commented Aug 4, 2017

adborden commented Aug 4, 2017

adborden commented Aug 4, 2017

ba66e77 commented Aug 7, 2017

ba66e77 commented Aug 7, 2017

Data Structure

Agency

Agency Component (formerly Office)

Person

Questions

Agency

Agency Component

ba66e77 commented Aug 7, 2017

Fields for the Request Submission Form Template (aka, the 90%)

Questions

bsweger commented Aug 7, 2017

bsweger commented Aug 7, 2017

adborden commented Aug 7, 2017

adborden commented Aug 7, 2017 via email

adborden commented Aug 7, 2017 via email

RyanSibley commented Aug 8, 2017 • edited Loading

adborden commented Aug 9, 2017

pkarman commented Jul 20, 2017 •

edited by bsweger

Loading

pkarman commented Jul 25, 2017 •

edited

Loading

pkarman commented Jul 27, 2017 •

edited

Loading

bsweger commented Jul 28, 2017 •

edited

Loading

malikkotob commented Aug 3, 2017 •

edited

Loading

RyanSibley commented Aug 8, 2017 •

edited

Loading