Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: Added and linted all the files for Agent-Assist App. #541

Open
wants to merge 120 commits into
base: main
Choose a base branch
from

Conversation

nitchandak
Copy link

@nitchandak nitchandak commented Apr 16, 2024

Description

Thank you for opening a Pull Request!
Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

  • Follow the CONTRIBUTING Guide.
  • You are listed as the author in your notebook or README file.
    • Your account is listed in CODEOWNERS for the file(s).
  • Make your Pull Request title in the https://www.conventionalcommits.org/ specification.
  • Ensure the tests and linter pass (Run nox -s format from the repository root to format).
  • Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

@nitchandak nitchandak requested a review from a team as a code owner April 16, 2024 07:12
Copy link

conventional-commit-lint-gcf bot commented Apr 16, 2024

🤖 I detect that the PR title and the commit message differ and there's only one commit. To use the PR title for the commit history, you can use Github's automerge feature with squashing, or use automerge label. Good luck human!

-- conventional-commit-lint bot
https://conventionalcommits.org/

@nitchandak nitchandak changed the title Added and linted all the files. feat: Added and linted all the files. Apr 16, 2024
@nitchandak nitchandak changed the title feat: Added and linted all the files. feat: Added and linted all the files for Agent-Assist App. Apr 16, 2024
@holtskinner holtskinner added the owlbot:run Add this label to trigger the Owlbot post processor. label Apr 16, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label Apr 16, 2024
Copy link
Collaborator

@holtskinner holtskinner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is an incredibly large Pull Request.

Quite a few of these files probably shouldn't be checked into a Git Repository.

We have a Google Cloud Storage Bucket gs://github-repo/ for related files for Notebooks/Sample Apps in this repo. I can upload files for you.

Can you please go through the files and determine which of them are needed for the app to function? The others, like the pickle, JSON, txt, images should probably go in the bucket and be linked from the other files to prevent the repo from getting too large.

@holtskinner holtskinner added the owlbot:run Add this label to trigger the Owlbot post processor. label Apr 18, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label Apr 18, 2024
@nitchandak
Copy link
Author

This is an incredibly large Pull Request.

Quite a few of these files probably shouldn't be checked into a Git Repository.

We have a Google Cloud Storage Bucket gs://github-repo/ for related files for Notebooks/Sample Apps in this repo. I can upload files for you.

Can you please go through the files and determine which of them are needed for the app to function? The others, like the pickle, JSON, txt, images should probably go in the bucket and be linked from the other files to prevent the repo from getting too large.

We have identified these files and folders that can be stored in cloud bucket:
backend/src/chatbot_dir/agents/search_agent/preprocessing/table/one_shot/input.txt
backend/src/chatbot_dir/agents/search_agent/preprocessing/table/one_shot/output.txt
backend/src/data
frontend/src/components/Chatbot/google-bard-icon.png
frontend/src/components/Chatbot/send-button.png
frontend/src/pages/HomePage/home_page.png
frontend/src/pages/Layout/robot-assistant.png
frontend/src/logo.svg

@holtskinner holtskinner added the owlbot:run Add this label to trigger the Owlbot post processor. label Apr 19, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label Apr 19, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 3, 2024
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label May 3, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 3, 2024
@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label May 3, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 3, 2024
@nitchandak nitchandak removed their assignment May 8, 2024
@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label May 8, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 8, 2024
@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label May 15, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 15, 2024
Copy link

@check-spelling-bot Report

🔴 Please review

See the 📂 files view, the 📜action log, or 📝 job summary for details.

Unrecognized words (1052)
abf
abiosoft
Accrux
acf
ADTS
advertiserspending
advertisingverticals
aee
afactor
againstburglary
agentname
AGG
ainaomotayo
AIP
aiplatform
aipv
akh
alcuna
aliciawilliams
Allianz
alloydb
alphafold
alsocontributed
alsoreflected
amd
ampm
analysisremote
anantnawal
andcost
andlegal
anduncertainty
anihm
anincreased
aofq
APAC
apiid
applehelp
apps
appuser
AQB
Arogyasanjeevani
Arogyasanjeevni
artifactregistry
arxiv
ASF
ashleyxuu
astype
asynchttpclient
asyncio
asyncmock
atch
ATD
Atticus
atticusprojectai
autoconfigure
autodoc
automagically
automerge
autoprefixer
autopush
autoscaling
autosize
autosummary
autotuning
Autowired
AWt
axios
azzurro
BAARRZ
BACKENDURL
backticks
bafee
Bamburgh
bartle
basedadvertiser
basedproducts
BCsy
bdd
beaffected
bezier
bgcolor
Bgogxg
bgp
bharat
Bhupa
bianche
bigframes
bigquery
bigqueryconnection
bigquerydatapolicy
bigquerymigration
bigquerystorage
bigserial
Bitcoin
blogposts
blogs
bluetooth
bmp
bothinfrastructure
bqml
bqzs
brandadvertising
btn
Btw
bucketname
Buonconsiglio
Bupa
butta
bwc
byinterrelated
caaaa
CALNAME
CALSCALE
Cantarell
Carlessian
Carlesso
cbce
ccai
ccc
cctemplate
cdk
Cdkn
cdn
cdnjs
cefb
cfvbq
channit
channitdak
chatbot
chatbox
checkbox
checkboxes
CHECKOV
chiese
chipset
cielo
circondata
citt
Ciwpq
cjs
cla
classmethod
classpath
clicksand
cloudapis
cloudbuild
cloudcode
cloudconsole
cloudflare
cloudfunction
cloudidentity
cloudkms
cloudonair
cloudresourcemanager
cloudrun
cloudshell
cloudskillsboost
cloudsql
cloudtrace
cls
coc
codechat
codelab
codeowners
codey
colab
colonne
colspan
concat
Consolas
consts
containerfilesystem
containerregistry
conventionalcommits
coreference
coreferencing
correlazione
cors
cosa
countup
cov
coveragerc
cpet
CRA
cred
crossorigin
cse
cuad
cultura
currencyexchange
currentcolor
customermanagement
customised
CUTYPE
cygpath
cygwin
dac
DARKCYAN
dataform
dataframe
datapoints
datasource
datastore
datetime
dateutil
dayfirst
dayjs
dce
ddd
dearmor
debian
deconflict
decription
ded
Dej
delims
delle
demouser
deploymentmanager
Descrivimi
desity
developmentactivities
devhelp
devkit
devrel
devstorage
dflt
DHH
diabling
dialogflow
directresponse
direntries
direnv
discoveryengine
divs
dlx
Dmaven
dmoonat
dnd
dns
docfx
dockerpush
docstore
doctrees
documentai
docutils
dollarversus
Donya
Dqxc
Dra
draggable
drivenby
Droid
dropna
DSCm
Dservices
Dskip
DTEND
DTSTAMP
DTSTART
dtype
Dwwykc
dyanamic
EAJl
earcup
earlyaccess
EBTs
ecommerce
ectskg
editorconfig
eebc
eef
EEle
efefef
ehlo
ekg
eliasecchig
elif
emailid
emailresult
emailsent
emb
embd
embeddings
embedings
EMEA
enableapi
enctype
endblock
endfor
endlocal
engagementlevels
enpoint
enterpriseknowledgegraph
enterprisesearch
Entitities
enumbers
envrc
errorhandler
ERRORLEVEL
erwinh
Erzs
Erzsebet
esac
etags
etf
euo
evenodd
eventarc
excutes
exe
Exif
Exlq
facebook
faiss
fastapi
favouritest
fbb
fbp
Fcreate
feb
feeec
FEo
fetchall
ffb
ffc
fff
Fgenerative
finditer
finetuning
Fira
firebaserc
Fitbit
fixmycar
fixmycarbackend
fkscz
flaskapp
flowbite
Fmultimodal
fns
followedby
folmer
footwell
forlong
formatdate
foto
freeselec
freetrial
fromiter
fromtimestamp
fss
Ftext
ftp
fuction
fullwidth
functiondef
fuw
Fvertex
FWz
gcf
gcloud
gcp
gcpiconscolors
gcr
gcs
Gdo
gdynozbq
genai
genappbuilder
generativeai
genwealth
geolocation
gericdong
getattr
getconn
getcwd
getitem
getlist
geturl
ghchinoy
gitleaks
gke
gmail
GMeet
google
googleapiclient
googleapis
googlemerchandisestore
Googlers
googlesymbols
googleusercontent
googling
gpg
Gqb
gradio
grayscale
gridcell
griha
grocerybot
growthin
grpcio
gserviceaccount
GST
gstatic
gsutil
GSYGSHk
gunicorn
guruvittal
GZZIo
hadolint
hasexpanded
hashicorp
haspopup
havemade
HDFC
headcount
headwindyear
healthyin
heirarchy
highlightjs
hljs
hmfg
hnsw
holtskinner
HOMEDRIVE
HOMEPATH
Homeshield
hospitalisation
hqdefault
hsp
htmlhelp
htmlhintrc
hypersistence
HZN
iam
iamcredentials
iamthuya
iap
ical
icanhazip
ICICI
ico
ics
idx
IIITEM
Iivd
iloc
imageno
imagesearch
imap
imaplib
img
imges
immagine
inardini
inboth
inbox
inbrowsers
includingchallenges
includingincreases
incorniciano
increasedadvertiser
increasingcompetition
indevice
indexend
indexstart
indexvalue
inr
insta
instanceof
intersphinx
inthe
Intialize
inuser
investmentsin
iows
Ipc
ipv
ipynb
IPython
isej
isin
isinstance
isoformat
italiana
javac
JAVACMD
javascript
javax
JBEAP
jbrache
jdbc
jdk
Jdm
jdx
Jeevan
jegadesh
JHome
jjdelorme
jlpc
johnsmith
jpa
jquery
jre
jscpd
jsdelivr
jsondai
jsonify
jsonl
jsvine
jsx
jumpstart
junitxml
jupyter
jvm
Jwj
kanban
kanbanrow
kavach
kazunori
keydown
keyframes
keyrings
kmaphoenix
kms
Knative
Kolkata
Koustav
koverholt
KPIs
KSA
kubeconfig
kubernetes
kwards
kwargs
KWarq
kweinmeister
Kyaw
Kyw
labelledby
langchain
Lannister
lastrequest
lastresponse
lavinigam
LBY
leadsandsales
lego
len
LHERD
LHU
libexec
libpq
LIC
Lifecycle
linting
linux
linuxconfig
listbox
listdir
llm
lnppzg
localtime
lolcat
lombok
Lorme
losswas
LRO
lsb
lts
makedirs
markdownlint
marketingandoutreach
matchingengine
Matplotlib
mavenrc
maxsplit
mdc
meaningfullyin
Medi
mediterraneansea
medlm
Mellissa
metadatas
metageneration
mic
MIMETYPES
miniforge
mlops
MNe
momment
monthsended
moz
MPEG
MPK
mtu
mui
Muli
multer
multiline
multimodalembedding
mvn
mvnw
MXx
myaccount
mydomain
myprojectid
mysql
myvertexdatastoreid
naturual
nazione
nbqa
nbsp
ndarray
NEoa
Networkadvertising
networkmanagement
Networkproperties
Neue
Nigam
nio
nlargest
noopener
noqa
noreferrer
noscript
nowrap
noxfile
numpy
nuvole
nvidia
OAco
oauth
oauthlib
Occured
occuring
ocr
Oeqx
OGNy
oid
OJU
OLAP
olo
omnichannel
onadvertisements
onclick
OPD
openapi
openrate
opensource
Operatingincome
opsz
orgpolicy
originalname
oslogin
osx
otherservices
ourability
ouradvertisers
ourrevenues
overallgrowth
Overholt
owlbot
pagemap
palladius
palladiusbonton
pandasql
partiocularly
PARTSTAT
pased
pathlib
paulramsey
pcr
pdf
pdfplumber
peerings
pgadmin
PGDATABASE
PGHOST
PGka
PGPASSWORD
PGPORT
PGUSER
pgvector
Pharma
Phv
pietra
pipefail
Pixmap
pkey
PLACEHOLDERMESSAGE
PLACHOLDER
Platformservices
playlists
plpgsql
pls
polong
polyfills
posargs
possono
postcss
postgre
postgresql
prebuilt
preconnect
prerel
prerender
prestart
pretrained
prettierrc
prewritten
primarilyon
proactively
Procfile
PROJECTBASEDIR
projectid
projectlombok
proname
Proreflected
prospectfinder
protobuf
protos
psa
psql
psychographics
Pullum
PXikyn
pycqa
pygments
pylint
pyopenssl
pytest
PYTHONUNBUFFERED
pytorch
pytz
pyupgrade
QHjpt
Qkx
qna
qthelp
querybuilder
queryinterface
querytool
questa
quickstart
QVM
Qyq
Radebeul
ragdemos
Raileigh
Rajesh
raksha
rarsan
RBz
RDATE
RDy
RDYE
readlink
realpath
receipient
recevies
recommonmark
redis
Reimagining
relatedchanges
remainedfocused
removeprefix
REPORTPART
REPOURL
respone
restirctions
revenuesfrom
rgb
rgba
RHla
ricc
riccardo
rlhf
RMh
rmtree
Roboto
robotstxt
rohitnaidu
Romin
rominirani
rpc
RST
runtimeconfig
RWTv
rxa
Rxnatzjc
RXy
RYDE
Rzsyn
saeedaghabozorgi
salesof
SAMESITE
Saxtead
SBI
screencast
screenshot
scroller
seatback
seby
secretmanager
securetoken
Segoe
Seiya
sendmail
SEO
serializinghtml
serviceaccount
servicecontrol
servicedirectory
servicemanagement
servicenetworking
servicesfor
serviceusage
setlocal
shanecglass
Shashwat
shellcheck
shutil
sidenav
SIGTERM
simage
SKLf
slatawa
slf
smartphone
smtplib
socketio
solutionbuilder
sono
sourcerepo
spam
spcific
sphinxcontrib
splitext
springframework
sqladmin
sqlalchemy
sqldf
sqlfluff
sqlite
ssd
ssl
ssml
stackoverflow
stakeholders
standalone
startswith
starttls
stcore
stext
streamlit
streamlitapp
strengthin
strftime
strptime
stylelint
stylelintrc
stylesheet
subscriptionbased
Subworkflow
successfullaunch
successfullly
superstore
sveltejs
sveltekit
svg
Svsm
synthtool
sys
systemtest
tabindex
TABLESPACE
tablist
tailwindcss
tcp
tdata
teamindrdhanush
teamkavachinsurance
temeperature
templated
templatefile
temurin
tensorflow
terraform
testutils
tetti
Textbox
textcompletion
textembedding
texting
textno
texttospeech
tftpl
Thallam
Theadverse
theaverage
thelook
therelated
thes
thethird
thethree
theunfavorable
thirdquarter
threemonths
Thu
tiangolo
timedelta
timeframes
tls
TLSv
tmp
Tne
TNXj
tobytes
tolist
toml
toolbar
torri
TPA
trendson
TRosn
trustedtester
tsc
tshirt
tsv
TTFB
tts
tvads
Tyrion
TZID
TZNAME
TZOFFSETFROM
TZOFFSETTO
uid
undeploy
Undeploying
undst
unfavorableeffect
Unicode
unpkg
uomo
upsert
urllib
urlparse
usebackq
usecases
userid
USERPROFILE
usr
utc
utcfromtimestamp
utf
utilzing
UUc
uuid
uuidv
UUL
uvicorn
vais
valign
VARCHAR
VBxd
VCALENDAR
vcap
vectorized
vectorstore
vedere
vedi
VEhkb
venv
versioning
Verte
vertexai
vertexdatastoreid
VEVENT
viai
viewcode
virtualenv
Vks
vmin
VMj
vpc
vpcaccess
vqa
VSC
VTIMEZONE
VTJ
vtpm
vulnz
Vwsey
Vwtyz
VYVe
wasdriven
Wasserturm
wdir
WDr
webclient
webfonts
webinar
weblink
webserver
website
werkzeug
wght
whitesmoke
whl
WHtfh
wikipedia
willreflect
withgoogle
Wobj
workaround
WORKDIR
workflowexecutions
workflows
WQm
Xarg
XAxis
Xdebug
XEV
xffffff
xny
XPSm
Xrs
Xrunjdwp
xsd
xsi
Xsrf
XVGr
xwfa
xxxxxxx
xxxxxxxx
xxxxxxxxxx
YAxis
ybf
yearrevenue
ygj
YKs
yml
yourselfers
youtube
YTS
zbcmv
zdq
Zom
ZPhc
zricethezav
Some files were automatically ignored 🙈

These sample patterns would exclude them:

^\Qgemini/sample-apps/agent-assist/backend/src/apis/__init__.py\E$
^\Qgemini/sample-apps/agent-assist/backend/src/chatbot_dir/__init__.py\E$
^\Qgemini/sample-apps/agent-assist/backend/src/chatbot_dir/agents/__init__.py\E$
^\Qgemini/sample-apps/agent-assist/backend/src/chatbot_dir/agents/search_agent/app/__init__.py\E$
^\Qgemini/sample-apps/agent-assist/backend/src/chatbot_dir/agents/search_agent/preprocessing/__init__.py\E$
^\Qgemini/sample-apps/agent-assist/backend/src/utils/__init__.py\E$
^\Qgemini/sample-apps/image-bash-jam/images/.keep\E$
^\Qlanguage/use-cases/document-qa/utils/__init__.py\E$

You should consider adding them to:

.github/actions/spelling/excludes.txt

File matching is via Perl regular expressions.

To check these files, more of their words need to be in the dictionary than not. You can use patterns.txt to exclude portions, add items to the dictionary (e.g. by adding them to allow.txt), or fix typos.

To accept these unrecognized words as correct and update file exclusions, you could run the following commands

... in a clone of the git@github.com:nitchandak/generative-ai.git repository
on the dev branch (ℹ️ how do I use this?):

curl -s -S -L 'https://raw.githubusercontent.com/check-spelling/check-spelling/main/apply.pl' |
perl - 'https://github.com/GoogleCloudPlatform/generative-ai/actions/runs/9097639310/attempts/1'
Available 📚 dictionaries could cover words not in the 📘 dictionary
Dictionary Entries Covers Uniquely
cspell:python/src/python/python-lib.txt 2417 63 11
cspell:aws/aws.txt 218 20 10
cspell:python/src/common/extra.txt 741 15 9
cspell:fullstack/dict/fullstack.txt 419 27 8
cspell:python/src/python/python.txt 392 38 7

Consider adding them (in .github/workflows/spelling.yaml) for uses: check-spelling/check-spelling@main in its with:

      with:
        extra_dictionaries:
          cspell:python/src/python/python-lib.txt
          cspell:aws/aws.txt
          cspell:python/src/common/extra.txt
          cspell:fullstack/dict/fullstack.txt
          cspell:python/src/python/python.txt

To stop checking additional dictionaries, add (in .github/workflows/spelling.yaml) for uses: check-spelling/check-spelling@main in its with:

check_extra_dictionaries: ''
Pattern suggestions ✂️ (45)

You could add these patterns to .github/actions/spelling/patterns.txt:

# Automatically suggested patterns
# hit-count: 685 file-count: 96
# https/http/file urls
(?:\b(?:https?|ftp|file)://)[-A-Za-z0-9+&@#/%?=~_|!:,.;]+[-A-Za-z0-9+&@#/%=~_|]

# hit-count: 211 file-count: 68
# scala imports
^import (?:[\w.]|\{\w*?(?:,\s*(?:\w*|\*))+\})+

# hit-count: 113 file-count: 22
# Google Storage
\b[-a-zA-Z0-9.]*\bstorage\d*\.googleapis\.com(?:/\S*|)

# hit-count: 38 file-count: 31
# node packages
(["'])@[^/'" ]+/[^/'" ]+\g{-1}

# hit-count: 34 file-count: 15
# hex digits including css/html color classes:
(?:[\\0][xX]|\\u|[uU]\+|#x?|%23)[0-9_a-fA-FgGrR]*?[a-fA-FgGrR]{2,}[0-9_a-fA-FgGrR]*(?:[uUlL]{0,3}|[iu]\d+)\b

# hit-count: 30 file-count: 5
# While you could try to match `http://` and `https://` by using `s?` in `https?://`, sometimes there
# YouTube url
\b(?:(?:www\.|)youtube\.com|youtu.be)/(?:channel/|embed/|user/|playlist\?list=|watch\?v=|v/|)[-a-zA-Z0-9?&=_%]*

# hit-count: 23 file-count: 6
# version suffix <word>v#
(?:(?<=[A-Z]{2})V|(?<=[a-z]{2}|[A-Z]{2})v)\d+(?:\b|(?=[a-zA-Z_]))

# hit-count: 22 file-count: 13
# Python string prefix / binary prefix
# Note that there's a high false positive rate, remove the `?=` and search for the regex to see if the matches seem like reasonable strings
(?<!['"])\b(?:B|BR|Br|F|FR|Fr|R|RB|RF|Rb|Rf|U|UR|Ur|b|bR|br|f|fR|fr|r|rB|rF|rb|rf|u|uR|ur)['"](?=[A-Z]{3,}|[A-Z][a-z]{2,}|[a-z]{3,})

# hit-count: 21 file-count: 5
# Google Fonts
\bfonts\.(?:googleapis|gstatic)\.com/[-/?=:;+&0-9a-zA-Z]*

# hit-count: 20 file-count: 13
# GitHub SHAs (markdown)
(?:\[`?[0-9a-f]+`?\]\(https:/|)/(?:www\.|)github\.com(?:/[^/\s"]+){2,}(?:/[^/\s")]+)(?:[0-9a-f]+(?:[-0-9a-zA-Z/#.]*|)\b|)

# hit-count: 20 file-count: 8
# Compiler flags (Unix, Java/Scala)
# Use if you have things like `-Pdocker` and want to treat them as `docker`
(?:^|[\t ,>"'`=(])-(?:(?:J-|)[DPWXY]|[Llf])(?=[A-Z]{2,}|[A-Z][a-z]|[a-z]{2,})

# hit-count: 20 file-count: 6
# base64 encoded content, possibly wrapped in mime
(?:^|[\s=;:?])[-a-zA-Z=;:/0-9+]{50,}(?:[\s=;:?]|$)

# hit-count: 19 file-count: 2
# kubernetes object suffix
-[0-9a-f]{10}-\w{5}\s

# hit-count: 14 file-count: 13
# Contributor
\[[^\]]+\]\(https://github\.com/[^/\s"]+/?\)

# hit-count: 13 file-count: 5
# Compiler flags (Windows / PowerShell)
# This is a subset of the more general compiler flags pattern.
# It avoids matching `-Path` to prevent it from being treated as `ath`
(?:^|[\t ,"'`=(])-(?:[DPL](?=[A-Z]{2,})|[WXYlf](?=[A-Z]{2,}|[A-Z][a-z]|[a-z]{2,}))

# hit-count: 8 file-count: 4
# libraries
\blib(?!rar(?:ies|y))(?=[a-z])

# hit-count: 8 file-count: 3
# AWS VPC
vpc-\w+

# hit-count: 7 file-count: 7
# uuid:
\b[0-9a-fA-F]{8}-(?:[0-9a-fA-F]{4}-){3}[0-9a-fA-F]{12}\b

# hit-count: 7 file-count: 7
# set arguments
\b(?:bash|sh|set)(?:\s+-[abefimouxE]{1,2})*\s+-[abefimouxE]{3,}(?:\s+-[abefimouxE]+)*

# hit-count: 7 file-count: 5
# hex runs
\b[0-9a-fA-F]{16,}\b

# hit-count: 5 file-count: 1
# URL escaped characters
%[0-9A-F][A-F](?=[A-Za-z])

# hit-count: 4 file-count: 4
# w3
\bw3\.org/[-0-9a-zA-Z/#.]+

# hit-count: 4 file-count: 2
# kubernetes pod status lists
# https://kubernetes.io/docs/concepts/workloads/pods/pod-lifecycle/#pod-phase
\w+(?:-\w+)+\s+\d+/\d+\s+(?:Running|Pending|Succeeded|Failed|Unknown)\s+

# hit-count: 4 file-count: 2
# Time Zones
\b(?:Africa|Atlantic|America|Antarctica|Asia|Australia|Europe|Indian|Pacific)(?:/\w+)+

# hit-count: 3 file-count: 3
# stackexchange -- https://stackexchange.com/feeds/sites
\b(?:askubuntu|serverfault|stack(?:exchange|overflow)|superuser).com/(?:questions/\w+/[-\w]+|a/)

# hit-count: 3 file-count: 2
# integrity
integrity=(['"])(?:\s*sha\d+-[-a-zA-Z=;:/0-9+]{40,})+\g{-1}

# hit-count: 3 file-count: 2
# Non-English
[a-zA-Z]*[ÀÁÂÃÄÅÆČÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝßàáâãäåæčçèéêëìíîïðñòóôõöøùúûüýÿĀāŁłŃńŅņŒœŚśŠšŜŝŸŽžź][a-zA-Z]{3}[a-zA-ZÀÁÂÃÄÅÆČÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝßàáâãäåæčçèéêëìíîïðñòóôõöøùúûüýÿĀāŁłŃńŅņŒœŚśŠšŜŝŸŽžź]*|[a-zA-Z]{3,}[ÀÁÂÃÄÅÆČÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝßàáâãäåæčçèéêëìíîïðñòóôõöøùúûüýÿĀāŁłŃńŅņŒœŚśŠšŜŝŸŽžź]|[ÀÁÂÃÄÅÆČÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖØÙÚÛÜÝßàáâãäåæčçèéêëìíîïðñòóôõöøùúûüýÿĀāŁłŃńŅņŒœŚśŠšŜŝŸŽžź][a-zA-Z]{3,}

# hit-count: 3 file-count: 1
# C network byte conversions
(?:\d|\bh)to(?!ken)(?=[a-z])|to(?=[adhiklpun]\()

# hit-count: 2 file-count: 2
# css url wrappings
\burl\([^)]+\)

# hit-count: 2 file-count: 2
# YouTube image
\bimg\.youtube\.com/vi/[-a-zA-Z0-9?&=_]*

# hit-count: 2 file-count: 2
# Google Accounts
\baccounts.google.com/[-_/?=.:;+%&0-9a-zA-Z]*

# hit-count: 2 file-count: 2
# base64 encoded content
([`'"])[-a-zA-Z=;:/0-9+]{3,}=\g{-1}

# hit-count: 2 file-count: 1
# IServiceProvider / isAThing
(?:\b|_)(?:I|isA)(?=(?:[A-Z][a-z]{2,})+(?:[A-Z]|\b))

# hit-count: 1 file-count: 1
# data url in quotes
([`'"])data:(?:[^ `'"].*?|)(?:[A-Z]{3,}|[A-Z][a-z]{2,}|[a-z]{3,}).*\g{-1}

# hit-count: 1 file-count: 1
# data url
\bdata:[-a-zA-Z=;:/0-9+]*,\S*

# hit-count: 1 file-count: 1
# mailto urls
mailto:[-a-zA-Z=;:/?%&0-9+@._]{3,}

# hit-count: 1 file-count: 1
# bit.ly
\bbit\.ly/\w+

# hit-count: 1 file-count: 1
# cdn.cloudflare.com
\bcdnjs\.cloudflare\.com/[./\w]+

# hit-count: 1 file-count: 1
# medium
\bmedium\.com/@?[^/\s"]+/[-\w]+

# hit-count: 1 file-count: 1
# Wikipedia
\ben\.wikipedia\.org/wiki/[-\w%.#]+

# hit-count: 1 file-count: 1
# ssh
(?:ssh-\S+|-nistp256) [-a-zA-Z=;:/0-9+]{12,}

# hit-count: 1 file-count: 1
# This does not cover multiline strings, if your repository has them,
# you'll want to remove the `(?=.*?")` suffix.
# The `(?=.*?")` suffix should limit the false positives rate
# printf
%(?:(?:(?:hh?|ll?|[jzt])?[diuoxn]|l?[cs]|L?[fega]|p)(?=[a-z]{2,})|(?:X|L?[FEGA]|p)(?=[a-zA-Z]{2,}))(?!%)(?=[_a-zA-Z]+(?!%)\b)(?=.*?['"])

# hit-count: 1 file-count: 1
# Alternative printf
# %s
%(?:s(?=[a-z]{2,}))(?!%)(?=[_a-zA-Z]+(?!%)\b)(?=.*?['"])

# hit-count: 1 file-count: 1
# bearer auth
(['"])[Bb]ear[e][r] .*?\g{-1}

# hit-count: 1 file-count: 1
# curl arguments
\b(?:\\n|)curl(?:\.exe|)(?:\s+-[a-zA-Z]{1,2}\b)*(?:\s+-[a-zA-Z]{3,})(?:\s+-[a-zA-Z]+)*

Errors (4)

See the 📂 files view, the 📜action log, or 📝 job summary for details.

❌ Errors Count
ℹ️ binary-file 8
ℹ️ candidate-pattern 96
❌ check-file-path 641
❌ forbidden-pattern 30

See ❌ Event descriptions for more information.

@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label May 15, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 15, 2024
@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label May 15, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 15, 2024
@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label May 15, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 15, 2024
@holtskinner
Copy link
Collaborator

Please resolve spelling errors shown in this check https://github.com/GoogleCloudPlatform/generative-ai/actions/runs/9101630581/attempts/1#summary-25019286805

For items that aren't errors, you can follow the instructions to add them to the list of approved terms.

@github-actions github-actions bot added the owlbot:run Add this label to trigger the Owlbot post processor. label May 17, 2024
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label May 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants