Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add admin tool for sitemap generation to bie-index #374

Closed
adam-collins opened this issue Sep 29, 2023 · 8 comments
Closed

add admin tool for sitemap generation to bie-index #374

adam-collins opened this issue Sep 29, 2023 · 8 comments
Assignees

Comments

@adam-collins
Copy link
Contributor

No description provided.

@adam-collins adam-collins self-assigned this Oct 3, 2023
@adam-collins
Copy link
Contributor Author

Feedback on these sitemap items please.

  • Taxonomic. Add accepted names to the sitemap.
    • idxType:TAXON AND taxonomicStatus:accepted
    • Exclude records with LSIDs not beginning with http. This is to exclude ALA generated identifiers.
    • Add to the sitemap these pages. They all resolve to an accepted taxon page.
      • http:https://bie.ala.org.au/species/ + COMMON_NAME. This is only the preferred common name.
      • http:https://bie.ala.org.au/species/ + NAME. Use the complete name if available.
  • Wordpress pages. Sitemap already exists https://www.ala.org.au/xmlsitemap.xml
  • Support pages. Sitemap not required as it is crawled https://support.ala.org.au/
  • Collections. Add to sitemap.
    • idxType:COLLECTION OR idxType:INSTITUTION OR idxType:DATAPROVIDER OR idxType:DATARESOURCE
    • Add to sitemap https://collections.ala.org.au/public/show/ + ID
  • Gazetteer locations. URLs point at explore your area, https://biocache.ala.org.au/explore/your-area#-17.7558|127.8768|12|ALL_SPECIES. Location name information is not present on the linked page. Do not add to sitemap.
  • Regions. URLs points to regions https://regions.ala.org.au/feature/6205053#group=ALL_SPECIES&subgroup=&guid=&from=1850&to=2023&tab=speciesTab&fq= Location information is present with title and map. Metadata for the area definition is not present. Species information on the page is pageable and incomplete. Do not add to sitemap.

@matthewandrews
Copy link
Member

matthewandrews commented Oct 3, 2023

I noticed that, for the home site ("WordPress"), we also have an old sitemap.xml file which has not been updated for a long time...(whereas the xmlsitemap.xml files are kept up to date). I've now added a rewrite so that requests for https://www.ala.org.au/sitemap.xml are rewritten to xmlsitemap.xml

@peggynewman
Copy link

I'll ask the taxonomy working group to have a look at this. I think that's a good set from the collectory, although there are a lot of rubbish data providers, I don't think we're ready to be rid of them yet.
@adam-collins what do you think about adding CAPAD areas?

@adam-collins
Copy link
Contributor Author

CAPAD areas is included in idxtype:REGION, the Regions. I think more layer information should be included on the regions page but can continue without that.

@adam-collins
Copy link
Contributor Author

collectory sitemap Pull request AtlasOfLivingAustralia/collectory#218

@adam-collins
Copy link
Contributor Author

@adam-collins
Copy link
Contributor Author

Entry point to all sitemaps is /sitemap.xml

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants