feat: change human scores to None, add prediction and identification fields to CSV export#1214
Conversation
…elds

Separate machine predictions from human identifications in exports and API, so researchers see both side-by-side. Previously the determination was overwritten when a human verified, losing the original ML prediction.

Model layer:
- Extract find_best_prediction() and find_best_identification() from update_occurrence_determination() for reuse by exports and API
- Set determination_score to None for human-determined occurrences (ML confidence preserved in best_machine_prediction_score)

New CSV export fields:
- best_machine_prediction_name, _algorithm, _score
- verified_by, verified_by_count, agreed_with_algorithm
- determination_matches_machine_prediction
- best_detection_bbox, best_detection_source_image_url, best_detection_occurrence_url

API changes:
- Add best_machine_prediction nested object to OccurrenceListSerializer (always populated regardless of verification status)

Closes #1213

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
📝 Walkthrough

Adds machine-prediction, verification, and best-detection annotations and fields to Occurrence exports and API; refactors determination logic to preserve ML scores; adds queryset annotation helpers, a backfill command, and tests exercising the new export/management behaviors.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    actor Client
    participant ExportAPI as Export Request
    participant Serializer as OccurrenceTabularSerializer
    participant QuerySet as OccurrenceQuerySet
    participant DB as Database
    participant Occurrence as Occurrence instance
    Client->>ExportAPI: Request CSV export
    ExportAPI->>QuerySet: build queryset (.with_best_detection(), .with_best_machine_prediction(), .with_verification_info())
    QuerySet->>DB: execute annotated query
    DB-->>QuerySet: annotated rows
    QuerySet-->>Serializer: rows
    Serializer->>Occurrence: get_best_detection_source_image_url(obj), get_verified_by(obj), get_determination_matches_machine_prediction(obj)
    Occurrence-->>Serializer: computed field values (from annotations / methods)
    Serializer-->>Client: CSV with ML prediction, verification, detection fields
```

```mermaid
sequenceDiagram
    actor Client
    participant API as OccurrenceListSerializer
    participant Occ as Occurrence.find_best_prediction()
    participant Classification as Classification (ML)
    participant Taxon
    participant Algorithm
    Client->>API: GET /occurrences/
    API->>Occ: call find_best_prediction()
    Occ->>Classification: select best (terminal first, highest score)
    Classification-->>Occ: best classification (or null)
    alt prediction exists
        API->>Taxon: serialize prediction.taxon
        API->>Algorithm: serialize prediction.algorithm
        API-->>Client: {taxon, algorithm, score, determination_matches_machine_prediction}
    else no prediction
        API-->>Client: null for best_machine_prediction
    end
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes
🚥 Pre-merge checks: ✅ 4 passed | ❌ 1 failed (1 warning)
Pull request overview
This PR updates occurrence exports and the API to preserve and expose both machine predictions and human verifications side-by-side, preventing ML predictions from being lost when a human identification is added.
Changes:
- Refactors occurrence logic to expose reusable “best prediction” and “best identification” selection methods and adjusts determination score semantics for human IDs.
- Extends CSV exports with new machine prediction, verification, and detection-related fields backed by queryset annotations.
- Adds a `best_machine_prediction` nested object to the occurrences list API serializer.
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 9 comments.
| File | Description |
|---|---|
| ami/main/models.py | Adds queryset annotations for best detection/prediction/verification and refactors best prediction/identification selection + determination score update logic. |
| ami/main/api/serializers.py | Adds best_machine_prediction field to occurrence list API responses. |
| ami/exports/format_types.py | Adds new CSV export fields and wires the exporter queryset to include the new annotations. |
| ami/exports/tests.py | Adds tests covering the new CSV export fields across ML-only and human-verification scenarios. |
Actionable comments posted: 4
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@ami/exports/format_types.py`:
- Around line 192-204: Use the annotated winning detection already attached to
the exported object instead of recomputing it: in
get_best_detection_source_image_url and get_best_detection_occurrence_url, read
the annotated detection from the obj (e.g. getattr(obj, "best_detection", None)
or the specific annotated attributes like
best_detection_source_image_path/public_base_url on that detection) and build
both the source image URL and the occurrence/context URL from that detection's
stored fields (path, public_base_url or signed_url, and occurrence/context link)
rather than calling obj.context_url() which re-queries and may use a different
ordering; this ensures URLs correspond to the exported bbox/path and avoids N+1
queries.
In `@ami/main/api/serializers.py`:
- Around line 1396-1421: get_best_machine_prediction currently calls
obj.find_best_prediction() and directly accesses prediction.taxon and
prediction.algorithm, causing an N+1; change it to reuse obj.best_prediction
(use that if present, fall back to obj.find_best_prediction() only if needed)
and avoid dereferencing relations that aren't prefetched—either use the
already-attached taxon/algorithm on the prediction object or ensure these are
provided via queryset annotations/prefetch_related and passed through context to
TaxonNestedSerializer/AlgorithmNestedSerializer to prevent per-row queries;
update get_best_machine_prediction to check obj.best_prediction first and only
call find_best_prediction as a fallback, and document that views/queries should
prefetch prediction__taxon and prediction__algorithm.
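The "annotation first, per-row query only as fallback" pattern the comment describes can be sketched without Django. The attribute names here (`annotated_best_prediction`, `find_best_prediction`) are illustrative stand-ins, not the project's exact API:

```python
from types import SimpleNamespace

def best_prediction_for(obj):
    """Prefer the queryset-annotated value; fall back to a per-row lookup.

    Illustrative sketch: `annotated_best_prediction` plays the role of a
    value attached by the queryset, `find_best_prediction` the per-row query.
    """
    cached = getattr(obj, "annotated_best_prediction", None)
    if cached is not None:
        return cached  # zero extra queries: the value came with the row
    return obj.find_best_prediction()  # the N+1 path, now only a fallback

# One object the annotation reached, and one it did not.
annotated = SimpleNamespace(annotated_best_prediction={"score": 0.9})
bare = SimpleNamespace(find_best_prediction=lambda: {"score": 0.7})

print(best_prediction_for(annotated), best_prediction_for(bare))
```

In a list view the annotated branch runs for every row, so the fallback query only fires for objects fetched outside the annotated queryset.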
In `@ami/main/models.py`:
- Around line 3210-3218: The code only sets new_score when new_determination
changes, so when authority flips but the taxon stays the same we never recompute
determination_score; update the logic around
occurrence.find_best_identification() and occurrence.find_best_prediction()
(references: top_identification, top_prediction, new_determination, new_score,
determination_score) so that whenever the authority changes you still set
new_score appropriately (None for a human top_identification,
top_prediction.score for an ML top_prediction) and assign determination_score on
the occurrence even if the taxon equals current_determination; apply the same
fix in the analogous block handling lines ~3225-3228.
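The scoring rule the comment asks for (recompute on any authority change, even when the taxon is unchanged) reduces to a small pure function. A hedged sketch in plain Python: the inputs are dicts standing in for Identification/Classification rows, and the helper name is invented:

```python
def resolve_determination_score(top_identification, top_prediction):
    """Return the determination_score for whichever authority wins.

    Illustrative helper mirroring the rule in the comment above: a human
    identification always wins and carries no confidence score (None);
    otherwise an ML prediction carries the classifier's confidence.
    Calling this on every authority change, regardless of whether the
    taxon changed, is exactly what avoids the stale-score bug.
    """
    if top_identification is not None:
        return None  # human authority: score must be cleared
    if top_prediction is not None:
        return top_prediction["score"]  # ML authority: keep the confidence
    return None  # nothing authoritative left

print(resolve_determination_score(None, {"score": 0.88}))
```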
- Around line 2864-2867: The aggregate verified_by_count is over-counting
because Count("identifications") is performed across detection rows; change the
Count to be distinct so each identification is only counted once (e.g. use
Count("identifications", distinct=True,
filter=Q(identifications__withdrawn=False))). Update the verified_by_count
definition (the Count call) to include distinct=True while keeping the existing
filter on identifications.
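The over-count described above can be reproduced without the ORM: counting rows of a detection-identification join counts each identification once per joined detection, while a distinct count does not. A plain-Python sketch (the row data is invented; the set comprehension plays the role of `Count("identifications", distinct=True)`):

```python
# Simulated result of joining one occurrence's detections to its
# identifications: each (detection_id, identification_id) pair is one row,
# so a single identification appears once per detection.
joined_rows = [
    (1, 101),  # detection 1 x identification 101
    (2, 101),  # detection 2 x identification 101  <- same identification again
    (1, 102),
    (2, 102),
]

naive_count = len(joined_rows)                             # Count("identifications")
distinct_count = len({ident for _, ident in joined_rows})  # Count("identifications", distinct=True)

print(naive_count, distinct_count)
```

Two identifications over two detections double to four joined rows, which is why the aggregate needed `distinct=True`.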
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: ca26e0c4-7c2a-409c-8ae0-af43172af29c
📒 Files selected for processing (4)
- ami/exports/format_types.py
- ami/exports/tests.py
- ami/main/api/serializers.py
- ami/main/models.py
- Add distinct=True to verified_by_count to prevent join multiplication
- Fix update_occurrence_determination to recompute the score even when the taxon stays the same (handles an authority flip without a taxon change)
- Remove email fallback in verified_by (PII concern, name-only)
- Filter withdrawn identifications in the verification_status fallback
- Use the obj.best_prediction cached property instead of find_best_prediction() in the API serializer to avoid N+1 queries
- Build the occurrence URL from annotated fields instead of context_url() to avoid N+1 queries in the export
- Use source_image.timestamp instead of datetime.now() in tests

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
From reading the PR description, this looks great Michael! Some comments and questions:
Very astute observations @annavik !
That makes sense! Although I am leaning away from including this count at all, since we haven't talked about verification at different ranks or disagreements. I realized what I want here is just "participant_count", to highlight when more than one person is in the identification history. In general, I want to stick with the assumption that the platform is for experts to ID and the last expert gets the final word -- like a museum collection, rather than a community ID platform.
Great question! Probably yes! This would require additional changes (I am just exposing this value in this PR, not calculating it). Maybe an easy change though.
I agree it's too soon! Will remove.
I like this one since it's based on the taxon ID rather than string matching, and I think it will be a common use case.
…urrence URL

Per @annavik and @mihow's review:
- Rename `verified_by_count` to `participant_count` (counts distinct users, not total IDs)
- Remove `best_detection_occurrence_url` (URLs not stable enough to distribute in exports yet)
- Clean up unused annotations (event_id, source_image_id) from with_best_detection()

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Actionable comments posted: 3
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (2)
ami/exports/format_types.py (1)
112-145: ⚠️ Potential issue | 🟠 Major: Don't bake the preview frontend domain into exported CSVs.

`best_detection_occurrence_url` is hardcoded to the preview app, so exported files become wrong as soon as the frontend base URL differs by environment or changes later. That makes the new column non-portable for exactly the reason raised in the PR discussion. Either drop this column or build it from a configured frontend base URL instead of a literal hostname.

Also applies to: 199-209
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@ami/exports/format_types.py` around lines 112-145: the serializer currently hardcodes the preview frontend domain for the best_detection_occurrence_url column (the SerializerMethodField best_detection_occurrence_url and its getter), making exported CSVs non-portable. Either remove best_detection_occurrence_url from the serializer fields, or change the getter (e.g., get_best_detection_occurrence_url) to construct the link from a configurable frontend base URL (read from settings or an env var like FRONTEND_BASE_URL) and fall back to None if not set. Update the Meta.fields list and the corresponding method(s) (also referenced later in the same file for the similar fields) so no literal hostname is embedded.

ami/main/models.py (1)
3215-3242: ⚠️ Potential issue | 🟠 Major: Compare determination IDs and allow clearing stale determinations.

The fallback query at lines 3215-3220 returns a raw `determination` id, while `new_determination` is a `Taxon` instance. Combined with the `if new_determination ...` guard, unchanged rows look changed, and rows with no remaining identification/prediction never clear an old saved determination.

Suggested fix
```diff
-    current_determination = (
-        current_determination
-        or Occurrence.objects.select_related("determination")
-        .values("determination")
-        .get(pk=occurrence.pk)["determination"]
-    )
+    current_determination_id = getattr(current_determination, "pk", current_determination)
+    if current_determination_id is None:
+        current_determination_id = (
+            Occurrence.objects.values_list("determination_id", flat=True).get(pk=occurrence.pk)
+        )

     new_determination = None
     new_score = None
@@
-    if new_determination and new_determination != current_determination:
+    new_determination_id = new_determination.pk if new_determination else None
+    if new_determination_id != current_determination_id:
         logger.debug(f"Changing det. of {occurrence} from {current_determination} to {new_determination}")
         occurrence.determination = new_determination
         needs_update = True
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@ami/main/models.py` around lines 3215 - 3242, The bug is that current_determination is being read as a raw determination id while new_determination is a Taxon instance, causing false positives and preventing clearing of stale determinations; fix by normalizing types before comparison: either load the actual Taxon object for current_determination (e.g. use Occurrence.objects.select_related("determination").get(pk=occurrence.pk).determination) or compare IDs consistently (e.g. get current_determination_id via .values_list("determination", flat=True) and compare to new_determination.pk), and ensure when no identification/prediction remains you set occurrence.determination = None (and update determination_score) so stale determinations are cleared; update the comparisons around current_determination, new_determination, and occurrence.determination_score accordingly (references: current_determination, new_determination, occurrence.find_best_identification, occurrence.find_best_prediction, determination_score).
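The instance-vs-id type mismatch behind this finding is easy to demonstrate in isolation. A plain-Python sketch (the `Taxon` class here is a minimal stand-in for the Django model, not the real one):

```python
class Taxon:
    """Minimal stand-in for the Django model; only the pk matters here."""

    def __init__(self, pk: int):
        self.pk = pk


def normalize_id(value):
    """Accept a Taxon instance, a raw FK id, or None; return a comparable id.

    Same trick as the suggested fix above: getattr(value, "pk", value).
    """
    return getattr(value, "pk", value)


# The original bug: an instance never equals its raw FK id...
print(Taxon(42) == 42)  # False
# ...but normalizing both sides makes the comparison meaningful.
print(normalize_id(Taxon(42)) == normalize_id(42))  # True
```

Because the un-normalized comparison is always unequal, every call looked like a change, which is the needless `save()` churn described later in this thread.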
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@ami/exports/tests.py`:
- Line 376: The unpacking at the call to _create_occurrence_with_prediction
currently assigns unused variables occurrence and classification (e.g.,
"occurrence, classification = self._create_occurrence_with_prediction()") which
triggers RUF059; change the bindings to use the underscore discard pattern
(e.g., "_" or "_classification") for both returned values at each call site
(including the other similar call around line 417) or assign only the needed
value via a single-variable assignment from the function result, updating
references accordingly so unused names are removed.
In `@ami/main/api/serializers.py`:
- Around line 1396-1421: The function get_best_machine_prediction currently
returns None when obj.best_prediction is missing but the API contract requires
the nested object to always be present; change the early-return to return a dict
with the expected keys populated with nullable defaults instead. Specifically,
inside get_best_machine_prediction (and using obj.best_prediction,
TaxonNestedSerializer, AlgorithmNestedSerializer as anchors), always return a
dict with keys taxon, algorithm, score, determination_matches_machine_prediction
where taxon and algorithm are None if no prediction, score is None, and
determination_matches_machine_prediction is None (or False if you prefer
boolean) when no prediction exists; keep the existing logic that fills these
fields when prediction is present.
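The stable-shape idea in this comment can be sketched as a plain function: always emit the same keys, with null values when no prediction exists. This is only the shape the comment requests (a later commit in this thread reverts to returning None), and `prediction` here is a plain dict standing in for the Classification row, not the project's serializer:

```python
def serialize_best_machine_prediction(prediction, determination_taxon_id=None):
    """Return a dict with a fixed set of keys, nulled when no prediction exists.

    Illustrative sketch of the stable response shape: clients can always
    read nullable fields instead of branching on the value's type.
    """
    if prediction is None:
        return {
            "taxon": None,
            "algorithm": None,
            "score": None,
            "determination_matches_machine_prediction": None,
        }
    return {
        "taxon": prediction["taxon"],
        "algorithm": prediction["algorithm"],
        "score": prediction["score"],
        # Match on taxon id, not name strings.
        "determination_matches_machine_prediction": prediction["taxon"]["id"] == determination_taxon_id,
    }
```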
In `@ami/main/models.py`:
- Around line 3078-3105: find_best_prediction currently calls self.predictions()
which applies per-algorithm max-score filtering and causes a different result
than OccurrenceQuerySet.with_best_machine_prediction(); change
find_best_prediction to mirror with_best_machine_prediction by querying
Classification objects for this occurrence directly (not via predictions()),
ordering by "-terminal", "-score" across all classifications, and returning the
first result so the API/CSV and this method pick the same best machine
prediction; update any references to find_best_prediction/best_prediction if
needed to rely on the new query behavior.
---
Outside diff comments:
In `@ami/exports/format_types.py`:
- Around line 112-145: The serializer currently hardcodes the preview frontend
domain for the best_detection_occurrence_url column (the SerializerMethodField
best_detection_occurrence_url and its getter), making exported CSVs
non-portable; either remove best_detection_occurrence_url from the serializer
fields, or change the getter (e.g., get_best_detection_occurrence_url) to
construct the link from a configurable frontend base URL (read from settings or
an env var like FRONTEND_BASE_URL) and fall back to None if not set; update the
Meta.fields list and the corresponding method(s) (also referenced in the same
file around the other similar fields at the later occurrence) so no literal
hostname is embedded.
In `@ami/main/models.py`:
- Around line 3215-3242: The bug is that current_determination is being read as
a raw determination id while new_determination is a Taxon instance, causing
false positives and preventing clearing of stale determinations; fix by
normalizing types before comparison: either load the actual Taxon object for
current_determination (e.g. use
Occurrence.objects.select_related("determination").get(pk=occurrence.pk).determination)
or compare IDs consistently (e.g. get current_determination_id via
.values_list("determination", flat=True) and compare to new_determination.pk),
and ensure when no identification/prediction remains you set
occurrence.determination = None (and update determination_score) so stale
determinations are cleared; update the comparisons around current_determination,
new_determination, and occurrence.determination_score accordingly (references:
current_determination, new_determination, occurrence.find_best_identification,
occurrence.find_best_prediction, determination_score).
Claude says: Thanks for the great feedback @annavik! Addressed in 66d9390:
Exposes the email of the prior human identifier when the best identification was explicitly an "Agree" with another user. Lets consumers trace agreement chains without the backend having to resolve transitivity. Co-Authored-By: Claude <noreply@anthropic.com>
Claude says: Added. While implementing this I noticed a pre-existing UI bug where agreeing with a human identification from the occurrence details card sends the wrong FK type. Filed separately as #1226.
- Add `backfill_determination_score` management command to null-out determination_score on legacy human-determined occurrences. Supports --dry-run and --project filtering.
- Tighten the legacy score-backfill block in Occurrence.save() to skip human-determined occurrences (avoids a redundant query and suppresses the misleading "Could not determine score" warning).
- Fix type mismatch in update_occurrence_determination(): when the caller didn't pass current_determination, the fallback fetched the FK id (int) and compared it against a Taxon instance, which was always unequal and caused unnecessary save() calls and misleading debug logs. Normalize current_determination to an id for the comparison and accept int|Taxon|None.

Tests: BackfillDeterminationScoreTest covers the ML-only, human-determined, and withdrawn-identification cases.

Co-Authored-By: Claude <noreply@anthropic.com>
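The core loop of a backfill command like the one described above can be sketched with plain dicts. This is an assumption-laden illustration (the real command operates on ORM rows, bulk-updates, and takes its rows from a queryset), showing how a `--dry-run` flag reports what it would touch without writing:

```python
def backfill_determination_score(occurrences, dry_run=False):
    """Null out determination_score on human-determined occurrences.

    Plain-dict sketch of the command's selection logic: only rows that are
    human-determined AND still carry a score need fixing. Returns the ids
    it would touch so a dry run can report without writing.
    """
    touched = []
    for occ in occurrences:
        if occ["human_determined"] and occ["determination_score"] is not None:
            touched.append(occ["id"])
            if not dry_run:
                occ["determination_score"] = None  # the actual backfill write
    return touched

rows = [
    {"id": 1, "human_determined": True, "determination_score": 0.93},   # legacy row to fix
    {"id": 2, "human_determined": False, "determination_score": 0.88},  # ML-only: preserved
    {"id": 3, "human_determined": True, "determination_score": None},   # already clean
]
print(backfill_determination_score(rows, dry_run=True))
```

The three rows mirror the test cases named in the commit: human-determined cleared, ML-only preserved, already-clean skipped.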
Actionable comments posted: 2
♻️ Duplicate comments (2)
ami/exports/tests.py (1)
376: ⚠️ Potential issue | 🟡 Minor: Fix unused unpacked variable to avoid RUF059.

At line 376 and line 417, `classification` is assigned but never used.

Suggested fix
```diff
- occurrence, classification = self._create_occurrence_with_prediction()
+ occurrence, _classification = self._create_occurrence_with_prediction()
  ...
- occurrence, classification = self._create_occurrence_with_prediction(taxon=self.taxon_a)
+ occurrence, _classification = self._create_occurrence_with_prediction(taxon=self.taxon_a)
```

Also applies to: 417
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@ami/exports/tests.py` at line 376: the test assigns an unused second value from self._create_occurrence_with_prediction() to classification, causing RUF059. Update both call sites where you do "occurrence, classification = self._create_occurrence_with_prediction()" (and the duplicate) to ignore the unused value by renaming it to "_" or "_classification" (e.g., "occurrence, _ = self._create_occurrence_with_prediction()") so the returned but unused classification is not flagged.

ami/main/models.py (1)
3067-3075: ⚠️ Potential issue | 🟠 Major: Align `find_best_prediction()` with the export/API query.

At line 3075 this still starts from `self.predictions()`, which pre-filters to per-algorithm max scores. `OccurrenceQuerySet.with_best_machine_prediction()` in the same file orders across all classifications, so the model method can disagree with the exported `best_machine_prediction_*` fields for the same occurrence.

Suggested fix
```diff
 def find_best_prediction(self) -> "Classification | None":
     """
     Find the best machine prediction for this occurrence.
@@
     Uses the highest scoring classification (from any algorithm) as the best prediction.
     Considers terminal classifications first, then non-terminal ones.
     (Terminal classifications are the final classifications of a pipeline,
     non-terminal are intermediate models.)
     """
-    return self.predictions().order_by("-terminal", "-score").first()
+    return Classification.objects.filter(detection__occurrence=self).order_by("-terminal", "-score").first()
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@ami/main/models.py` around lines 3067 - 3075, The method find_best_prediction() currently calls self.predictions(), which applies the per-algorithm max-score prefilter and can disagree with OccurrenceQuerySet.with_best_machine_prediction() that considers all classifications; update find_best_prediction() to query all Classification rows for this occurrence (e.g., via the Classification model filter by occurrence=self or the raw reverse relation like self.classifications.all()) and then order by "-terminal", "-score" to pick the single best prediction so it matches the logic used in with_best_machine_prediction().
🧹 Nitpick comments (1)
ami/exports/tests.py (1)
465: Update docstring to match renamed field semantics.

The text still mentions `verified_by_count`; tests now validate `participant_count` (distinct participants), so this wording is stale.

Suggested fix
- """Multiple identifications: verified_by_count reflects all non-withdrawn IDs.""" + """Multiple identifications: participant_count reflects distinct non-withdrawn users."""🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@ami/exports/tests.py` at line 465, Update the test docstring to reflect the renamed field: replace references to "verified_by_count" with "participant_count" and adjust the description to state that participant_count reflects the number of distinct non-withdrawn participants (or similar wording used elsewhere in the codebase). Locate the docstring in ami/exports/tests.py (the triple-quoted string currently reading "Multiple identifications: verified_by_count reflects all non-withdrawn IDs.") and change it to mention participant_count and distinct participants so the docstring matches the assertions in the test.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@ami/exports/format_types.py`:
- Around line 163-165: The method get_agreed_with_user currently returns the
collaborator's email (agreed_with_user_email) which leaks PII; change it to
return a non-PII identifier or display name instead (e.g., agreed_with_user_id
or agreed_with_user_display_name) by reading the corresponding attribute from
obj and falling back to None if absent—update get_agreed_with_user to use that
non-PII field rather than agreed_with_user_email and ensure callers/CSV exporter
expect the new identifier.
In `@ami/main/models.py`:
- Around line 3224-3229: The current guard only sets occurrence.determination
when new_determination is truthy, leaving a stale determination when everything
authoritative is removed; update the logic around
new_determination/current_determination_id so that if new_determination is None
but current_determination_id is set you explicitly clear
occurrence.determination (set to None), mark needs_update = True, and log the
change (similar to the existing logger.debug), otherwise keep the existing
assignment when new_determination differs; touch the block that references
new_determination, occurrence.determination, current_determination_id, and
needs_update to implement this behavior.
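The clearing behavior requested above comes down to one rule: when nothing authoritative remains, the new determination is None, and None still counts as a change if a determination was previously set. A hedged plain-Python sketch (dicts stand in for Identification/Classification rows; the function name is invented):

```python
def resolve_determination(best_identification, best_prediction, current_taxon_id):
    """Pick the new determination taxon id; None must overwrite a stale value.

    Illustrative control flow for the comment above: a human identification
    wins, else the ML prediction, else nothing. Returns (new value,
    needs_update) so the caller writes only on real changes.
    """
    if best_identification is not None:
        new_id = best_identification["taxon_id"]
    elif best_prediction is not None:
        new_id = best_prediction["taxon_id"]
    else:
        new_id = None  # everything authoritative was removed: clear it
    return new_id, new_id != current_taxon_id

# Last identification withdrawn, no predictions left: the stale taxon 42
# must be cleared, so needs_update is True.
print(resolve_determination(None, None, 42))
```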
---
Duplicate comments:
In `@ami/exports/tests.py`:
- Line 376: The test assigns an unused second value from
self._create_occurrence_with_prediction() to classification causing RUF059;
update both call sites where you do "occurrence, classification =
self._create_occurrence_with_prediction()" (and the duplicate) to ignore the
unused value by renaming it to "_" or "_classification" (e.g., "occurrence, _ =
self._create_occurrence_with_prediction()") so the returned but unused
classification is not flagged.
In `@ami/main/models.py`:
- Around line 3067-3075: The method find_best_prediction() currently calls
self.predictions(), which applies the per-algorithm max-score prefilter and can
disagree with OccurrenceQuerySet.with_best_machine_prediction() that considers
all classifications; update find_best_prediction() to query all Classification
rows for this occurrence (e.g., via the Classification model filter by
occurrence=self or the raw reverse relation like self.classifications.all()) and
then order by "-terminal", "-score" to pick the single best prediction so it
matches the logic used in with_best_machine_prediction().
---
Nitpick comments:
In `@ami/exports/tests.py`:
- Line 465: Update the test docstring to reflect the renamed field: replace
references to "verified_by_count" with "participant_count" and adjust the
description to state that participant_count reflects the number of distinct
non-withdrawn participants (or similar wording used elsewhere in the codebase).
Locate the docstring in ami/exports/tests.py (the triple-quoted string currently
reading "Multiple identifications: verified_by_count reflects all non-withdrawn
IDs.") and change it to mention participant_count and distinct participants so
the docstring matches the assertions in the test.
📒 Files selected for processing (5)
- ami/exports/format_types.py
- ami/exports/tests.py
- ami/main/management/commands/backfill_determination_score.py
- ami/main/models.py
- ami/main/tests.py
…otation
The API's cached `best_prediction` and the CSV export's annotated
`best_machine_prediction_*` fields could pick different rows for the same
occurrence:
- `find_best_prediction()` went through `self.predictions()`, which filters to
the max-score row per algorithm **before** applying the `-terminal, -score`
ordering. A high-score non-terminal could survive dedup and then lose to a
lower-score terminal on ordering.
- `with_best_machine_prediction()` orders directly over all classifications, so
it picks the terminal winner regardless of intra-algorithm dedup.
Align `find_best_prediction()` to query classifications directly with the same
`-terminal, -score, -pk` ordering. Add `-pk` to both so ties break
deterministically.
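The divergence described above can be reproduced in a few lines of plain Python. The row data is invented; tuples stand in for Classification rows, and `order_key` encodes the `-terminal, -score, -pk` ordering:

```python
# (algorithm, terminal, score, pk): two algorithms, one terminal winner.
rows = [
    ("algo-a", False, 0.95, 1),  # high-score non-terminal
    ("algo-a", True,  0.60, 2),  # lower-score terminal from the same algorithm
    ("algo-b", True,  0.55, 3),
]

def order_key(row):
    algo, terminal, score, pk = row
    return (not terminal, -score, -pk)  # mirrors "-terminal, -score, -pk"

# with_best_machine_prediction(): order across ALL classifications.
bulk_best = min(rows, key=order_key)

# Old find_best_prediction(): keep only each algorithm's max-score row first...
per_algo_max = {}
for row in rows:
    algo, _, score, _ = row
    if algo not in per_algo_max or score > per_algo_max[algo][2]:
        per_algo_max[algo] = row
# ...then order. algo-a's terminal row was deduped away before sorting.
deduped_best = min(per_algo_max.values(), key=order_key)

print(bulk_best, deduped_best)
```

The bulk ordering picks algo-a's terminal row (pk 2), while the dedup-first path discards it and lands on algo-b's terminal row (pk 3): the same occurrence yields two different "best" predictions, which is exactly the API/CSV disagreement the fix removes.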
Also:
- Serializer `get_best_machine_prediction()` now returns a stable shape
({taxon: null, algorithm: null, score: null, ...}) when no prediction exists,
instead of null — clients read nullable fields instead of branching on type.
- `update_occurrence_determination()` now clears `occurrence.determination` when
the last non-withdrawn identification is deleted and no prediction remains,
instead of leaving a stale taxon.
Tests: new `test_api_and_csv_pick_same_best_prediction_with_mixed_terminal`
covers the divergence case.
Co-Authored-By: Claude <noreply@anthropic.com>
- Extract BEST_MACHINE_PREDICTION_ORDER as a shared constant used by both Occurrence.find_best_prediction() (row-at-a-time) and OccurrenceQuerySet.with_best_machine_prediction() (bulk annotation). Both methods now reference the constant and each other's docstrings, so future edits to the ordering touch one place and can't silently diverge.
- Revert get_best_machine_prediction() to return None when no prediction exists (it was briefly returning a dict of nulls). Annotate the method's return type as `dict | None` so drf-spectacular emits nullability.
- Add UpdateOccurrenceDeterminationTest covering the stale-determination clearing fix: withdrawing the only identification on an occurrence with no predictions now clears occurrence.determination (previously left a stale taxon in place).

Co-Authored-By: Claude <noreply@anthropic.com>
Summary
Separates machine predictions from human identifications in exports and API, so researchers see both side-by-side. Previously the `determination` was overwritten when a human verified, losing the original ML prediction.

- Extract `find_best_prediction()` and `find_best_identification()` from `update_occurrence_determination()` for reuse by exports and API
- Set `determination_score` to `None` for human-determined occurrences (ML confidence preserved in `best_machine_prediction_score`)
- Add `best_machine_prediction` nested object to the API
- Add `backfill_determination_score` management command to null-out scores on legacy human-determined occurrences (supports `--dry-run` and `--project`)
- Fix type mismatch in `update_occurrence_determination()`: the fallback path was comparing a Taxon instance to an FK id (int), causing needless save() churn
- Tighten the legacy score backfill in `Occurrence.save()` to skip human-determined occurrences (suppresses a misleading "Could not determine score" warning)
- Review fixes: `distinct=True` on counts, PII removal, withdrawn ID filtering, timezone-safe tests
- Rename `verified_by_count` to `participant_count` (distinct users); remove `best_detection_occurrence_url` (URLs not stable enough yet)

Closes #1213
New CSV export columns

| Column | Example value |
|---|---|
| `best_machine_prediction_name` | Idia aemula |
| `best_machine_prediction_algorithm` | moth-classifier-v2 |
| `best_machine_prediction_score` | 0.881 |
| `verified_by` | Jane Smith |
| `participant_count` | 2 |
| `agreed_with_algorithm` | moth-classifier-v2 |
| `agreed_with_user` | jane@example.org |
| `determination_matches_machine_prediction` | True |
| `best_detection_bbox` | [0.1, 0.1, 0.5, 0.5] |
| `best_detection_source_image_url` | https://s3.../image.jpg |

Consumers that want to follow transitive agreement chains (B agreed with A, A agreed with ML) can do so by chaining `agreed_with_user` across rows themselves — the backend exposes the direct link only.
ML prediction only (no human verification):
Human agrees with ML:
Human disagrees with ML:
Human agrees with another human:
New API field: `best_machine_prediction`

Added to `OccurrenceListSerializer` — always populated regardless of verification status:

```json
{
  "id": 993,
  "determination": {"id": 4205, "name": "Catocala relicta"},
  "determination_score": null,
  "best_machine_prediction": {
    "taxon": {"id": 4205, "name": "Idia aemula"},
    "algorithm": {"id": 3, "name": "moth-classifier-v2"},
    "score": 0.881,
    "determination_matches_machine_prediction": false
  },
  "identifications": [...]
}
```

Backfill for existing data
Run after deploy to clear `determination_score` on human-determined occurrences that predate this change:

Test plan

- `BackfillDeterminationScoreTest` covers dry-run, ML-only preserved, human-determined cleared, withdrawn-id ignored

Follow-up
- `ami/main/checks.py` (see "feat: add check_occurrences for occurrence data integrity" #1188)