Skip to content

chore: Update supported languages list#152

Open
anysoftkeyboard-bot wants to merge 1 commit into
mainfrom
chore/update-languages
Open

chore: Update supported languages list#152
anysoftkeyboard-bot wants to merge 1 commit into
mainfrom
chore/update-languages

Conversation

@anysoftkeyboard-bot
Copy link
Copy Markdown
Member

This PR updates the list of supported Wikipedia languages based on the latest statistics.

@anysoftkeyboard-bot
Copy link
Copy Markdown
Member Author

anysoftkeyboard-bot commented Apr 26, 2026

🛠 **Do**No blockers found.
🟡 **Try**Improvements for reliability.
💡 **Consider**Refactor or maintenance suggestions.

### 🟡 Try
*   **Verify Scraper Alignment**: While reviewing the diff, I noticed some significant jumps in statistics, such as **Urdu (`ur`)** increasing its `articleCount` from ~297k to ~422k (a ~42% increase). Given the current official Wikipedia stats for Urdu (approx. 214k articles), this suggests the `update_languages.py` script might be picking up data from incorrect columns or rows (e.g., "Total Pages" or "Users" instead of "Articles") if the table structure on the Wikimedia Meta page has changed. It is recommended to verify the `parse_languages` logic against the current live HTML of [List of Wikipedias](https://meta.wikimedia.org/wiki/List_of_Wikipedias).
*   **Trailing Commas in Script**: The script `scripts/update_languages.py` (lines 141-143) generates Kotlin code without a trailing comma for the last property of `SupportedLanguage`. While the auto-formatter fixes this, adding it to the template would result in cleaner script output.
    ```python
    # scripts/update_languages.py
    content += f'        activeUserCount = {lang["activeUserCount"]},\n' # Add comma
    ```

### 💡 Consider
*   **Sorting Stability

<!-- cassandra-ai-review -->

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants