bug: v0.4.12 Every new thread created is reset to 2048 token context length #2821

Propheticus · 2024-04-25T12:32:09Z

Like the title says the default value is set for every new thread. After picking a model where the model.json defines a ctx_len of e.g. 20000, the context is still set to 2048. After you manually adjust this and load the model, when you open a new thread the default is still reset to 2048. Dragging the slider in the new thread kills nitro and unloads the model.

Related? : https://github.com/janhq/jan/blob/dev/extensions/inference-nitro-extension/resources/default_settings.json

louis-jan · 2024-04-25T12:38:02Z

Hi @Propheticus, this is a minor update to ensure that new models with large context lengths (>8k) do not crash low-spec devices.

We will add a global setting to the extension or check device specs using heuristics to determine the proper settings.

Propheticus · 2024-04-25T12:50:59Z

I can understand that reason. The main issue I had is that it's done silently for new threads, even after overriding it manually when loading the model. Without checking the model's engine parameters you wouldn't know your setting was undone.

louis-jan · 2024-04-25T13:08:21Z

I can understand that reason. The main issue I had is that it's done silently for new threads, even after overriding it manually when loading the model. Without checking the model's engine parameters you wouldn't know your setting was undone.

Yeah, that sucks. We'll address this in the next release.

Van-QA · 2024-05-29T08:52:47Z

should be resolved via janhq/cortex#617

Propheticus added the type: bug Something isn't working label Apr 25, 2024

louis-jan self-assigned this Apr 25, 2024

louis-jan added this to the v0.4.13 milestone Apr 25, 2024

louis-jan modified the milestones: v0.5.0 Broken Rice, v0.5.1 May 3, 2024

Van-QA modified the milestones: v.0.5.0 🍵 Bubur Ayam, v.0.5.1 🍖 Kebap May 29, 2024

imtuyethan assigned namchuai and unassigned louis-jan Jul 2, 2024

imtuyethan added the P1: important Important feature / fix label Jul 2, 2024

Van-QA modified the milestones: v.0.5.2 🍖 Kebap, v.0.5.3 ⚡ Thunder Tea Jul 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug: v0.4.12 Every new thread created is reset to 2048 token context length #2821

bug: v0.4.12 Every new thread created is reset to 2048 token context length #2821

Propheticus commented Apr 25, 2024

louis-jan commented Apr 25, 2024 •

edited

Loading

Propheticus commented Apr 25, 2024

louis-jan commented Apr 25, 2024

Van-QA commented May 29, 2024

bug: v0.4.12 Every new thread created is reset to 2048 token context length #2821

bug: v0.4.12 Every new thread created is reset to 2048 token context length #2821

Comments

Propheticus commented Apr 25, 2024

louis-jan commented Apr 25, 2024 • edited Loading

Propheticus commented Apr 25, 2024

louis-jan commented Apr 25, 2024

Van-QA commented May 29, 2024

louis-jan commented Apr 25, 2024 •

edited

Loading