Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
31
10
Xiaoran Liu (SII)
SII-xrliu
Follow
yuhangzang's profile picture
LighterDarkness's profile picture
Singhoo's profile picture
11 followers
·
33 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 23 hours ago
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
authored
a paper
4 days ago
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
updated
a collection
4 days ago
RoPE++
View all activity
Organizations
None yet
SII-xrliu
's models
24
Sort: Recently updated
SII-xrliu/RoPEPP_EC-DCLM-1_5B-32k
1B
•
Updated
4 days ago
•
8
SII-xrliu/RoPEPP_EH-DCLM-1_5B-32k
1B
•
Updated
4 days ago
•
10
SII-xrliu/RoPE-DCLM-1_5B-32k
1B
•
Updated
4 days ago
•
8
SII-xrliu/RoPEPP_EC-DCLM-1_5B-4k
1B
•
Updated
4 days ago
•
9
SII-xrliu/RoPEPP_EH-DCLM-1_5B-4k
1B
•
Updated
4 days ago
•
5
SII-xrliu/RoPE-DCLM-1_5B-4k
1B
•
Updated
4 days ago
•
9
SII-xrliu/RoPEPP_EC-DCLM-776M-32k
0.8B
•
Updated
4 days ago
•
5
SII-xrliu/RoPEPP_EH-DCLM-776M-32k
0.7B
•
Updated
4 days ago
•
6
SII-xrliu/RoPE-DCLM-776M-32k
0.8B
•
Updated
4 days ago
•
8
SII-xrliu/Pythia-DCLM-776M-4k
0.8B
•
Updated
4 days ago
•
9
SII-xrliu/FoPE-DCLM-776M-4k
0.8B
•
Updated
4 days ago
•
22
SII-xrliu/ALiBi-DCLM-776M-4k
0.8B
•
Updated
4 days ago
•
6
SII-xrliu/RoPEPP_EC-DCLM-776M-4k
0.8B
•
Updated
4 days ago
•
5
SII-xrliu/RoPEPP_EH-DCLM-776M-4k
0.7B
•
Updated
4 days ago
•
7
SII-xrliu/RoPE-DCLM-776M-4k
0.8B
•
Updated
4 days ago
•
7
SII-xrliu/FoPE-DCLM-376M-4k
0.4B
•
Updated
5 days ago
•
18
SII-xrliu/RoPEPP_EC-DCLM-376M-32k
0.4B
•
Updated
6 days ago
•
8
SII-xrliu/RoPEPP_EH-DCLM-376M-32k
0.4B
•
Updated
6 days ago
•
13
SII-xrliu/RoPE-DCLM-376M-32k
0.4B
•
Updated
6 days ago
•
5
SII-xrliu/RoPEPP_EC-DCLM-376M-4k
0.4B
•
Updated
6 days ago
•
12
SII-xrliu/RoPEPP_EH-DCLM-376M-4k
0.4B
•
Updated
6 days ago
•
21
SII-xrliu/Pythia-DCLM-376M-4k
0.4B
•
Updated
6 days ago
•
7
SII-xrliu/ALiBi-DCLM-376M-4k
0.4B
•
Updated
6 days ago
•
6
SII-xrliu/RoPE-DCLM-376M-4k
0.4B
•
Updated
6 days ago
•
19