Amino acid dipepetide frequency for Arenavirus sp.

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.075AlaAla: 2.075 ± 0.899
0.593AlaCys: 0.593 ± 0.313
1.482AlaAsp: 1.482 ± 1.157
2.371AlaGlu: 2.371 ± 1.156
1.778AlaPhe: 1.778 ± 0.602
1.482AlaGly: 1.482 ± 0.738
0.593AlaHis: 0.593 ± 0.365
3.557AlaIle: 3.557 ± 0.732
1.482AlaLys: 1.482 ± 1.157
5.631AlaLeu: 5.631 ± 1.766
0.593AlaMet: 0.593 ± 0.365
1.186AlaAsn: 1.186 ± 1.347
0.593AlaPro: 0.593 ± 2.786
1.482AlaGln: 1.482 ± 1.225
0.889AlaArg: 0.889 ± 0.467
2.371AlaSer: 2.371 ± 0.891
2.371AlaThr: 2.371 ± 1.253
2.371AlaVal: 2.371 ± 1.253
0.296AlaTrp: 0.296 ± 0.398
0.296AlaTyr: 0.296 ± 0.162
0.0AlaXaa: 0.0 ± 0.0
Cys
0.889CysAla: 0.889 ± 0.467
0.0CysCys: 0.0 ± 0.0
1.186CysAsp: 1.186 ± 0.73
1.778CysGlu: 1.778 ± 0.975
2.075CysPhe: 2.075 ± 0.743
1.186CysGly: 1.186 ± 0.73
0.593CysHis: 0.593 ± 0.325
1.778CysIle: 1.778 ± 1.327
2.964CysLys: 2.964 ± 1.037
2.371CysLeu: 2.371 ± 1.253
0.296CysMet: 0.296 ± 0.398
2.371CysAsn: 2.371 ± 1.46
1.186CysPro: 1.186 ± 1.347
0.593CysGln: 0.593 ± 0.313
0.0CysArg: 0.0 ± 0.0
1.186CysSer: 1.186 ± 1.347
0.889CysThr: 0.889 ± 0.81
1.186CysVal: 1.186 ± 0.65
0.593CysTrp: 0.593 ± 1.524
0.593CysTyr: 0.593 ± 1.327
0.0CysXaa: 0.0 ± 0.0
Asp
2.371AspAla: 2.371 ± 0.807
0.593AspCys: 0.593 ± 0.313
2.964AspAsp: 2.964 ± 0.676
4.149AspGlu: 4.149 ± 0.659
2.075AspPhe: 2.075 ± 0.532
2.075AspGly: 2.075 ± 0.743
0.593AspHis: 0.593 ± 0.365
3.557AspIle: 3.557 ± 1.491
2.075AspLys: 2.075 ± 0.767
10.077AspLeu: 10.077 ± 1.576
1.778AspMet: 1.778 ± 0.702
1.186AspAsn: 1.186 ± 0.331
1.778AspPro: 1.778 ± 0.602
2.667AspGln: 2.667 ± 0.954
2.964AspArg: 2.964 ± 1.248
4.149AspSer: 4.149 ± 1.389
2.667AspThr: 2.667 ± 0.845
3.557AspVal: 3.557 ± 1.069
1.186AspTrp: 1.186 ± 0.65
2.371AspTyr: 2.371 ± 0.451
0.0AspXaa: 0.0 ± 0.0
Glu
2.371GluAla: 2.371 ± 0.482
2.075GluCys: 2.075 ± 1.137
2.964GluAsp: 2.964 ± 1.232
3.853GluGlu: 3.853 ± 1.228
4.446GluPhe: 4.446 ± 1.633
2.964GluGly: 2.964 ± 0.676
0.889GluHis: 0.889 ± 0.487
4.149GluIle: 4.149 ± 1.315
3.853GluLys: 3.853 ± 1.211
7.113GluLeu: 7.113 ± 1.352
0.889GluMet: 0.889 ± 0.467
4.742GluAsn: 4.742 ± 1.71
2.075GluPro: 2.075 ± 0.654
2.964GluGln: 2.964 ± 0.536
2.371GluArg: 2.371 ± 1.109
6.52GluSer: 6.52 ± 2.075
2.667GluThr: 2.667 ± 0.954
5.631GluVal: 5.631 ± 1.17
0.593GluTrp: 0.593 ± 0.325
1.778GluTyr: 1.778 ± 0.602
0.0GluXaa: 0.0 ± 0.0
Phe
0.889PheAla: 0.889 ± 0.487
0.889PheCys: 0.889 ± 0.81
1.778PheAsp: 1.778 ± 0.934
4.742PheGlu: 4.742 ± 1.71
0.889PhePhe: 0.889 ± 0.487
1.778PheGly: 1.778 ± 1.571
1.778PheHis: 1.778 ± 0.602
4.149PheIle: 4.149 ± 1.869
5.335PheLys: 5.335 ± 1.806
3.557PheLeu: 3.557 ± 0.745
0.889PheMet: 0.889 ± 0.334
1.778PheAsn: 1.778 ± 0.621
1.482PhePro: 1.482 ± 0.812
1.778PheGln: 1.778 ± 0.622
1.778PheArg: 1.778 ± 0.225
5.335PheSer: 5.335 ± 1.271
2.964PheThr: 2.964 ± 0.753
2.964PheVal: 2.964 ± 0.536
0.296PheTrp: 0.296 ± 0.456
1.778PheTyr: 1.778 ± 0.531
0.0PheXaa: 0.0 ± 0.0
Gly
1.482GlyAla: 1.482 ± 1.618
0.296GlyCys: 0.296 ± 0.456
2.667GlyAsp: 2.667 ± 1.462
2.964GlyGlu: 2.964 ± 0.676
1.186GlyPhe: 1.186 ± 0.73
2.075GlyGly: 2.075 ± 0.91
0.296GlyHis: 0.296 ± 0.162
2.371GlyIle: 2.371 ± 0.482
2.371GlyLys: 2.371 ± 1.193
5.631GlyLeu: 5.631 ± 1.388
0.889GlyMet: 0.889 ± 0.698
2.075GlyAsn: 2.075 ± 1.416
1.186GlyPro: 1.186 ± 1.249
1.778GlyGln: 1.778 ± 0.225
3.26GlyArg: 3.26 ± 1.674
3.26GlySer: 3.26 ± 0.795
0.889GlyThr: 0.889 ± 0.467
2.964GlyVal: 2.964 ± 0.753
0.889GlyTrp: 0.889 ± 0.467
2.075GlyTyr: 2.075 ± 1.066
0.296GlyXaa: 0.296 ± 0.456
His
0.296HisAla: 0.296 ± 0.162
0.593HisCys: 0.593 ± 0.616
1.482HisAsp: 1.482 ± 0.484
1.186HisGlu: 1.186 ± 0.627
1.482HisPhe: 1.482 ± 0.484
0.296HisGly: 0.296 ± 0.398
0.296HisHis: 0.296 ± 0.456
0.889HisIle: 0.889 ± 0.334
0.889HisLys: 0.889 ± 0.487
3.557HisLeu: 3.557 ± 1.718
0.0HisMet: 0.0 ± 0.0
1.186HisAsn: 1.186 ± 0.65
0.296HisPro: 0.296 ± 0.456
1.186HisGln: 1.186 ± 0.65
1.186HisArg: 1.186 ± 0.73
1.778HisSer: 1.778 ± 1.263
0.593HisThr: 0.593 ± 0.313
1.778HisVal: 1.778 ± 0.975
0.0HisTrp: 0.0 ± 0.0
0.889HisTyr: 0.889 ± 1.429
0.0HisXaa: 0.0 ± 0.0
Ile
1.778IleAla: 1.778 ± 1.181
1.778IleCys: 1.778 ± 1.253
6.52IleAsp: 6.52 ± 1.253
3.853IleGlu: 3.853 ± 1.224
2.667IlePhe: 2.667 ± 0.414
2.667IleGly: 2.667 ± 0.414
1.482IleHis: 1.482 ± 1.006
3.26IleIle: 3.26 ± 0.628
8.299IleLys: 8.299 ± 2.542
7.41IleLeu: 7.41 ± 1.057
2.371IleMet: 2.371 ± 0.451
3.853IleAsn: 3.853 ± 0.965
2.075IlePro: 2.075 ± 0.532
2.667IleGln: 2.667 ± 1.835
2.075IleArg: 2.075 ± 1.137
7.41IleSer: 7.41 ± 1.911
4.742IleThr: 4.742 ± 1.068
2.667IleVal: 2.667 ± 0.414
0.889IleTrp: 0.889 ± 0.467
1.482IleTyr: 1.482 ± 0.484
0.0IleXaa: 0.0 ± 0.0
Lys
2.075LysAla: 2.075 ± 1.252
3.26LysCys: 3.26 ± 1.241
4.446LysAsp: 4.446 ± 0.818
6.224LysGlu: 6.224 ± 1.662
4.742LysPhe: 4.742 ± 1.93
2.371LysGly: 2.371 ± 1.109
0.296LysHis: 0.296 ± 0.162
7.113LysIle: 7.113 ± 2.293
4.742LysLys: 4.742 ± 1.327
9.781LysLeu: 9.781 ± 2.271
1.482LysMet: 1.482 ± 0.812
4.149LysAsn: 4.149 ± 0.561
2.667LysPro: 2.667 ± 0.548
2.964LysGln: 2.964 ± 1.587
3.26LysArg: 3.26 ± 1.086
7.113LysSer: 7.113 ± 0.855
5.631LysThr: 5.631 ± 0.744
3.853LysVal: 3.853 ± 1.386
0.889LysTrp: 0.889 ± 0.487
2.667LysTyr: 2.667 ± 0.414
0.0LysXaa: 0.0 ± 0.0
Leu
4.149LeuAla: 4.149 ± 0.634
3.853LeuCys: 3.853 ± 1.375
2.964LeuAsp: 2.964 ± 0.949
4.446LeuGlu: 4.446 ± 0.652
6.817LeuPhe: 6.817 ± 1.237
4.742LeuGly: 4.742 ± 1.033
2.075LeuHis: 2.075 ± 0.532
10.67LeuIle: 10.67 ± 1.784
12.448LeuLys: 12.448 ± 1.903
15.412LeuLeu: 15.412 ± 2.762
3.26LeuMet: 3.26 ± 0.835
10.966LeuAsn: 10.966 ± 2.612
4.149LeuPro: 4.149 ± 0.894
3.557LeuGln: 3.557 ± 0.628
4.446LeuArg: 4.446 ± 0.676
12.448LeuSer: 12.448 ± 1.195
7.113LeuThr: 7.113 ± 4.287
6.224LeuVal: 6.224 ± 1.35
0.889LeuTrp: 0.889 ± 0.301
2.964LeuTyr: 2.964 ± 0.753
0.0LeuXaa: 0.0 ± 0.0
Met
1.186MetAla: 1.186 ± 0.868
0.0MetCys: 0.0 ± 0.0
2.075MetAsp: 2.075 ± 0.743
0.889MetGlu: 0.889 ± 0.487
1.482MetPhe: 1.482 ± 0.652
1.482MetGly: 1.482 ± 1.363
0.0MetHis: 0.0 ± 0.0
0.593MetIle: 0.593 ± 0.325
3.557MetLys: 3.557 ± 1.513
2.075MetLeu: 2.075 ± 1.317
1.778MetMet: 1.778 ± 0.621
1.186MetAsn: 1.186 ± 0.377
0.296MetPro: 0.296 ± 0.456
1.186MetGln: 1.186 ± 0.331
1.778MetArg: 1.778 ± 0.531
2.964MetSer: 2.964 ± 0.968
0.593MetThr: 0.593 ± 0.365
1.482MetVal: 1.482 ± 0.484
0.296MetTrp: 0.296 ± 0.162
0.593MetTyr: 0.593 ± 0.325
0.0MetXaa: 0.0 ± 0.0
Asn
1.482AsnAla: 1.482 ± 0.232
0.889AsnCys: 0.889 ± 2.716
4.742AsnAsp: 4.742 ± 0.668
3.26AsnGlu: 3.26 ± 0.628
2.371AsnPhe: 2.371 ± 0.919
2.667AsnGly: 2.667 ± 1.436
1.186AsnHis: 1.186 ± 1.347
2.075AsnIle: 2.075 ± 0.899
5.039AsnLys: 5.039 ± 0.816
9.781AsnLeu: 9.781 ± 1.512
0.889AsnMet: 0.889 ± 0.698
3.26AsnAsn: 3.26 ± 0.716
2.371AsnPro: 2.371 ± 0.662
1.186AsnGln: 1.186 ± 0.331
1.778AsnArg: 1.778 ± 0.621
6.52AsnSer: 6.52 ± 1.679
3.26AsnThr: 3.26 ± 0.821
2.964AsnVal: 2.964 ± 0.676
1.186AsnTrp: 1.186 ± 0.65
3.557AsnTyr: 3.557 ± 0.745
0.0AsnXaa: 0.0 ± 0.0
Pro
1.186ProAla: 1.186 ± 1.249
0.889ProCys: 0.889 ± 0.301
1.778ProAsp: 1.778 ± 1.253
0.889ProGlu: 0.889 ± 1.312
1.186ProPhe: 1.186 ± 0.65
2.371ProGly: 2.371 ± 1.09
2.075ProHis: 2.075 ± 1.317
3.853ProIle: 3.853 ± 1.808
3.853ProLys: 3.853 ± 0.573
3.853ProLeu: 3.853 ± 1.342
1.186ProMet: 1.186 ± 0.929
1.482ProAsn: 1.482 ± 1.157
0.593ProPro: 0.593 ± 1.327
0.296ProGln: 0.296 ± 0.398
1.482ProArg: 1.482 ± 0.484
2.964ProSer: 2.964 ± 1.14
2.371ProThr: 2.371 ± 1.567
0.593ProVal: 0.593 ± 0.313
0.0ProTrp: 0.0 ± 0.0
1.186ProTyr: 1.186 ± 1.753
0.0ProXaa: 0.0 ± 0.0
Gln
1.778GlnAla: 1.778 ± 0.621
0.296GlnCys: 0.296 ± 0.162
0.889GlnAsp: 0.889 ± 0.301
2.371GlnGlu: 2.371 ± 1.46
1.778GlnPhe: 1.778 ± 0.225
2.667GlnGly: 2.667 ± 1.194
0.593GlnHis: 0.593 ± 1.327
2.964GlnIle: 2.964 ± 0.464
2.964GlnLys: 2.964 ± 0.49
1.482GlnLeu: 1.482 ± 0.652
0.889GlnMet: 0.889 ± 0.301
2.371GlnAsn: 2.371 ± 0.395
1.778GlnPro: 1.778 ± 0.94
0.889GlnGln: 0.889 ± 1.194
1.778GlnArg: 1.778 ± 0.94
4.149GlnSer: 4.149 ± 2.58
2.371GlnThr: 2.371 ± 1.253
1.482GlnVal: 1.482 ± 0.68
0.0GlnTrp: 0.0 ± 0.0
1.186GlnTyr: 1.186 ± 0.377
0.0GlnXaa: 0.0 ± 0.0
Arg
0.889ArgAla: 0.889 ± 0.467
0.593ArgCys: 0.593 ± 1.327
1.778ArgAsp: 1.778 ± 0.602
4.446ArgGlu: 4.446 ± 1.153
1.482ArgPhe: 1.482 ± 0.812
1.186ArgGly: 1.186 ± 0.65
0.889ArgHis: 0.889 ± 0.334
1.186ArgIle: 1.186 ± 0.73
1.186ArgLys: 1.186 ± 1.092
8.002ArgLeu: 8.002 ± 1.399
0.593ArgMet: 0.593 ± 0.704
2.964ArgAsn: 2.964 ± 0.49
1.186ArgPro: 1.186 ± 0.65
2.371ArgGln: 2.371 ± 0.482
2.075ArgArg: 2.075 ± 1.685
2.964ArgSer: 2.964 ± 1.321
2.371ArgThr: 2.371 ± 0.451
2.075ArgVal: 2.075 ± 0.767
0.593ArgTrp: 0.593 ± 0.365
0.889ArgTyr: 0.889 ± 0.301
0.0ArgXaa: 0.0 ± 0.0
Ser
2.964SerAla: 2.964 ± 3.141
2.371SerCys: 2.371 ± 1.319
6.224SerAsp: 6.224 ± 1.305
6.224SerGlu: 6.224 ± 0.799
4.149SerPhe: 4.149 ± 1.016
2.964SerGly: 2.964 ± 0.824
2.371SerHis: 2.371 ± 0.919
5.928SerIle: 5.928 ± 0.995
7.113SerLys: 7.113 ± 2.538
9.781SerLeu: 9.781 ± 2.439
2.667SerMet: 2.667 ± 1.043
6.817SerAsn: 6.817 ± 0.677
2.964SerPro: 2.964 ± 1.285
3.557SerGln: 3.557 ± 0.688
2.964SerArg: 2.964 ± 0.828
8.002SerSer: 8.002 ± 2.17
2.371SerThr: 2.371 ± 1.74
8.002SerVal: 8.002 ± 2.827
1.186SerTrp: 1.186 ± 0.377
3.853SerTyr: 3.853 ± 0.874
0.0SerXaa: 0.0 ± 0.0
Thr
2.371ThrAla: 2.371 ± 0.807
1.778ThrCys: 1.778 ± 1.263
3.26ThrAsp: 3.26 ± 0.602
2.371ThrGlu: 2.371 ± 0.754
1.778ThrPhe: 1.778 ± 0.225
2.371ThrGly: 2.371 ± 1.253
1.482ThrHis: 1.482 ± 0.68
5.039ThrIle: 5.039 ± 1.692
5.039ThrLys: 5.039 ± 1.204
5.335ThrLeu: 5.335 ± 2.318
1.778ThrMet: 1.778 ± 0.225
1.482ThrAsn: 1.482 ± 0.652
2.667ThrPro: 2.667 ± 2.614
1.778ThrGln: 1.778 ± 0.94
0.889ThrArg: 0.889 ± 0.334
4.742ThrSer: 4.742 ± 0.731
4.446ThrThr: 4.446 ± 4.203
2.667ThrVal: 2.667 ± 1.152
1.186ThrTrp: 1.186 ± 0.331
0.889ThrTyr: 0.889 ± 0.467
0.0ThrXaa: 0.0 ± 0.0
Val
2.667ValAla: 2.667 ± 0.599
1.778ValCys: 1.778 ± 0.602
3.853ValAsp: 3.853 ± 1.228
5.039ValGlu: 5.039 ± 1.483
1.778ValPhe: 1.778 ± 0.602
1.778ValGly: 1.778 ± 0.621
1.186ValHis: 1.186 ± 0.369
3.557ValIle: 3.557 ± 0.745
3.557ValLys: 3.557 ± 0.957
6.817ValLeu: 6.817 ± 0.96
2.371ValMet: 2.371 ± 0.395
3.853ValAsn: 3.853 ± 0.573
3.26ValPro: 3.26 ± 1.787
1.186ValGln: 1.186 ± 0.369
2.371ValArg: 2.371 ± 0.919
5.039ValSer: 5.039 ± 1.107
2.964ValThr: 2.964 ± 1.503
3.26ValVal: 3.26 ± 1.029
0.296ValTrp: 0.296 ± 0.398
1.186ValTyr: 1.186 ± 0.331
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.296TrpCys: 0.296 ± 0.456
1.482TrpAsp: 1.482 ± 1.006
1.186TrpGlu: 1.186 ± 0.65
0.593TrpPhe: 0.593 ± 0.325
0.593TrpGly: 0.593 ± 0.325
0.296TrpHis: 0.296 ± 0.162
1.186TrpIle: 1.186 ± 0.369
0.889TrpLys: 0.889 ± 0.334
1.186TrpLeu: 1.186 ± 0.331
0.296TrpMet: 0.296 ± 0.456
0.296TrpAsn: 0.296 ± 0.162
0.296TrpPro: 0.296 ± 0.398
0.0TrpGln: 0.0 ± 0.0
0.296TrpArg: 0.296 ± 0.162
0.889TrpSer: 0.889 ± 0.81
0.0TrpThr: 0.0 ± 0.0
0.889TrpVal: 0.889 ± 0.487
0.296TrpTrp: 0.296 ± 0.162
0.593TrpTyr: 0.593 ± 1.524
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.593TyrAla: 0.593 ± 0.365
0.889TyrCys: 0.889 ± 0.81
0.593TyrAsp: 0.593 ± 0.325
2.667TyrGlu: 2.667 ± 1.075
2.075TyrPhe: 2.075 ± 1.137
1.186TyrGly: 1.186 ± 1.347
0.889TyrHis: 0.889 ± 0.487
1.778TyrIle: 1.778 ± 0.934
2.075TyrLys: 2.075 ± 0.532
4.742TyrLeu: 4.742 ± 1.375
0.296TyrMet: 0.296 ± 0.162
3.26TyrAsn: 3.26 ± 1.635
1.186TyrPro: 1.186 ± 0.627
0.593TyrGln: 0.593 ± 1.327
2.075TyrArg: 2.075 ± 0.767
2.667TyrSer: 2.667 ± 1.209
2.075TyrThr: 2.075 ± 1.177
1.186TyrVal: 1.186 ± 0.65
0.0TyrTrp: 0.0 ± 0.0
1.482TyrTyr: 1.482 ± 0.812
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.296XaaIle: 0.296 ± 0.456
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3375 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski