Amino acid dipepetide frequency for Haloarcula californiae icosahedral virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.479AlaAla: 11.479 ± 3.159
0.522AlaCys: 0.522 ± 0.296
12.627AlaAsp: 12.627 ± 1.534
8.139AlaGlu: 8.139 ± 0.892
3.965AlaPhe: 3.965 ± 0.592
9.6AlaGly: 9.6 ± 2.211
1.774AlaHis: 1.774 ± 0.404
3.444AlaIle: 3.444 ± 0.554
2.713AlaLys: 2.713 ± 0.746
7.722AlaLeu: 7.722 ± 1.095
2.922AlaMet: 2.922 ± 0.472
2.087AlaAsn: 2.087 ± 0.459
3.965AlaPro: 3.965 ± 0.636
3.652AlaGln: 3.652 ± 0.696
5.218AlaArg: 5.218 ± 0.662
5.426AlaSer: 5.426 ± 0.885
6.887AlaThr: 6.887 ± 0.716
8.348AlaVal: 8.348 ± 0.935
2.713AlaTrp: 2.713 ± 0.533
2.296AlaTyr: 2.296 ± 0.49
0.0AlaXaa: 0.0 ± 0.0
Cys
0.522CysAla: 0.522 ± 0.249
0.313CysCys: 0.313 ± 0.224
0.835CysAsp: 0.835 ± 0.339
0.626CysGlu: 0.626 ± 0.278
0.104CysPhe: 0.104 ± 0.11
1.357CysGly: 1.357 ± 0.529
0.209CysHis: 0.209 ± 0.144
0.104CysIle: 0.104 ± 0.093
0.104CysLys: 0.104 ± 0.093
0.313CysLeu: 0.313 ± 0.174
0.104CysMet: 0.104 ± 0.099
0.104CysAsn: 0.104 ± 0.113
0.73CysPro: 0.73 ± 0.334
0.417CysGln: 0.417 ± 0.254
0.522CysArg: 0.522 ± 0.201
0.835CysSer: 0.835 ± 0.398
0.104CysThr: 0.104 ± 0.113
0.209CysVal: 0.209 ± 0.123
0.313CysTrp: 0.313 ± 0.2
0.209CysTyr: 0.209 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
12.627AspAla: 12.627 ± 2.185
0.417AspCys: 0.417 ± 0.198
15.966AspAsp: 15.966 ± 2.722
12.627AspGlu: 12.627 ± 1.372
2.087AspPhe: 2.087 ± 0.43
12.105AspGly: 12.105 ± 1.654
1.565AspHis: 1.565 ± 0.323
1.357AspIle: 1.357 ± 0.435
0.626AspLys: 0.626 ± 0.242
8.661AspLeu: 8.661 ± 0.797
0.939AspMet: 0.939 ± 0.276
1.67AspAsn: 1.67 ± 0.6
7.305AspPro: 7.305 ± 1.016
3.548AspGln: 3.548 ± 0.485
5.426AspArg: 5.426 ± 0.653
4.383AspSer: 4.383 ± 0.572
4.8AspThr: 4.8 ± 0.896
7.722AspVal: 7.722 ± 0.743
1.044AspTrp: 1.044 ± 0.297
3.444AspTyr: 3.444 ± 0.658
0.0AspXaa: 0.0 ± 0.0
Glu
11.896GluAla: 11.896 ± 1.521
1.252GluCys: 1.252 ± 0.404
5.739GluAsp: 5.739 ± 0.692
9.392GluGlu: 9.392 ± 1.421
2.713GluPhe: 2.713 ± 0.406
6.574GluGly: 6.574 ± 0.599
2.713GluHis: 2.713 ± 0.59
3.131GluIle: 3.131 ± 0.83
3.757GluLys: 3.757 ± 0.497
3.339GluLeu: 3.339 ± 0.67
2.087GluMet: 2.087 ± 0.464
2.087GluAsn: 2.087 ± 0.54
2.296GluPro: 2.296 ± 0.661
3.652GluGln: 3.652 ± 0.646
6.157GluArg: 6.157 ± 1.178
5.635GluSer: 5.635 ± 0.896
4.07GluThr: 4.07 ± 0.609
8.139GluVal: 8.139 ± 1.051
2.296GluTrp: 2.296 ± 0.535
2.4GluTyr: 2.4 ± 0.396
0.0GluXaa: 0.0 ± 0.0
Phe
2.609PheAla: 2.609 ± 0.589
0.209PheCys: 0.209 ± 0.156
3.757PheAsp: 3.757 ± 0.597
1.878PheGlu: 1.878 ± 0.463
0.626PhePhe: 0.626 ± 0.319
3.548PheGly: 3.548 ± 0.614
0.313PheHis: 0.313 ± 0.18
0.626PheIle: 0.626 ± 0.276
0.417PheLys: 0.417 ± 0.203
1.252PheLeu: 1.252 ± 0.387
0.73PheMet: 0.73 ± 0.227
1.148PheAsn: 1.148 ± 0.331
0.626PhePro: 0.626 ± 0.212
1.252PheGln: 1.252 ± 0.343
1.67PheArg: 1.67 ± 0.338
1.357PheSer: 1.357 ± 0.401
2.296PheThr: 2.296 ± 0.522
1.774PheVal: 1.774 ± 0.432
0.626PheTrp: 0.626 ± 0.209
0.835PheTyr: 0.835 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
8.766GlyAla: 8.766 ± 2.077
0.835GlyCys: 0.835 ± 0.418
12.418GlyAsp: 12.418 ± 1.263
6.678GlyGlu: 6.678 ± 0.765
2.609GlyPhe: 2.609 ± 0.709
13.67GlyGly: 13.67 ± 1.63
1.461GlyHis: 1.461 ± 0.4
3.026GlyIle: 3.026 ± 0.571
3.131GlyLys: 3.131 ± 0.409
5.844GlyLeu: 5.844 ± 1.013
1.044GlyMet: 1.044 ± 0.316
2.4GlyAsn: 2.4 ± 0.545
3.548GlyPro: 3.548 ± 0.582
1.983GlyGln: 1.983 ± 0.68
5.322GlyArg: 5.322 ± 0.54
4.487GlySer: 4.487 ± 0.635
7.096GlyThr: 7.096 ± 1.001
7.2GlyVal: 7.2 ± 0.813
1.148GlyTrp: 1.148 ± 0.353
2.4GlyTyr: 2.4 ± 0.499
0.0GlyXaa: 0.0 ± 0.0
His
1.67HisAla: 1.67 ± 0.504
0.0HisCys: 0.0 ± 0.0
1.461HisAsp: 1.461 ± 0.52
1.565HisGlu: 1.565 ± 0.381
0.209HisPhe: 0.209 ± 0.141
1.774HisGly: 1.774 ± 0.397
0.313HisHis: 0.313 ± 0.178
1.252HisIle: 1.252 ± 0.368
0.209HisLys: 0.209 ± 0.221
1.878HisLeu: 1.878 ± 0.384
0.0HisMet: 0.0 ± 0.0
0.73HisAsn: 0.73 ± 0.209
1.148HisPro: 1.148 ± 0.373
0.209HisGln: 0.209 ± 0.167
1.878HisArg: 1.878 ± 0.445
0.73HisSer: 0.73 ± 0.278
1.252HisThr: 1.252 ± 0.46
1.357HisVal: 1.357 ± 0.288
0.0HisTrp: 0.0 ± 0.0
0.522HisTyr: 0.522 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
1.983IleAla: 1.983 ± 0.412
0.104IleCys: 0.104 ± 0.093
2.922IleAsp: 2.922 ± 0.629
2.191IleGlu: 2.191 ± 0.309
0.73IlePhe: 0.73 ± 0.309
2.922IleGly: 2.922 ± 0.596
0.313IleHis: 0.313 ± 0.181
1.252IleIle: 1.252 ± 0.314
0.835IleLys: 0.835 ± 0.317
0.417IleLeu: 0.417 ± 0.178
0.626IleMet: 0.626 ± 0.217
0.73IleAsn: 0.73 ± 0.236
1.044IlePro: 1.044 ± 0.347
1.357IleGln: 1.357 ± 0.325
1.983IleArg: 1.983 ± 0.499
2.087IleSer: 2.087 ± 0.582
0.73IleThr: 0.73 ± 0.385
1.983IleVal: 1.983 ± 0.518
0.209IleTrp: 0.209 ± 0.112
1.148IleTyr: 1.148 ± 0.261
0.0IleXaa: 0.0 ± 0.0
Lys
4.07LysAla: 4.07 ± 0.831
0.209LysCys: 0.209 ± 0.144
2.191LysAsp: 2.191 ± 0.552
1.67LysGlu: 1.67 ± 0.409
0.835LysPhe: 0.835 ± 0.232
1.252LysGly: 1.252 ± 0.3
0.626LysHis: 0.626 ± 0.256
0.835LysIle: 0.835 ± 0.359
1.67LysLys: 1.67 ± 0.505
0.939LysLeu: 0.939 ± 0.439
0.626LysMet: 0.626 ± 0.203
0.835LysAsn: 0.835 ± 0.28
1.044LysPro: 1.044 ± 0.302
0.939LysGln: 0.939 ± 0.274
2.191LysArg: 2.191 ± 0.443
1.044LysSer: 1.044 ± 0.347
2.087LysThr: 2.087 ± 0.509
1.878LysVal: 1.878 ± 0.477
0.417LysTrp: 0.417 ± 0.207
0.522LysTyr: 0.522 ± 0.269
0.0LysXaa: 0.0 ± 0.0
Leu
7.826LeuAla: 7.826 ± 0.829
0.522LeuCys: 0.522 ± 0.256
6.052LeuAsp: 6.052 ± 0.924
6.157LeuGlu: 6.157 ± 1.03
1.044LeuPhe: 1.044 ± 0.309
5.948LeuGly: 5.948 ± 0.891
1.252LeuHis: 1.252 ± 0.447
0.73LeuIle: 0.73 ± 0.288
2.191LeuLys: 2.191 ± 0.54
6.783LeuLeu: 6.783 ± 1.222
0.939LeuMet: 0.939 ± 0.226
1.252LeuAsn: 1.252 ± 0.257
3.861LeuPro: 3.861 ± 0.496
1.565LeuGln: 1.565 ± 0.497
5.844LeuArg: 5.844 ± 0.765
3.548LeuSer: 3.548 ± 0.629
4.696LeuThr: 4.696 ± 0.615
4.487LeuVal: 4.487 ± 0.584
1.044LeuTrp: 1.044 ± 0.272
2.4LeuTyr: 2.4 ± 0.503
0.0LeuXaa: 0.0 ± 0.0
Met
2.4MetAla: 2.4 ± 0.439
0.313MetCys: 0.313 ± 0.193
1.252MetAsp: 1.252 ± 0.388
1.461MetGlu: 1.461 ± 0.497
0.313MetPhe: 0.313 ± 0.197
1.461MetGly: 1.461 ± 0.284
0.0MetHis: 0.0 ± 0.0
0.104MetIle: 0.104 ± 0.093
0.104MetLys: 0.104 ± 0.093
1.357MetLeu: 1.357 ± 0.335
0.522MetMet: 0.522 ± 0.215
0.73MetAsn: 0.73 ± 0.309
0.939MetPro: 0.939 ± 0.29
0.522MetGln: 0.522 ± 0.216
1.252MetArg: 1.252 ± 0.394
1.252MetSer: 1.252 ± 0.397
1.357MetThr: 1.357 ± 0.401
0.939MetVal: 0.939 ± 0.28
0.209MetTrp: 0.209 ± 0.145
0.313MetTyr: 0.313 ± 0.145
0.0MetXaa: 0.0 ± 0.0
Asn
2.504AsnAla: 2.504 ± 0.572
0.417AsnCys: 0.417 ± 0.244
3.131AsnAsp: 3.131 ± 1.009
1.774AsnGlu: 1.774 ± 0.552
0.626AsnPhe: 0.626 ± 0.174
3.548AsnGly: 3.548 ± 0.706
0.417AsnHis: 0.417 ± 0.161
0.939AsnIle: 0.939 ± 0.248
0.522AsnLys: 0.522 ± 0.199
1.461AsnLeu: 1.461 ± 0.308
0.104AsnMet: 0.104 ± 0.103
1.044AsnAsn: 1.044 ± 0.542
2.504AsnPro: 2.504 ± 0.677
0.835AsnGln: 0.835 ± 0.331
1.461AsnArg: 1.461 ± 0.401
0.73AsnSer: 0.73 ± 0.262
1.67AsnThr: 1.67 ± 0.503
1.878AsnVal: 1.878 ± 0.587
0.522AsnTrp: 0.522 ± 0.209
1.252AsnTyr: 1.252 ± 0.307
0.0AsnXaa: 0.0 ± 0.0
Pro
5.635ProAla: 5.635 ± 0.516
0.104ProCys: 0.104 ± 0.101
6.47ProAsp: 6.47 ± 0.809
6.47ProGlu: 6.47 ± 0.909
1.252ProPhe: 1.252 ± 0.288
3.339ProGly: 3.339 ± 0.789
0.73ProHis: 0.73 ± 0.269
1.774ProIle: 1.774 ± 0.407
0.835ProLys: 0.835 ± 0.309
2.4ProLeu: 2.4 ± 0.491
0.835ProMet: 0.835 ± 0.263
1.252ProAsn: 1.252 ± 0.294
1.878ProPro: 1.878 ± 0.547
1.148ProGln: 1.148 ± 0.399
1.983ProArg: 1.983 ± 0.561
2.087ProSer: 2.087 ± 0.604
4.696ProThr: 4.696 ± 0.788
3.652ProVal: 3.652 ± 0.526
0.626ProTrp: 0.626 ± 0.241
1.044ProTyr: 1.044 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
3.757GlnAla: 3.757 ± 0.602
0.0GlnCys: 0.0 ± 0.0
3.339GlnAsp: 3.339 ± 0.396
2.922GlnGlu: 2.922 ± 0.635
1.148GlnPhe: 1.148 ± 0.379
3.548GlnGly: 3.548 ± 0.771
0.522GlnHis: 0.522 ± 0.191
0.939GlnIle: 0.939 ± 0.25
1.148GlnLys: 1.148 ± 0.342
1.357GlnLeu: 1.357 ± 0.382
0.626GlnMet: 0.626 ± 0.333
0.522GlnAsn: 0.522 ± 0.23
1.774GlnPro: 1.774 ± 0.39
2.191GlnGln: 2.191 ± 0.485
3.757GlnArg: 3.757 ± 0.714
1.774GlnSer: 1.774 ± 0.498
1.774GlnThr: 1.774 ± 0.353
2.609GlnVal: 2.609 ± 0.474
0.0GlnTrp: 0.0 ± 0.0
1.357GlnTyr: 1.357 ± 0.329
0.0GlnXaa: 0.0 ± 0.0
Arg
5.426ArgAla: 5.426 ± 0.763
0.939ArgCys: 0.939 ± 0.379
6.365ArgAsp: 6.365 ± 1.02
6.157ArgGlu: 6.157 ± 0.769
2.191ArgPhe: 2.191 ± 0.452
3.652ArgGly: 3.652 ± 0.683
0.835ArgHis: 0.835 ± 0.234
1.148ArgIle: 1.148 ± 0.336
2.504ArgLys: 2.504 ± 0.508
5.009ArgLeu: 5.009 ± 0.882
0.835ArgMet: 0.835 ± 0.348
1.357ArgAsn: 1.357 ± 0.295
2.922ArgPro: 2.922 ± 0.581
3.652ArgGln: 3.652 ± 0.566
5.844ArgArg: 5.844 ± 1.119
2.609ArgSer: 2.609 ± 0.461
4.174ArgThr: 4.174 ± 0.833
5.322ArgVal: 5.322 ± 0.663
1.565ArgTrp: 1.565 ± 0.473
2.087ArgTyr: 2.087 ± 0.477
0.0ArgXaa: 0.0 ± 0.0
Ser
5.009SerAla: 5.009 ± 0.917
0.313SerCys: 0.313 ± 0.195
5.635SerAsp: 5.635 ± 0.728
3.339SerGlu: 3.339 ± 0.766
1.461SerPhe: 1.461 ± 0.381
6.574SerGly: 6.574 ± 1.042
1.044SerHis: 1.044 ± 0.287
1.461SerIle: 1.461 ± 0.412
1.357SerLys: 1.357 ± 0.413
3.757SerLeu: 3.757 ± 0.554
0.626SerMet: 0.626 ± 0.205
2.609SerAsn: 2.609 ± 0.693
2.191SerPro: 2.191 ± 0.453
1.461SerGln: 1.461 ± 0.297
2.087SerArg: 2.087 ± 0.514
3.026SerSer: 3.026 ± 0.722
3.131SerThr: 3.131 ± 0.578
3.652SerVal: 3.652 ± 0.54
0.522SerTrp: 0.522 ± 0.28
1.252SerTyr: 1.252 ± 0.349
0.0SerXaa: 0.0 ± 0.0
Thr
6.783ThrAla: 6.783 ± 0.779
0.417ThrCys: 0.417 ± 0.289
8.035ThrAsp: 8.035 ± 1.007
6.052ThrGlu: 6.052 ± 0.836
2.296ThrPhe: 2.296 ± 0.449
5.113ThrGly: 5.113 ± 0.873
1.044ThrHis: 1.044 ± 0.356
0.73ThrIle: 0.73 ± 0.242
0.835ThrLys: 0.835 ± 0.331
5.426ThrLeu: 5.426 ± 0.735
1.044ThrMet: 1.044 ± 0.274
1.461ThrAsn: 1.461 ± 0.348
3.757ThrPro: 3.757 ± 0.601
1.878ThrGln: 1.878 ± 0.505
2.817ThrArg: 2.817 ± 0.4
2.504ThrSer: 2.504 ± 0.583
5.218ThrThr: 5.218 ± 0.81
5.739ThrVal: 5.739 ± 0.651
1.044ThrTrp: 1.044 ± 0.338
1.565ThrTyr: 1.565 ± 0.533
0.0ThrXaa: 0.0 ± 0.0
Val
6.887ValAla: 6.887 ± 0.885
0.417ValCys: 0.417 ± 0.215
6.365ValAsp: 6.365 ± 0.878
7.513ValGlu: 7.513 ± 0.829
1.67ValPhe: 1.67 ± 0.402
5.322ValGly: 5.322 ± 0.963
2.087ValHis: 2.087 ± 0.663
1.983ValIle: 1.983 ± 0.331
1.983ValLys: 1.983 ± 0.53
5.218ValLeu: 5.218 ± 1.013
1.357ValMet: 1.357 ± 0.438
3.444ValAsn: 3.444 ± 0.798
4.487ValPro: 4.487 ± 0.71
2.504ValGln: 2.504 ± 0.492
5.635ValArg: 5.635 ± 0.74
4.278ValSer: 4.278 ± 0.837
5.113ValThr: 5.113 ± 0.971
6.365ValVal: 6.365 ± 0.874
0.626ValTrp: 0.626 ± 0.269
3.652ValTyr: 3.652 ± 0.578
0.0ValXaa: 0.0 ± 0.0
Trp
1.148TrpAla: 1.148 ± 0.227
0.313TrpCys: 0.313 ± 0.254
1.044TrpAsp: 1.044 ± 0.318
1.148TrpGlu: 1.148 ± 0.351
0.313TrpPhe: 0.313 ± 0.2
1.252TrpGly: 1.252 ± 0.375
0.522TrpHis: 0.522 ± 0.289
0.313TrpIle: 0.313 ± 0.169
0.626TrpLys: 0.626 ± 0.214
1.252TrpLeu: 1.252 ± 0.386
0.313TrpMet: 0.313 ± 0.156
1.148TrpAsn: 1.148 ± 0.487
0.73TrpPro: 0.73 ± 0.262
0.626TrpGln: 0.626 ± 0.256
0.626TrpArg: 0.626 ± 0.219
0.835TrpSer: 0.835 ± 0.268
1.252TrpThr: 1.252 ± 0.319
1.148TrpVal: 1.148 ± 0.35
0.209TrpTrp: 0.209 ± 0.132
0.313TrpTyr: 0.313 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.504TyrAla: 2.504 ± 0.439
0.417TyrCys: 0.417 ± 0.22
2.713TyrAsp: 2.713 ± 1.005
1.565TyrGlu: 1.565 ± 0.409
1.357TyrPhe: 1.357 ± 0.484
2.504TyrGly: 2.504 ± 0.476
0.417TyrHis: 0.417 ± 0.238
0.417TyrIle: 0.417 ± 0.165
0.313TyrLys: 0.313 ± 0.132
3.757TyrLeu: 3.757 ± 0.925
0.417TyrMet: 0.417 ± 0.198
0.73TyrAsn: 0.73 ± 0.219
1.461TyrPro: 1.461 ± 0.463
1.67TyrGln: 1.67 ± 0.46
3.026TyrArg: 3.026 ± 0.626
1.878TyrSer: 1.878 ± 0.596
1.357TyrThr: 1.357 ± 0.329
2.504TyrVal: 2.504 ± 0.548
0.0TyrTrp: 0.0 ± 0.0
1.148TyrTyr: 1.148 ± 0.372
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (9584 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski