Amino acid dipepetide frequency for Sulfolobales Mexican fusellovirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
1.079AlaAsp: 1.079 ± 0.499
4.745AlaGlu: 4.745 ± 1.199
4.745AlaPhe: 4.745 ± 0.979
4.961AlaGly: 4.961 ± 1.075
0.647AlaHis: 0.647 ± 0.401
5.393AlaIle: 5.393 ± 1.059
4.745AlaLys: 4.745 ± 1.407
8.197AlaLeu: 8.197 ± 1.333
2.157AlaMet: 2.157 ± 0.691
2.373AlaAsn: 2.373 ± 0.919
4.53AlaPro: 4.53 ± 0.903
2.373AlaGln: 2.373 ± 0.657
3.02AlaArg: 3.02 ± 0.807
3.883AlaSer: 3.883 ± 1.03
3.667AlaThr: 3.667 ± 0.606
4.314AlaVal: 4.314 ± 1.07
1.51AlaTrp: 1.51 ± 0.499
3.02AlaTyr: 3.02 ± 0.858
0.0AlaXaa: 0.0 ± 0.0
Cys
0.431CysAla: 0.431 ± 0.277
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.079CysGlu: 1.079 ± 0.589
0.431CysPhe: 0.431 ± 0.3
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.216CysIle: 0.216 ± 0.202
0.0CysLys: 0.0 ± 0.0
0.216CysLeu: 0.216 ± 0.198
0.216CysMet: 0.216 ± 0.202
0.216CysAsn: 0.216 ± 0.191
0.647CysPro: 0.647 ± 0.431
0.216CysGln: 0.216 ± 0.198
0.431CysArg: 0.431 ± 0.283
0.216CysSer: 0.216 ± 0.195
0.216CysThr: 0.216 ± 0.254
0.431CysVal: 0.431 ± 0.265
0.0CysTrp: 0.0 ± 0.0
0.216CysTyr: 0.216 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
3.236AspAla: 3.236 ± 1.222
0.0AspCys: 0.0 ± 0.0
1.079AspAsp: 1.079 ± 0.428
1.726AspGlu: 1.726 ± 0.526
1.726AspPhe: 1.726 ± 0.509
2.157AspGly: 2.157 ± 0.835
0.431AspHis: 0.431 ± 0.251
1.726AspIle: 1.726 ± 0.834
2.373AspLys: 2.373 ± 0.734
2.804AspLeu: 2.804 ± 1.1
0.863AspMet: 0.863 ± 0.407
1.079AspAsn: 1.079 ± 0.467
1.726AspPro: 1.726 ± 0.66
0.0AspGln: 0.0 ± 0.0
1.079AspArg: 1.079 ± 0.485
1.51AspSer: 1.51 ± 0.442
2.157AspThr: 2.157 ± 0.671
2.804AspVal: 2.804 ± 0.592
0.216AspTrp: 0.216 ± 0.202
2.157AspTyr: 2.157 ± 0.697
0.0AspXaa: 0.0 ± 0.0
Glu
3.883GluAla: 3.883 ± 1.081
0.216GluCys: 0.216 ± 0.195
3.451GluAsp: 3.451 ± 0.923
10.138GluGlu: 10.138 ± 3.296
3.236GluPhe: 3.236 ± 0.837
5.608GluGly: 5.608 ± 1.656
1.941GluHis: 1.941 ± 0.716
3.667GluIle: 3.667 ± 0.859
4.098GluLys: 4.098 ± 1.254
7.118GluLeu: 7.118 ± 1.714
1.941GluMet: 1.941 ± 0.664
2.804GluAsn: 2.804 ± 0.862
3.667GluPro: 3.667 ± 1.039
1.294GluGln: 1.294 ± 0.494
2.373GluArg: 2.373 ± 0.916
2.157GluSer: 2.157 ± 0.617
3.236GluThr: 3.236 ± 0.605
5.824GluVal: 5.824 ± 1.654
1.079GluTrp: 1.079 ± 0.486
2.804GluTyr: 2.804 ± 0.684
0.0GluXaa: 0.0 ± 0.0
Phe
4.53PheAla: 4.53 ± 0.944
0.647PheCys: 0.647 ± 0.593
1.294PheAsp: 1.294 ± 0.807
3.883PheGlu: 3.883 ± 0.946
1.726PhePhe: 1.726 ± 0.519
2.157PheGly: 2.157 ± 0.506
0.863PheHis: 0.863 ± 0.294
2.804PheIle: 2.804 ± 0.971
1.941PheLys: 1.941 ± 0.603
4.961PheLeu: 4.961 ± 1.107
1.726PheMet: 1.726 ± 0.639
2.804PheAsn: 2.804 ± 0.892
1.294PhePro: 1.294 ± 0.477
1.941PheGln: 1.941 ± 0.85
1.51PheArg: 1.51 ± 0.839
2.373PheSer: 2.373 ± 0.849
5.608PheThr: 5.608 ± 1.506
2.804PheVal: 2.804 ± 0.628
0.647PheTrp: 0.647 ± 0.311
2.804PheTyr: 2.804 ± 0.702
0.0PheXaa: 0.0 ± 0.0
Gly
3.883GlyAla: 3.883 ± 1.045
0.0GlyCys: 0.0 ± 0.0
3.451GlyAsp: 3.451 ± 1.186
6.687GlyGlu: 6.687 ± 1.476
2.804GlyPhe: 2.804 ± 0.691
4.314GlyGly: 4.314 ± 1.055
0.863GlyHis: 0.863 ± 0.39
3.451GlyIle: 3.451 ± 0.746
4.314GlyLys: 4.314 ± 1.101
5.177GlyLeu: 5.177 ± 0.93
1.294GlyMet: 1.294 ± 0.36
0.431GlyAsn: 0.431 ± 0.335
1.079GlyPro: 1.079 ± 0.599
1.51GlyGln: 1.51 ± 0.455
3.236GlyArg: 3.236 ± 1.049
5.608GlySer: 5.608 ± 1.494
2.588GlyThr: 2.588 ± 0.953
6.903GlyVal: 6.903 ± 1.407
0.216GlyTrp: 0.216 ± 0.195
4.314GlyTyr: 4.314 ± 1.246
0.0GlyXaa: 0.0 ± 0.0
His
0.647HisAla: 0.647 ± 0.339
0.431HisCys: 0.431 ± 0.241
0.863HisAsp: 0.863 ± 0.507
0.647HisGlu: 0.647 ± 0.376
0.431HisPhe: 0.431 ± 0.3
0.647HisGly: 0.647 ± 0.436
0.647HisHis: 0.647 ± 0.612
1.294HisIle: 1.294 ± 0.436
1.294HisLys: 1.294 ± 0.863
1.726HisLeu: 1.726 ± 0.531
0.647HisMet: 0.647 ± 0.329
0.216HisAsn: 0.216 ± 0.181
1.294HisPro: 1.294 ± 0.384
0.0HisGln: 0.0 ± 0.0
0.647HisArg: 0.647 ± 0.431
1.51HisSer: 1.51 ± 0.541
0.647HisThr: 0.647 ± 0.513
0.863HisVal: 0.863 ± 0.588
0.216HisTrp: 0.216 ± 0.202
1.079HisTyr: 1.079 ± 0.41
0.0HisXaa: 0.0 ± 0.0
Ile
4.961IleAla: 4.961 ± 0.819
0.216IleCys: 0.216 ± 0.202
1.294IleAsp: 1.294 ± 0.611
1.726IleGlu: 1.726 ± 0.66
2.588IlePhe: 2.588 ± 1.001
3.667IleGly: 3.667 ± 1.298
0.647IleHis: 0.647 ± 0.435
5.177IleIle: 5.177 ± 1.137
3.02IleLys: 3.02 ± 0.714
6.255IleLeu: 6.255 ± 1.049
1.51IleMet: 1.51 ± 0.867
3.667IleAsn: 3.667 ± 0.846
4.745IlePro: 4.745 ± 0.994
1.726IleGln: 1.726 ± 0.594
2.157IleArg: 2.157 ± 0.686
6.04IleSer: 6.04 ± 1.295
3.667IleThr: 3.667 ± 1.241
4.961IleVal: 4.961 ± 0.899
0.647IleTrp: 0.647 ± 0.422
3.236IleTyr: 3.236 ± 0.584
0.0IleXaa: 0.0 ± 0.0
Lys
4.53LysAla: 4.53 ± 0.984
0.0LysCys: 0.0 ± 0.0
2.157LysAsp: 2.157 ± 0.984
6.687LysGlu: 6.687 ± 1.717
2.804LysPhe: 2.804 ± 0.703
3.236LysGly: 3.236 ± 1.094
1.51LysHis: 1.51 ± 0.697
3.236LysIle: 3.236 ± 0.832
5.608LysLys: 5.608 ± 1.652
6.255LysLeu: 6.255 ± 1.661
1.51LysMet: 1.51 ± 0.649
2.804LysAsn: 2.804 ± 0.947
1.941LysPro: 1.941 ± 0.597
1.726LysGln: 1.726 ± 0.852
2.804LysArg: 2.804 ± 1.045
3.667LysSer: 3.667 ± 0.727
4.53LysThr: 4.53 ± 0.687
5.177LysVal: 5.177 ± 1.437
0.863LysTrp: 0.863 ± 0.497
3.236LysTyr: 3.236 ± 1.04
0.0LysXaa: 0.0 ± 0.0
Leu
8.628LeuAla: 8.628 ± 1.275
1.079LeuCys: 1.079 ± 0.481
3.451LeuAsp: 3.451 ± 0.977
5.824LeuGlu: 5.824 ± 1.186
5.393LeuPhe: 5.393 ± 1.182
7.55LeuGly: 7.55 ± 1.167
1.294LeuHis: 1.294 ± 0.464
8.628LeuIle: 8.628 ± 1.921
5.824LeuLys: 5.824 ± 1.752
11.217LeuLeu: 11.217 ± 1.553
1.294LeuMet: 1.294 ± 0.541
4.961LeuAsn: 4.961 ± 0.909
5.393LeuPro: 5.393 ± 0.793
2.588LeuGln: 2.588 ± 1.04
6.255LeuArg: 6.255 ± 1.644
6.255LeuSer: 6.255 ± 1.082
6.255LeuThr: 6.255 ± 0.957
9.06LeuVal: 9.06 ± 1.018
0.431LeuTrp: 0.431 ± 0.289
4.314LeuTyr: 4.314 ± 1.001
0.0LeuXaa: 0.0 ± 0.0
Met
1.294MetAla: 1.294 ± 0.633
0.216MetCys: 0.216 ± 0.215
0.431MetAsp: 0.431 ± 0.266
0.431MetGlu: 0.431 ± 0.302
1.941MetPhe: 1.941 ± 0.629
2.373MetGly: 2.373 ± 0.514
0.431MetHis: 0.431 ± 0.281
0.863MetIle: 0.863 ± 0.433
1.079MetLys: 1.079 ± 0.379
2.373MetLeu: 2.373 ± 0.958
0.863MetMet: 0.863 ± 0.337
1.079MetAsn: 1.079 ± 0.484
1.51MetPro: 1.51 ± 0.549
0.863MetGln: 0.863 ± 0.407
0.863MetArg: 0.863 ± 0.457
2.373MetSer: 2.373 ± 0.596
2.157MetThr: 2.157 ± 0.684
0.647MetVal: 0.647 ± 0.402
0.0MetTrp: 0.0 ± 0.0
0.863MetTyr: 0.863 ± 0.456
0.0MetXaa: 0.0 ± 0.0
Asn
3.883AsnAla: 3.883 ± 0.937
0.431AsnCys: 0.431 ± 0.306
0.647AsnAsp: 0.647 ± 0.313
2.804AsnGlu: 2.804 ± 0.613
1.51AsnPhe: 1.51 ± 0.577
3.236AsnGly: 3.236 ± 1.089
0.431AsnHis: 0.431 ± 0.318
1.294AsnIle: 1.294 ± 0.578
2.588AsnLys: 2.588 ± 0.824
4.745AsnLeu: 4.745 ± 0.929
1.51AsnMet: 1.51 ± 0.568
1.51AsnAsn: 1.51 ± 0.61
1.294AsnPro: 1.294 ± 0.568
0.863AsnGln: 0.863 ± 0.369
1.51AsnArg: 1.51 ± 0.514
3.236AsnSer: 3.236 ± 0.689
2.804AsnThr: 2.804 ± 0.846
4.098AsnVal: 4.098 ± 0.749
1.294AsnTrp: 1.294 ± 0.504
1.51AsnTyr: 1.51 ± 0.456
0.0AsnXaa: 0.0 ± 0.0
Pro
2.157ProAla: 2.157 ± 0.79
0.647ProCys: 0.647 ± 0.431
1.294ProAsp: 1.294 ± 0.66
4.098ProGlu: 4.098 ± 1.189
1.079ProPhe: 1.079 ± 0.654
1.726ProGly: 1.726 ± 0.62
1.294ProHis: 1.294 ± 0.673
2.157ProIle: 2.157 ± 0.815
2.373ProLys: 2.373 ± 0.769
5.177ProLeu: 5.177 ± 1.143
0.647ProMet: 0.647 ± 0.364
1.726ProAsn: 1.726 ± 0.603
2.588ProPro: 2.588 ± 0.885
1.079ProGln: 1.079 ± 0.558
1.941ProArg: 1.941 ± 0.609
5.177ProSer: 5.177 ± 2.048
3.02ProThr: 3.02 ± 0.597
5.393ProVal: 5.393 ± 0.934
0.863ProTrp: 0.863 ± 0.415
2.588ProTyr: 2.588 ± 0.651
0.0ProXaa: 0.0 ± 0.0
Gln
1.079GlnAla: 1.079 ± 0.395
0.0GlnCys: 0.0 ± 0.0
0.863GlnAsp: 0.863 ± 0.429
1.726GlnGlu: 1.726 ± 0.66
1.941GlnPhe: 1.941 ± 0.523
2.373GlnGly: 2.373 ± 0.499
0.216GlnHis: 0.216 ± 0.195
1.294GlnIle: 1.294 ± 0.572
2.588GlnLys: 2.588 ± 0.915
1.726GlnLeu: 1.726 ± 0.521
0.431GlnMet: 0.431 ± 0.245
0.431GlnAsn: 0.431 ± 0.34
0.863GlnPro: 0.863 ± 0.43
0.431GlnGln: 0.431 ± 0.257
1.726GlnArg: 1.726 ± 0.724
2.588GlnSer: 2.588 ± 0.657
1.941GlnThr: 1.941 ± 0.596
1.51GlnVal: 1.51 ± 0.463
0.0GlnTrp: 0.0 ± 0.0
0.863GlnTyr: 0.863 ± 0.348
0.0GlnXaa: 0.0 ± 0.0
Arg
3.236ArgAla: 3.236 ± 1.155
0.216ArgCys: 0.216 ± 0.254
1.941ArgAsp: 1.941 ± 0.764
3.883ArgGlu: 3.883 ± 1.134
1.294ArgPhe: 1.294 ± 0.614
2.373ArgGly: 2.373 ± 0.791
0.863ArgHis: 0.863 ± 0.523
1.726ArgIle: 1.726 ± 0.642
4.53ArgLys: 4.53 ± 1.674
6.471ArgLeu: 6.471 ± 2.046
0.216ArgMet: 0.216 ± 0.219
1.726ArgAsn: 1.726 ± 0.875
1.726ArgPro: 1.726 ± 0.575
1.51ArgGln: 1.51 ± 0.534
2.804ArgArg: 2.804 ± 0.944
1.294ArgSer: 1.294 ± 0.551
1.726ArgThr: 1.726 ± 0.527
2.804ArgVal: 2.804 ± 0.842
0.0ArgTrp: 0.0 ± 0.0
1.726ArgTyr: 1.726 ± 0.677
0.0ArgXaa: 0.0 ± 0.0
Ser
4.745SerAla: 4.745 ± 0.886
0.0SerCys: 0.0 ± 0.0
1.51SerAsp: 1.51 ± 0.431
3.667SerGlu: 3.667 ± 1.133
1.726SerPhe: 1.726 ± 0.487
5.177SerGly: 5.177 ± 1.19
1.294SerHis: 1.294 ± 0.6
5.177SerIle: 5.177 ± 1.275
2.804SerLys: 2.804 ± 1.023
5.824SerLeu: 5.824 ± 0.881
1.51SerMet: 1.51 ± 0.596
2.373SerAsn: 2.373 ± 0.687
4.098SerPro: 4.098 ± 1.249
1.51SerGln: 1.51 ± 0.517
2.804SerArg: 2.804 ± 0.862
8.197SerSer: 8.197 ± 3.933
8.844SerThr: 8.844 ± 3.201
4.53SerVal: 4.53 ± 0.963
1.726SerTrp: 1.726 ± 0.632
4.098SerTyr: 4.098 ± 1.441
0.0SerXaa: 0.0 ± 0.0
Thr
4.098ThrAla: 4.098 ± 0.59
0.647ThrCys: 0.647 ± 0.315
1.726ThrAsp: 1.726 ± 0.528
3.451ThrGlu: 3.451 ± 0.822
4.961ThrPhe: 4.961 ± 1.849
4.314ThrGly: 4.314 ± 1.467
0.647ThrHis: 0.647 ± 0.301
3.883ThrIle: 3.883 ± 0.912
3.236ThrLys: 3.236 ± 0.801
7.981ThrLeu: 7.981 ± 1.109
1.941ThrMet: 1.941 ± 0.647
3.451ThrAsn: 3.451 ± 0.848
3.02ThrPro: 3.02 ± 0.889
2.804ThrGln: 2.804 ± 0.842
1.941ThrArg: 1.941 ± 0.654
6.471ThrSer: 6.471 ± 2.184
9.707ThrThr: 9.707 ± 4.441
8.197ThrVal: 8.197 ± 2.258
0.647ThrTrp: 0.647 ± 0.381
3.883ThrTyr: 3.883 ± 1.27
0.0ThrXaa: 0.0 ± 0.0
Val
6.687ValAla: 6.687 ± 0.966
0.0ValCys: 0.0 ± 0.0
2.157ValAsp: 2.157 ± 0.641
4.53ValGlu: 4.53 ± 1.44
3.236ValPhe: 3.236 ± 0.802
3.667ValGly: 3.667 ± 0.761
1.079ValHis: 1.079 ± 0.497
5.177ValIle: 5.177 ± 0.806
5.824ValLys: 5.824 ± 1.799
9.922ValLeu: 9.922 ± 1.759
1.51ValMet: 1.51 ± 0.552
4.745ValAsn: 4.745 ± 0.904
3.236ValPro: 3.236 ± 0.888
1.51ValGln: 1.51 ± 0.534
3.236ValArg: 3.236 ± 1.041
5.393ValSer: 5.393 ± 1.109
8.844ValThr: 8.844 ± 3.003
5.608ValVal: 5.608 ± 1.328
0.216ValTrp: 0.216 ± 0.212
4.745ValTyr: 4.745 ± 0.879
0.0ValXaa: 0.0 ± 0.0
Trp
0.216TrpAla: 0.216 ± 0.195
0.216TrpCys: 0.216 ± 0.191
0.216TrpAsp: 0.216 ± 0.258
0.647TrpGlu: 0.647 ± 0.362
0.647TrpPhe: 0.647 ± 0.585
0.431TrpGly: 0.431 ± 0.305
0.431TrpHis: 0.431 ± 0.257
1.079TrpIle: 1.079 ± 0.527
0.647TrpLys: 0.647 ± 0.279
1.51TrpLeu: 1.51 ± 0.587
0.0TrpMet: 0.0 ± 0.0
0.863TrpAsn: 0.863 ± 0.452
0.0TrpPro: 0.0 ± 0.0
0.216TrpGln: 0.216 ± 0.212
0.431TrpArg: 0.431 ± 0.381
0.647TrpSer: 0.647 ± 0.399
1.079TrpThr: 1.079 ± 0.458
0.863TrpVal: 0.863 ± 0.54
0.216TrpTrp: 0.216 ± 0.195
0.863TrpTyr: 0.863 ± 0.481
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.236TyrAla: 3.236 ± 0.9
0.216TyrCys: 0.216 ± 0.221
1.941TyrAsp: 1.941 ± 0.479
2.373TyrGlu: 2.373 ± 0.793
4.098TyrPhe: 4.098 ± 0.951
2.157TyrGly: 2.157 ± 0.945
0.216TyrHis: 0.216 ± 0.254
3.667TyrIle: 3.667 ± 1.064
5.393TyrLys: 5.393 ± 1.167
6.04TyrLeu: 6.04 ± 1.111
0.647TyrMet: 0.647 ± 0.272
1.941TyrAsn: 1.941 ± 0.583
2.373TyrPro: 2.373 ± 0.869
0.431TyrGln: 0.431 ± 0.243
1.294TyrArg: 1.294 ± 0.495
3.02TyrSer: 3.02 ± 0.728
4.314TyrThr: 4.314 ± 1.31
4.53TyrVal: 4.53 ± 0.842
0.431TyrTrp: 0.431 ± 0.39
3.02TyrTyr: 3.02 ± 0.776
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (4637 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski