Amino acid dipepetide frequency for Antarctic penguin virus B

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.009AlaAla: 8.009 ± 0.97
1.299AlaCys: 1.299 ± 0.433
1.948AlaAsp: 1.948 ± 0.44
3.896AlaGlu: 3.896 ± 0.479
1.515AlaPhe: 1.515 ± 0.431
5.628AlaGly: 5.628 ± 2.449
1.082AlaHis: 1.082 ± 0.439
3.896AlaIle: 3.896 ± 0.845
2.597AlaLys: 2.597 ± 0.294
6.277AlaLeu: 6.277 ± 0.892
1.948AlaMet: 1.948 ± 0.929
3.896AlaAsn: 3.896 ± 0.327
3.463AlaPro: 3.463 ± 0.818
3.68AlaGln: 3.68 ± 1.542
3.247AlaArg: 3.247 ± 0.818
8.225AlaSer: 8.225 ± 1.905
3.68AlaThr: 3.68 ± 0.938
7.359AlaVal: 7.359 ± 2.222
0.216AlaTrp: 0.216 ± 0.239
3.463AlaTyr: 3.463 ± 0.74
0.0AlaXaa: 0.0 ± 0.0
Cys
1.299CysAla: 1.299 ± 0.451
0.216CysCys: 0.216 ± 0.13
0.649CysAsp: 0.649 ± 0.262
0.866CysGlu: 0.866 ± 0.519
1.082CysPhe: 1.082 ± 0.289
1.515CysGly: 1.515 ± 0.395
0.216CysHis: 0.216 ± 0.13
1.082CysIle: 1.082 ± 0.438
0.649CysLys: 0.649 ± 0.429
1.732CysLeu: 1.732 ± 0.431
1.082CysMet: 1.082 ± 0.502
0.433CysAsn: 0.433 ± 0.259
0.649CysPro: 0.649 ± 0.682
2.597CysGln: 2.597 ± 0.587
0.866CysArg: 0.866 ± 0.324
1.948CysSer: 1.948 ± 0.602
2.165CysThr: 2.165 ± 0.661
1.082CysVal: 1.082 ± 0.455
0.0CysTrp: 0.0 ± 0.0
0.433CysTyr: 0.433 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
2.381AspAla: 2.381 ± 0.452
1.299AspCys: 1.299 ± 0.495
4.113AspAsp: 4.113 ± 0.954
1.515AspGlu: 1.515 ± 0.492
2.165AspPhe: 2.165 ± 0.678
3.463AspGly: 3.463 ± 0.36
0.866AspHis: 0.866 ± 0.416
2.381AspIle: 2.381 ± 0.461
1.948AspLys: 1.948 ± 0.491
6.061AspLeu: 6.061 ± 1.916
1.082AspMet: 1.082 ± 0.514
1.299AspAsn: 1.299 ± 0.306
4.329AspPro: 4.329 ± 0.557
3.247AspGln: 3.247 ± 0.537
2.814AspArg: 2.814 ± 0.315
3.03AspSer: 3.03 ± 1.217
2.165AspThr: 2.165 ± 0.404
2.381AspVal: 2.381 ± 0.664
0.649AspTrp: 0.649 ± 0.28
1.732AspTyr: 1.732 ± 0.184
0.0AspXaa: 0.0 ± 0.0
Glu
2.597GluAla: 2.597 ± 0.507
1.082GluCys: 1.082 ± 0.327
2.381GluAsp: 2.381 ± 0.723
3.463GluGlu: 3.463 ± 0.556
1.299GluPhe: 1.299 ± 0.357
2.381GluGly: 2.381 ± 0.669
0.866GluHis: 0.866 ± 0.256
2.814GluIle: 2.814 ± 0.223
1.515GluLys: 1.515 ± 0.448
4.978GluLeu: 4.978 ± 0.821
2.597GluMet: 2.597 ± 0.34
3.03GluAsn: 3.03 ± 0.863
1.299GluPro: 1.299 ± 0.583
0.866GluGln: 0.866 ± 0.253
1.948GluArg: 1.948 ± 0.346
4.329GluSer: 4.329 ± 1.069
3.247GluThr: 3.247 ± 0.631
3.03GluVal: 3.03 ± 0.838
0.649GluTrp: 0.649 ± 0.247
2.597GluTyr: 2.597 ± 1.047
0.0GluXaa: 0.0 ± 0.0
Phe
1.299PheAla: 1.299 ± 0.45
0.216PheCys: 0.216 ± 0.227
1.082PheAsp: 1.082 ± 0.311
1.948PheGlu: 1.948 ± 0.773
1.732PhePhe: 1.732 ± 0.682
1.515PheGly: 1.515 ± 0.631
0.433PheHis: 0.433 ± 0.259
2.165PheIle: 2.165 ± 0.829
1.299PheLys: 1.299 ± 0.462
3.03PheLeu: 3.03 ± 0.753
1.515PheMet: 1.515 ± 0.432
2.165PheAsn: 2.165 ± 0.708
2.597PhePro: 2.597 ± 0.482
0.433PheGln: 0.433 ± 0.225
1.299PheArg: 1.299 ± 0.625
3.247PheSer: 3.247 ± 0.8
2.165PheThr: 2.165 ± 0.537
2.165PheVal: 2.165 ± 1.024
0.216PheTrp: 0.216 ± 0.285
0.216PheTyr: 0.216 ± 0.229
0.0PheXaa: 0.0 ± 0.0
Gly
3.68GlyAla: 3.68 ± 0.933
1.082GlyCys: 1.082 ± 0.649
4.978GlyAsp: 4.978 ± 1.125
1.515GlyGlu: 1.515 ± 1.013
2.165GlyPhe: 2.165 ± 0.628
4.329GlyGly: 4.329 ± 1.222
0.216GlyHis: 0.216 ± 0.13
4.329GlyIle: 4.329 ± 0.819
4.329GlyLys: 4.329 ± 1.152
4.113GlyLeu: 4.113 ± 0.757
0.866GlyMet: 0.866 ± 0.58
3.68GlyAsn: 3.68 ± 0.745
1.515GlyPro: 1.515 ± 0.573
1.299GlyGln: 1.299 ± 0.676
4.113GlyArg: 4.113 ± 0.823
4.113GlySer: 4.113 ± 1.453
4.978GlyThr: 4.978 ± 1.211
6.277GlyVal: 6.277 ± 1.329
0.0GlyTrp: 0.0 ± 0.0
1.082GlyTyr: 1.082 ± 0.648
0.0GlyXaa: 0.0 ± 0.0
His
0.649HisAla: 0.649 ± 0.264
0.216HisCys: 0.216 ± 0.229
0.649HisAsp: 0.649 ± 0.262
0.433HisGlu: 0.433 ± 0.228
0.433HisPhe: 0.433 ± 0.259
0.433HisGly: 0.433 ± 0.208
0.0HisHis: 0.0 ± 0.0
0.433HisIle: 0.433 ± 0.259
0.433HisLys: 0.433 ± 0.259
2.597HisLeu: 2.597 ± 1.345
0.649HisMet: 0.649 ± 0.389
0.649HisAsn: 0.649 ± 0.262
1.732HisPro: 1.732 ± 0.62
0.649HisGln: 0.649 ± 0.298
1.732HisArg: 1.732 ± 0.534
0.866HisSer: 0.866 ± 0.332
0.866HisThr: 0.866 ± 0.355
1.082HisVal: 1.082 ± 0.311
0.216HisTrp: 0.216 ± 0.227
0.649HisTyr: 0.649 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
5.411IleAla: 5.411 ± 1.215
0.866IleCys: 0.866 ± 0.357
3.03IleAsp: 3.03 ± 0.71
2.381IleGlu: 2.381 ± 0.64
2.814IlePhe: 2.814 ± 0.446
3.247IleGly: 3.247 ± 0.834
1.515IleHis: 1.515 ± 0.528
5.628IleIle: 5.628 ± 1.187
3.68IleLys: 3.68 ± 0.93
6.926IleLeu: 6.926 ± 1.198
1.515IleMet: 1.515 ± 0.528
3.463IleAsn: 3.463 ± 1.082
3.247IlePro: 3.247 ± 0.916
4.113IleGln: 4.113 ± 1.157
2.381IleArg: 2.381 ± 0.595
6.277IleSer: 6.277 ± 1.001
3.68IleThr: 3.68 ± 0.978
3.896IleVal: 3.896 ± 0.436
1.082IleTrp: 1.082 ± 0.277
1.948IleTyr: 1.948 ± 1.029
0.0IleXaa: 0.0 ± 0.0
Lys
3.247LysAla: 3.247 ± 0.958
0.866LysCys: 0.866 ± 0.272
1.515LysAsp: 1.515 ± 0.616
2.814LysGlu: 2.814 ± 1.249
1.299LysPhe: 1.299 ± 0.459
2.381LysGly: 2.381 ± 0.643
0.866LysHis: 0.866 ± 0.312
3.68LysIle: 3.68 ± 0.59
2.814LysLys: 2.814 ± 1.438
4.978LysLeu: 4.978 ± 1.404
1.732LysMet: 1.732 ± 0.453
2.597LysAsn: 2.597 ± 0.344
2.165LysPro: 2.165 ± 0.243
2.165LysGln: 2.165 ± 0.797
1.732LysArg: 1.732 ± 0.448
3.03LysSer: 3.03 ± 0.994
3.463LysThr: 3.463 ± 0.84
4.329LysVal: 4.329 ± 1.32
0.0LysTrp: 0.0 ± 0.0
1.948LysTyr: 1.948 ± 0.542
0.0LysXaa: 0.0 ± 0.0
Leu
9.091LeuAla: 9.091 ± 1.17
1.515LeuCys: 1.515 ± 0.473
5.411LeuAsp: 5.411 ± 0.788
6.494LeuGlu: 6.494 ± 1.186
2.381LeuPhe: 2.381 ± 0.584
5.195LeuGly: 5.195 ± 0.862
2.597LeuHis: 2.597 ± 1.171
5.844LeuIle: 5.844 ± 1.413
6.71LeuLys: 6.71 ± 0.9
11.472LeuLeu: 11.472 ± 2.448
2.381LeuMet: 2.381 ± 0.671
4.762LeuAsn: 4.762 ± 0.682
3.03LeuPro: 3.03 ± 0.509
2.597LeuGln: 2.597 ± 0.398
4.545LeuArg: 4.545 ± 0.949
12.554LeuSer: 12.554 ± 2.014
9.74LeuThr: 9.74 ± 1.507
5.195LeuVal: 5.195 ± 1.126
1.515LeuTrp: 1.515 ± 0.676
4.545LeuTyr: 4.545 ± 0.633
0.0LeuXaa: 0.0 ± 0.0
Met
1.948MetAla: 1.948 ± 0.461
0.649MetCys: 0.649 ± 0.389
1.082MetAsp: 1.082 ± 0.465
1.082MetGlu: 1.082 ± 0.469
0.866MetPhe: 0.866 ± 0.389
1.299MetGly: 1.299 ± 0.529
0.0MetHis: 0.0 ± 0.0
2.165MetIle: 2.165 ± 0.338
1.515MetLys: 1.515 ± 0.577
3.463MetLeu: 3.463 ± 0.948
1.082MetMet: 1.082 ± 0.843
1.082MetAsn: 1.082 ± 0.522
1.948MetPro: 1.948 ± 0.387
1.948MetGln: 1.948 ± 1.038
1.082MetArg: 1.082 ± 0.588
2.597MetSer: 2.597 ± 0.406
1.299MetThr: 1.299 ± 0.529
1.948MetVal: 1.948 ± 0.384
0.0MetTrp: 0.0 ± 0.0
0.866MetTyr: 0.866 ± 0.519
0.0MetXaa: 0.0 ± 0.0
Asn
2.814AsnAla: 2.814 ± 0.592
2.165AsnCys: 2.165 ± 0.604
2.814AsnAsp: 2.814 ± 0.358
1.515AsnGlu: 1.515 ± 0.195
1.082AsnPhe: 1.082 ± 0.618
2.165AsnGly: 2.165 ± 0.472
1.082AsnHis: 1.082 ± 0.439
4.545AsnIle: 4.545 ± 1.443
2.165AsnLys: 2.165 ± 0.629
6.71AsnLeu: 6.71 ± 1.697
0.866AsnMet: 0.866 ± 0.339
1.515AsnAsn: 1.515 ± 0.855
3.03AsnPro: 3.03 ± 0.526
1.515AsnGln: 1.515 ± 0.209
2.814AsnArg: 2.814 ± 0.672
3.03AsnSer: 3.03 ± 0.524
2.597AsnThr: 2.597 ± 1.424
1.299AsnVal: 1.299 ± 1.035
0.866AsnTrp: 0.866 ± 0.519
1.299AsnTyr: 1.299 ± 0.462
0.0AsnXaa: 0.0 ± 0.0
Pro
3.896ProAla: 3.896 ± 0.802
0.649ProCys: 0.649 ± 0.298
2.597ProAsp: 2.597 ± 0.894
2.597ProGlu: 2.597 ± 0.566
1.732ProPhe: 1.732 ± 0.62
3.463ProGly: 3.463 ± 0.717
1.082ProHis: 1.082 ± 0.372
1.732ProIle: 1.732 ± 0.452
1.515ProLys: 1.515 ± 0.342
6.494ProLeu: 6.494 ± 0.876
0.649ProMet: 0.649 ± 0.314
1.082ProAsn: 1.082 ± 0.48
3.03ProPro: 3.03 ± 1.404
1.948ProGln: 1.948 ± 0.855
2.165ProArg: 2.165 ± 0.478
3.68ProSer: 3.68 ± 1.351
3.03ProThr: 3.03 ± 0.907
4.545ProVal: 4.545 ± 0.933
0.0ProTrp: 0.0 ± 0.0
2.165ProTyr: 2.165 ± 0.49
0.0ProXaa: 0.0 ± 0.0
Gln
3.463GlnAla: 3.463 ± 0.981
0.649GlnCys: 0.649 ± 0.247
1.732GlnAsp: 1.732 ± 0.707
1.732GlnGlu: 1.732 ± 0.534
1.082GlnPhe: 1.082 ± 0.495
3.03GlnGly: 3.03 ± 0.505
0.866GlnHis: 0.866 ± 0.476
3.68GlnIle: 3.68 ± 0.761
1.515GlnLys: 1.515 ± 0.728
4.762GlnLeu: 4.762 ± 0.296
0.866GlnMet: 0.866 ± 0.299
1.948GlnAsn: 1.948 ± 0.549
2.165GlnPro: 2.165 ± 0.706
2.381GlnGln: 2.381 ± 1.013
2.165GlnArg: 2.165 ± 1.089
4.329GlnSer: 4.329 ± 1.598
3.68GlnThr: 3.68 ± 0.984
2.814GlnVal: 2.814 ± 0.766
0.433GlnTrp: 0.433 ± 0.225
1.299GlnTyr: 1.299 ± 0.462
0.0GlnXaa: 0.0 ± 0.0
Arg
4.329ArgAla: 4.329 ± 1.105
1.299ArgCys: 1.299 ± 0.413
2.597ArgAsp: 2.597 ± 0.646
2.597ArgGlu: 2.597 ± 0.92
1.299ArgPhe: 1.299 ± 0.198
3.03ArgGly: 3.03 ± 0.419
1.515ArgHis: 1.515 ± 0.907
4.329ArgIle: 4.329 ± 0.734
2.165ArgLys: 2.165 ± 0.938
6.494ArgLeu: 6.494 ± 0.64
2.165ArgMet: 2.165 ± 0.492
2.597ArgAsn: 2.597 ± 0.957
2.165ArgPro: 2.165 ± 0.779
2.381ArgGln: 2.381 ± 0.53
2.165ArgArg: 2.165 ± 0.592
3.247ArgSer: 3.247 ± 0.534
2.381ArgThr: 2.381 ± 0.772
2.814ArgVal: 2.814 ± 0.856
0.216ArgTrp: 0.216 ± 0.285
1.515ArgTyr: 1.515 ± 0.407
0.0ArgXaa: 0.0 ± 0.0
Ser
7.576SerAla: 7.576 ± 1.371
2.165SerCys: 2.165 ± 0.829
3.68SerAsp: 3.68 ± 0.946
5.844SerGlu: 5.844 ± 1.572
1.732SerPhe: 1.732 ± 0.587
5.411SerGly: 5.411 ± 0.492
1.299SerHis: 1.299 ± 0.524
8.442SerIle: 8.442 ± 2.535
3.68SerLys: 3.68 ± 1.338
9.307SerLeu: 9.307 ± 0.897
1.515SerMet: 1.515 ± 0.568
5.195SerAsn: 5.195 ± 0.981
4.329SerPro: 4.329 ± 1.184
4.113SerGln: 4.113 ± 0.898
5.844SerArg: 5.844 ± 1.082
7.792SerSer: 7.792 ± 1.002
4.978SerThr: 4.978 ± 0.494
4.329SerVal: 4.329 ± 1.165
1.082SerTrp: 1.082 ± 0.469
2.165SerTyr: 2.165 ± 0.536
0.0SerXaa: 0.0 ± 0.0
Thr
3.896ThrAla: 3.896 ± 0.651
1.299ThrCys: 1.299 ± 0.198
2.814ThrAsp: 2.814 ± 0.602
2.597ThrGlu: 2.597 ± 0.819
2.165ThrPhe: 2.165 ± 0.307
4.113ThrGly: 4.113 ± 0.507
0.433ThrHis: 0.433 ± 0.355
3.896ThrIle: 3.896 ± 0.694
3.03ThrLys: 3.03 ± 0.715
5.844ThrLeu: 5.844 ± 0.575
1.948ThrMet: 1.948 ± 0.706
2.381ThrAsn: 2.381 ± 0.54
3.68ThrPro: 3.68 ± 0.619
3.03ThrGln: 3.03 ± 0.924
4.545ThrArg: 4.545 ± 0.969
7.792ThrSer: 7.792 ± 1.244
3.68ThrThr: 3.68 ± 0.829
5.195ThrVal: 5.195 ± 0.939
1.082ThrTrp: 1.082 ± 0.469
2.165ThrTyr: 2.165 ± 0.945
0.0ThrXaa: 0.0 ± 0.0
Val
6.061ValAla: 6.061 ± 0.671
1.299ValCys: 1.299 ± 0.405
4.113ValAsp: 4.113 ± 1.17
1.732ValGlu: 1.732 ± 0.689
2.814ValPhe: 2.814 ± 1.661
4.329ValGly: 4.329 ± 1.022
0.433ValHis: 0.433 ± 0.259
3.247ValIle: 3.247 ± 0.499
3.68ValLys: 3.68 ± 0.328
8.225ValLeu: 8.225 ± 0.915
1.515ValMet: 1.515 ± 0.458
2.381ValAsn: 2.381 ± 0.884
2.597ValPro: 2.597 ± 0.535
3.896ValGln: 3.896 ± 0.733
2.381ValArg: 2.381 ± 0.445
4.978ValSer: 4.978 ± 0.674
5.628ValThr: 5.628 ± 1.804
3.247ValVal: 3.247 ± 1.076
0.0ValTrp: 0.0 ± 0.0
2.165ValTyr: 2.165 ± 0.764
0.0ValXaa: 0.0 ± 0.0
Trp
0.649TrpAla: 0.649 ± 0.262
0.216TrpCys: 0.216 ± 0.227
0.216TrpAsp: 0.216 ± 0.13
0.649TrpGlu: 0.649 ± 0.264
0.649TrpPhe: 0.649 ± 0.262
0.433TrpGly: 0.433 ± 0.225
0.0TrpHis: 0.0 ± 0.0
0.866TrpIle: 0.866 ± 0.272
0.216TrpLys: 0.216 ± 0.13
0.866TrpLeu: 0.866 ± 0.519
0.216TrpMet: 0.216 ± 0.272
0.216TrpAsn: 0.216 ± 0.13
0.216TrpPro: 0.216 ± 0.13
0.0TrpGln: 0.0 ± 0.0
1.299TrpArg: 1.299 ± 0.475
1.082TrpSer: 1.082 ± 0.318
0.433TrpThr: 0.433 ± 0.259
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.216TrpTyr: 0.216 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.03TyrAla: 3.03 ± 0.466
1.515TyrCys: 1.515 ± 0.428
1.515TyrAsp: 1.515 ± 0.43
1.299TyrGlu: 1.299 ± 0.583
0.433TyrPhe: 0.433 ± 0.208
1.299TyrGly: 1.299 ± 0.644
0.0TyrHis: 0.0 ± 0.0
1.948TyrIle: 1.948 ± 0.339
2.165TyrLys: 2.165 ± 0.78
3.463TyrLeu: 3.463 ± 1.168
1.515TyrMet: 1.515 ± 0.712
1.515TyrAsn: 1.515 ± 0.43
0.866TyrPro: 0.866 ± 0.741
1.732TyrGln: 1.732 ± 0.305
2.165TyrArg: 2.165 ± 0.596
4.329TyrSer: 4.329 ± 1.1
1.732TyrThr: 1.732 ± 0.801
1.732TyrVal: 1.732 ± 0.184
0.216TyrTrp: 0.216 ± 0.13
1.732TyrTyr: 1.732 ± 0.677
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4621 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski