Amino acid dipepetide frequency for Antarctic penguin virus A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.361AlaAla: 7.361 ± 1.36
1.732AlaCys: 1.732 ± 0.458
2.814AlaAsp: 2.814 ± 0.894
4.763AlaGlu: 4.763 ± 0.57
2.598AlaPhe: 2.598 ± 0.65
7.361AlaGly: 7.361 ± 1.754
1.082AlaHis: 1.082 ± 0.462
5.845AlaIle: 5.845 ± 0.887
2.381AlaLys: 2.381 ± 0.67
7.144AlaLeu: 7.144 ± 2.079
1.082AlaMet: 1.082 ± 0.811
2.598AlaAsn: 2.598 ± 1.005
3.464AlaPro: 3.464 ± 0.56
4.113AlaGln: 4.113 ± 0.964
3.897AlaArg: 3.897 ± 1.337
8.01AlaSer: 8.01 ± 1.232
4.33AlaThr: 4.33 ± 1.948
5.412AlaVal: 5.412 ± 1.378
0.216AlaTrp: 0.216 ± 0.237
2.165AlaTyr: 2.165 ± 0.66
0.0AlaXaa: 0.0 ± 0.0
Cys
0.866CysAla: 0.866 ± 0.394
0.216CysCys: 0.216 ± 0.132
1.082CysAsp: 1.082 ± 0.436
0.433CysGlu: 0.433 ± 0.265
0.866CysPhe: 0.866 ± 0.264
0.866CysGly: 0.866 ± 0.331
0.649CysHis: 0.649 ± 0.276
1.732CysIle: 1.732 ± 0.739
0.866CysLys: 0.866 ± 0.624
1.515CysLeu: 1.515 ± 0.577
0.866CysMet: 0.866 ± 0.401
1.299CysAsn: 1.299 ± 0.339
0.866CysPro: 0.866 ± 0.46
1.299CysGln: 1.299 ± 0.38
1.082CysArg: 1.082 ± 0.48
2.381CysSer: 2.381 ± 0.581
1.082CysThr: 1.082 ± 0.474
1.299CysVal: 1.299 ± 0.6
0.216CysTrp: 0.216 ± 0.132
0.433CysTyr: 0.433 ± 0.23
0.0CysXaa: 0.0 ± 0.0
Asp
2.814AspAla: 2.814 ± 0.696
1.515AspCys: 1.515 ± 0.638
4.546AspAsp: 4.546 ± 1.046
1.948AspGlu: 1.948 ± 0.572
3.031AspPhe: 3.031 ± 0.521
2.598AspGly: 2.598 ± 0.628
0.866AspHis: 0.866 ± 0.271
4.546AspIle: 4.546 ± 0.977
2.165AspLys: 2.165 ± 0.606
5.629AspLeu: 5.629 ± 1.615
1.299AspMet: 1.299 ± 0.608
2.381AspAsn: 2.381 ± 0.687
4.546AspPro: 4.546 ± 0.982
2.814AspGln: 2.814 ± 0.635
1.948AspArg: 1.948 ± 0.396
2.598AspSer: 2.598 ± 0.529
2.381AspThr: 2.381 ± 0.963
1.515AspVal: 1.515 ± 0.46
0.649AspTrp: 0.649 ± 0.296
1.948AspTyr: 1.948 ± 0.294
0.0AspXaa: 0.0 ± 0.0
Glu
2.598GluAla: 2.598 ± 1.096
0.649GluCys: 0.649 ± 0.397
1.732GluAsp: 1.732 ± 0.631
2.598GluGlu: 2.598 ± 0.302
1.299GluPhe: 1.299 ± 0.6
2.165GluGly: 2.165 ± 0.519
1.082GluHis: 1.082 ± 0.464
3.247GluIle: 3.247 ± 0.918
1.732GluLys: 1.732 ± 0.702
3.68GluLeu: 3.68 ± 0.643
2.598GluMet: 2.598 ± 1.022
1.948GluAsn: 1.948 ± 0.747
1.515GluPro: 1.515 ± 0.291
1.732GluGln: 1.732 ± 0.376
1.732GluArg: 1.732 ± 0.916
3.68GluSer: 3.68 ± 1.13
3.247GluThr: 3.247 ± 0.995
2.381GluVal: 2.381 ± 0.521
0.433GluTrp: 0.433 ± 0.227
1.515GluTyr: 1.515 ± 0.695
0.0GluXaa: 0.0 ± 0.0
Phe
3.031PheAla: 3.031 ± 0.884
0.866PheCys: 0.866 ± 0.517
1.732PheAsp: 1.732 ± 0.435
1.515PheGlu: 1.515 ± 0.476
1.948PhePhe: 1.948 ± 0.697
1.732PheGly: 1.732 ± 0.962
0.866PheHis: 0.866 ± 0.292
1.948PheIle: 1.948 ± 0.537
1.732PheLys: 1.732 ± 0.591
2.598PheLeu: 2.598 ± 0.794
1.299PheMet: 1.299 ± 0.485
2.165PheAsn: 2.165 ± 0.795
1.948PhePro: 1.948 ± 0.586
0.866PheGln: 0.866 ± 0.455
0.866PheArg: 0.866 ± 0.532
2.165PheSer: 2.165 ± 1.236
2.381PheThr: 2.381 ± 0.563
2.381PheVal: 2.381 ± 0.734
0.216PheTrp: 0.216 ± 0.271
0.216PheTyr: 0.216 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
4.113GlyAla: 4.113 ± 1.654
1.299GlyCys: 1.299 ± 0.984
4.33GlyAsp: 4.33 ± 1.487
1.732GlyGlu: 1.732 ± 1.08
1.515GlyPhe: 1.515 ± 0.818
4.546GlyGly: 4.546 ± 0.788
0.433GlyHis: 0.433 ± 0.227
3.897GlyIle: 3.897 ± 0.629
3.68GlyLys: 3.68 ± 1.036
4.979GlyLeu: 4.979 ± 0.463
0.433GlyMet: 0.433 ± 0.304
3.464GlyAsn: 3.464 ± 1.043
1.515GlyPro: 1.515 ± 0.777
2.598GlyGln: 2.598 ± 0.945
4.113GlyArg: 4.113 ± 1.004
5.196GlySer: 5.196 ± 1.239
4.113GlyThr: 4.113 ± 1.499
6.062GlyVal: 6.062 ± 1.033
0.216GlyTrp: 0.216 ± 0.132
1.299GlyTyr: 1.299 ± 0.6
0.0GlyXaa: 0.0 ± 0.0
His
1.082HisAla: 1.082 ± 0.361
0.0HisCys: 0.0 ± 0.0
0.433HisAsp: 0.433 ± 0.23
0.433HisGlu: 0.433 ± 0.23
0.649HisPhe: 0.649 ± 0.276
1.948HisGly: 1.948 ± 0.799
0.0HisHis: 0.0 ± 0.0
1.515HisIle: 1.515 ± 0.351
0.433HisLys: 0.433 ± 0.225
3.247HisLeu: 3.247 ± 0.808
0.433HisMet: 0.433 ± 0.265
0.433HisAsn: 0.433 ± 0.23
1.515HisPro: 1.515 ± 0.416
0.649HisGln: 0.649 ± 0.252
1.299HisArg: 1.299 ± 0.498
1.082HisSer: 1.082 ± 0.346
1.732HisThr: 1.732 ± 0.663
0.216HisVal: 0.216 ± 0.132
0.433HisTrp: 0.433 ± 0.304
0.433HisTyr: 0.433 ± 0.265
0.0HisXaa: 0.0 ± 0.0
Ile
5.196IleAla: 5.196 ± 1.225
1.082IleCys: 1.082 ± 0.48
4.546IleAsp: 4.546 ± 0.823
2.814IleGlu: 2.814 ± 1.44
3.031IlePhe: 3.031 ± 0.854
4.113IleGly: 4.113 ± 0.661
1.082IleHis: 1.082 ± 0.716
6.711IleIle: 6.711 ± 1.517
3.68IleLys: 3.68 ± 1.119
8.227IleLeu: 8.227 ± 1.441
2.381IleMet: 2.381 ± 0.756
2.598IleAsn: 2.598 ± 1.048
3.68IlePro: 3.68 ± 0.472
5.412IleGln: 5.412 ± 1.138
3.68IleArg: 3.68 ± 0.498
6.928IleSer: 6.928 ± 0.968
4.763IleThr: 4.763 ± 0.777
3.68IleVal: 3.68 ± 0.933
0.649IleTrp: 0.649 ± 0.276
2.165IleTyr: 2.165 ± 1.158
0.0IleXaa: 0.0 ± 0.0
Lys
3.464LysAla: 3.464 ± 0.947
0.649LysCys: 0.649 ± 0.296
1.732LysAsp: 1.732 ± 0.592
1.948LysGlu: 1.948 ± 0.998
1.515LysPhe: 1.515 ± 0.413
2.381LysGly: 2.381 ± 1.173
1.299LysHis: 1.299 ± 0.449
4.33LysIle: 4.33 ± 0.765
3.031LysLys: 3.031 ± 1.553
4.979LysLeu: 4.979 ± 1.453
1.948LysMet: 1.948 ± 0.353
1.515LysAsn: 1.515 ± 0.462
1.299LysPro: 1.299 ± 0.305
2.381LysGln: 2.381 ± 0.844
1.732LysArg: 1.732 ± 0.491
3.031LysSer: 3.031 ± 0.449
1.515LysThr: 1.515 ± 0.74
3.464LysVal: 3.464 ± 2.0
0.0LysTrp: 0.0 ± 0.0
1.515LysTyr: 1.515 ± 0.239
0.0LysXaa: 0.0 ± 0.0
Leu
10.825LeuAla: 10.825 ± 2.592
2.165LeuCys: 2.165 ± 0.836
5.196LeuAsp: 5.196 ± 1.121
4.546LeuGlu: 4.546 ± 0.708
3.031LeuPhe: 3.031 ± 0.754
6.495LeuGly: 6.495 ± 1.32
1.948LeuHis: 1.948 ± 0.93
6.278LeuIle: 6.278 ± 1.015
4.113LeuLys: 4.113 ± 1.153
10.825LeuLeu: 10.825 ± 1.702
2.165LeuMet: 2.165 ± 0.567
4.763LeuAsn: 4.763 ± 0.696
4.33LeuPro: 4.33 ± 0.704
3.247LeuGln: 3.247 ± 0.536
5.845LeuArg: 5.845 ± 0.701
10.175LeuSer: 10.175 ± 1.804
8.876LeuThr: 8.876 ± 2.244
5.412LeuVal: 5.412 ± 0.364
1.515LeuTrp: 1.515 ± 0.627
4.113LeuTyr: 4.113 ± 0.842
0.0LeuXaa: 0.0 ± 0.0
Met
1.732MetAla: 1.732 ± 0.542
0.433MetCys: 0.433 ± 0.265
1.299MetAsp: 1.299 ± 0.568
2.814MetGlu: 2.814 ± 1.121
0.866MetPhe: 0.866 ± 0.53
0.866MetGly: 0.866 ± 0.754
0.216MetHis: 0.216 ± 0.132
1.948MetIle: 1.948 ± 0.712
0.866MetLys: 0.866 ± 0.499
2.381MetLeu: 2.381 ± 0.638
0.649MetMet: 0.649 ± 0.3
1.082MetAsn: 1.082 ± 0.529
0.649MetPro: 0.649 ± 0.296
2.165MetGln: 2.165 ± 0.431
0.649MetArg: 0.649 ± 0.284
2.598MetSer: 2.598 ± 0.591
2.165MetThr: 2.165 ± 0.717
1.515MetVal: 1.515 ± 0.73
0.0MetTrp: 0.0 ± 0.0
0.433MetTyr: 0.433 ± 0.23
0.0MetXaa: 0.0 ± 0.0
Asn
4.33AsnAla: 4.33 ± 0.492
1.082AsnCys: 1.082 ± 0.302
1.948AsnAsp: 1.948 ± 0.569
1.515AsnGlu: 1.515 ± 0.988
0.649AsnPhe: 0.649 ± 0.492
2.165AsnGly: 2.165 ± 0.666
0.649AsnHis: 0.649 ± 0.252
3.031AsnIle: 3.031 ± 1.325
1.948AsnLys: 1.948 ± 0.751
5.412AsnLeu: 5.412 ± 1.664
0.866AsnMet: 0.866 ± 0.401
0.866AsnAsn: 0.866 ± 0.46
3.247AsnPro: 3.247 ± 0.303
2.598AsnGln: 2.598 ± 0.975
3.247AsnArg: 3.247 ± 0.75
5.629AsnSer: 5.629 ± 1.011
1.948AsnThr: 1.948 ± 0.518
1.948AsnVal: 1.948 ± 1.096
0.866AsnTrp: 0.866 ± 0.53
1.732AsnTyr: 1.732 ± 0.438
0.0AsnXaa: 0.0 ± 0.0
Pro
3.897ProAla: 3.897 ± 0.732
0.216ProCys: 0.216 ± 0.132
3.031ProAsp: 3.031 ± 0.576
1.515ProGlu: 1.515 ± 0.771
2.165ProPhe: 2.165 ± 0.535
3.031ProGly: 3.031 ± 0.836
1.732ProHis: 1.732 ± 1.102
4.33ProIle: 4.33 ± 1.174
2.165ProLys: 2.165 ± 0.504
6.711ProLeu: 6.711 ± 0.845
0.216ProMet: 0.216 ± 0.132
1.515ProAsn: 1.515 ± 0.472
3.68ProPro: 3.68 ± 1.093
1.515ProGln: 1.515 ± 0.605
1.732ProArg: 1.732 ± 0.47
3.897ProSer: 3.897 ± 0.976
2.814ProThr: 2.814 ± 0.75
2.165ProVal: 2.165 ± 0.742
0.216ProTrp: 0.216 ± 0.132
2.165ProTyr: 2.165 ± 0.872
0.0ProXaa: 0.0 ± 0.0
Gln
4.113GlnAla: 4.113 ± 1.157
1.732GlnCys: 1.732 ± 0.672
0.866GlnAsp: 0.866 ± 0.369
1.732GlnGlu: 1.732 ± 0.651
1.515GlnPhe: 1.515 ± 0.413
3.247GlnGly: 3.247 ± 0.709
1.082GlnHis: 1.082 ± 0.495
3.897GlnIle: 3.897 ± 1.144
2.381GlnLys: 2.381 ± 0.656
5.196GlnLeu: 5.196 ± 0.822
1.515GlnMet: 1.515 ± 0.735
1.732GlnAsn: 1.732 ± 0.377
3.247GlnPro: 3.247 ± 1.223
3.247GlnGln: 3.247 ± 0.47
2.165GlnArg: 2.165 ± 0.543
4.113GlnSer: 4.113 ± 2.089
2.381GlnThr: 2.381 ± 0.792
4.33GlnVal: 4.33 ± 1.342
0.216GlnTrp: 0.216 ± 0.132
1.948GlnTyr: 1.948 ± 0.489
0.0GlnXaa: 0.0 ± 0.0
Arg
3.897ArgAla: 3.897 ± 1.418
0.866ArgCys: 0.866 ± 0.271
3.031ArgAsp: 3.031 ± 0.999
2.381ArgGlu: 2.381 ± 0.47
1.082ArgPhe: 1.082 ± 0.335
2.381ArgGly: 2.381 ± 0.408
0.866ArgHis: 0.866 ± 0.401
4.113ArgIle: 4.113 ± 0.539
3.031ArgLys: 3.031 ± 0.9
6.062ArgLeu: 6.062 ± 0.702
1.082ArgMet: 1.082 ± 0.313
2.814ArgAsn: 2.814 ± 0.816
2.381ArgPro: 2.381 ± 0.383
3.68ArgGln: 3.68 ± 0.83
1.732ArgArg: 1.732 ± 0.388
4.33ArgSer: 4.33 ± 1.233
2.165ArgThr: 2.165 ± 0.556
2.814ArgVal: 2.814 ± 0.681
0.216ArgTrp: 0.216 ± 0.271
1.515ArgTyr: 1.515 ± 0.439
0.0ArgXaa: 0.0 ± 0.0
Ser
7.144SerAla: 7.144 ± 1.462
2.165SerCys: 2.165 ± 1.134
4.979SerAsp: 4.979 ± 0.966
3.247SerGlu: 3.247 ± 0.43
1.515SerPhe: 1.515 ± 0.476
4.979SerGly: 4.979 ± 1.154
1.732SerHis: 1.732 ± 0.71
7.144SerIle: 7.144 ± 2.54
3.464SerLys: 3.464 ± 1.016
9.742SerLeu: 9.742 ± 1.61
1.515SerMet: 1.515 ± 0.563
5.196SerAsn: 5.196 ± 1.276
3.031SerPro: 3.031 ± 0.655
3.464SerGln: 3.464 ± 1.137
5.412SerArg: 5.412 ± 0.601
6.711SerSer: 6.711 ± 0.953
5.845SerThr: 5.845 ± 1.147
6.928SerVal: 6.928 ± 0.729
1.082SerTrp: 1.082 ± 0.436
3.897SerTyr: 3.897 ± 1.208
0.0SerXaa: 0.0 ± 0.0
Thr
3.68ThrAla: 3.68 ± 1.382
1.299ThrCys: 1.299 ± 0.448
2.598ThrAsp: 2.598 ± 0.576
2.381ThrGlu: 2.381 ± 1.19
1.948ThrPhe: 1.948 ± 0.665
3.68ThrGly: 3.68 ± 0.805
1.299ThrHis: 1.299 ± 0.492
4.763ThrIle: 4.763 ± 0.47
2.598ThrLys: 2.598 ± 0.758
6.711ThrLeu: 6.711 ± 0.537
2.165ThrMet: 2.165 ± 0.67
3.247ThrAsn: 3.247 ± 1.018
3.247ThrPro: 3.247 ± 1.423
2.814ThrGln: 2.814 ± 0.626
3.464ThrArg: 3.464 ± 0.885
7.577ThrSer: 7.577 ± 1.576
3.897ThrThr: 3.897 ± 1.223
2.381ThrVal: 2.381 ± 0.626
0.866ThrTrp: 0.866 ± 0.331
1.515ThrTyr: 1.515 ± 0.548
0.0ThrXaa: 0.0 ± 0.0
Val
5.412ValAla: 5.412 ± 0.696
0.649ValCys: 0.649 ± 0.296
4.113ValAsp: 4.113 ± 1.298
1.082ValGlu: 1.082 ± 0.533
1.948ValPhe: 1.948 ± 0.946
2.381ValGly: 2.381 ± 0.403
0.649ValHis: 0.649 ± 0.332
4.33ValIle: 4.33 ± 1.104
2.381ValLys: 2.381 ± 1.223
6.278ValLeu: 6.278 ± 1.268
1.515ValMet: 1.515 ± 0.464
3.247ValAsn: 3.247 ± 0.521
3.031ValPro: 3.031 ± 0.716
4.113ValGln: 4.113 ± 1.442
2.598ValArg: 2.598 ± 0.696
4.979ValSer: 4.979 ± 1.107
4.546ValThr: 4.546 ± 1.288
3.464ValVal: 3.464 ± 1.204
0.433ValTrp: 0.433 ± 0.25
2.165ValTyr: 2.165 ± 0.827
0.0ValXaa: 0.0 ± 0.0
Trp
0.649TrpAla: 0.649 ± 0.252
0.0TrpCys: 0.0 ± 0.0
0.216TrpAsp: 0.216 ± 0.132
0.649TrpGlu: 0.649 ± 0.305
0.649TrpPhe: 0.649 ± 0.252
0.649TrpGly: 0.649 ± 0.361
0.0TrpHis: 0.0 ± 0.0
0.649TrpIle: 0.649 ± 0.294
0.216TrpLys: 0.216 ± 0.132
1.082TrpLeu: 1.082 ± 0.48
0.0TrpMet: 0.0 ± 0.0
0.433TrpAsn: 0.433 ± 0.265
0.216TrpPro: 0.216 ± 0.132
0.0TrpGln: 0.0 ± 0.0
1.082TrpArg: 1.082 ± 0.472
0.866TrpSer: 0.866 ± 0.401
0.433TrpThr: 0.433 ± 0.265
0.433TrpVal: 0.433 ± 0.42
0.0TrpTrp: 0.0 ± 0.0
0.216TrpTyr: 0.216 ± 0.132
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.165TyrAla: 2.165 ± 0.59
1.299TyrCys: 1.299 ± 0.448
1.515TyrAsp: 1.515 ± 0.345
1.515TyrGlu: 1.515 ± 0.927
0.649TyrPhe: 0.649 ± 0.252
1.948TyrGly: 1.948 ± 0.679
0.433TyrHis: 0.433 ± 0.265
2.381TyrIle: 2.381 ± 0.999
0.866TyrLys: 0.866 ± 0.608
3.464TyrLeu: 3.464 ± 1.102
1.082TyrMet: 1.082 ± 0.295
2.381TyrAsn: 2.381 ± 0.425
1.299TyrPro: 1.299 ± 0.796
1.948TyrGln: 1.948 ± 0.657
2.165TyrArg: 2.165 ± 0.436
3.247TyrSer: 3.247 ± 0.626
1.515TyrThr: 1.515 ± 1.062
1.515TyrVal: 1.515 ± 0.722
0.0TyrTrp: 0.0 ± 0.0
1.082TyrTyr: 1.082 ± 0.49
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4620 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski