Amino acid dipeptide frequency for Sparus aurata polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random and with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
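The counting described above can be sketched in Python. This is a minimal illustration, assuming overlapping two-residue windows counted within each protein and one-letter sequences as input; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the exact procedure used by Proteome-pI may differ.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping pairs in the proteins."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs are counted within a protein, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not the Sparus aurata polyomavirus 1 proteome.
freqs = dipeptide_frequencies(["MAAGK", "AAPL"])
expected = 1 / 400  # uniform expectation over the 20 x 20 standard pairs
over = sorted(dp for dp, f in freqs.items() if f > expected)
```

The returned dictionary always has 441 keys, so dipeptides that never occur (as for every Xaa pair in the table) appear with a frequency of 0.0 rather than being omitted.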

All values are presented in per mille (‰), and therefore need to be multiplied by 10⁻³ to obtain fractions.
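As a worked example of the unit conversion, the short Python sketch below turns a table value into a fraction and compares it with the uniform expectation of 1/400. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the comparison threshold is an assumption based on the description of the colouring given on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


EXPECTED = 1 / 400  # uniform expectation for 400 dipeptides (0.25%, i.e. 2.5 per mille)


def classify(value_per_mille, expected=EXPECTED):
    """Label a table value relative to the uniform expectation (assumed threshold)."""
    fraction = per_mille_to_fraction(value_per_mille)
    if fraction > expected:
        return "over"   # would be marked in red
    if fraction < expected:
        return "under"  # would be marked in blue
    return "expected"


ala_ala = per_mille_to_fraction(18.384)  # AlaAla value from the table
fold = ala_ala / EXPECTED                # enrichment relative to 2.5 per mille
```

Here AlaAla at 18.384‰ corresponds to a fraction of 0.018384, roughly 7.35 times the uniform expectation.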

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.384 ± 10.388
AlaCys: 0.484 ± 0.338
AlaAsp: 7.257 ± 1.708
AlaGlu: 3.87 ± 1.643
AlaPhe: 0.0 ± 0.0
AlaGly: 8.224 ± 3.488
AlaHis: 0.968 ± 0.52
AlaIle: 1.451 ± 0.82
AlaLys: 3.387 ± 1.324
AlaLeu: 6.289 ± 1.739
AlaMet: 1.451 ± 0.579
AlaAsn: 1.935 ± 1.04
AlaPro: 6.773 ± 1.183
AlaGln: 4.354 ± 1.347
AlaArg: 1.935 ± 1.566
AlaSer: 4.838 ± 1.203
AlaThr: 2.903 ± 1.081
AlaVal: 8.708 ± 5.416
AlaTrp: 0.968 ± 0.708
AlaTyr: 1.451 ± 0.382
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.484 ± 0.338
CysCys: 0.0 ± 0.0
CysAsp: 1.451 ± 1.014
CysGlu: 0.0 ± 0.0
CysPhe: 0.484 ± 0.582
CysGly: 1.451 ± 1.014
CysHis: 0.0 ± 0.0
CysIle: 0.484 ± 0.338
CysLys: 0.968 ± 0.708
CysLeu: 0.968 ± 0.708
CysMet: 0.484 ± 0.338
CysAsn: 0.968 ± 0.676
CysPro: 2.419 ± 1.199
CysGln: 0.484 ± 0.734
CysArg: 0.484 ± 0.338
CysSer: 0.484 ± 0.338
CysThr: 0.484 ± 0.582
CysVal: 0.484 ± 0.338
CysTrp: 0.0 ± 0.0
CysTyr: 0.968 ± 0.52
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.87 ± 1.672
AspCys: 0.0 ± 0.0
AspAsp: 5.806 ± 1.961
AspGlu: 8.224 ± 1.706
AspPhe: 1.935 ± 0.877
AspGly: 5.322 ± 1.404
AspHis: 1.451 ± 1.175
AspIle: 2.903 ± 1.023
AspLys: 0.484 ± 0.338
AspLeu: 2.419 ± 0.461
AspMet: 2.419 ± 1.24
AspAsn: 1.935 ± 1.186
AspPro: 3.87 ± 1.405
AspGln: 1.935 ± 0.906
AspArg: 3.387 ± 1.352
AspSer: 5.806 ± 2.315
AspThr: 1.935 ± 1.289
AspVal: 2.903 ± 1.081
AspTrp: 0.484 ± 0.392
AspTyr: 1.451 ± 0.656
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.838 ± 2.244
GluCys: 0.0 ± 0.0
GluAsp: 6.289 ± 2.598
GluGlu: 6.773 ± 1.953
GluPhe: 0.968 ± 0.36
GluGly: 6.289 ± 1.37
GluHis: 1.935 ± 0.961
GluIle: 1.451 ± 0.382
GluLys: 3.387 ± 1.48
GluLeu: 5.806 ± 2.646
GluMet: 2.419 ± 0.824
GluAsn: 1.935 ± 0.41
GluPro: 1.935 ± 0.702
GluGln: 1.935 ± 0.998
GluArg: 5.806 ± 1.166
GluSer: 2.903 ± 1.133
GluThr: 2.903 ± 1.313
GluVal: 3.87 ± 0.766
GluTrp: 1.935 ± 0.877
GluTyr: 2.419 ± 0.895
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.968 ± 0.676
PheCys: 2.419 ± 1.135
PheAsp: 1.451 ± 0.382
PheGlu: 1.451 ± 1.014
PhePhe: 2.419 ± 2.096
PheGly: 1.935 ± 0.844
PheHis: 1.451 ± 0.988
PheIle: 1.935 ± 0.877
PheLys: 1.451 ± 1.116
PheLeu: 2.903 ± 1.511
PheMet: 1.451 ± 1.014
PheAsn: 0.968 ± 0.52
PhePro: 1.935 ± 0.905
PheGln: 1.451 ± 1.051
PheArg: 2.903 ± 0.85
PheSer: 1.451 ± 0.839
PheThr: 1.451 ± 0.382
PheVal: 0.484 ± 0.338
PheTrp: 1.935 ± 1.352
PheTyr: 1.935 ± 0.814
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.643 ± 5.18
GlyCys: 0.968 ± 0.676
GlyAsp: 3.387 ± 0.891
GlyGlu: 6.289 ± 1.806
GlyPhe: 4.838 ± 1.25
GlyGly: 6.289 ± 2.055
GlyHis: 2.903 ± 1.433
GlyIle: 1.935 ± 1.04
GlyLys: 3.387 ± 1.189
GlyLeu: 6.289 ± 1.345
GlyMet: 1.935 ± 0.935
GlyAsn: 0.0 ± 0.0
GlyPro: 4.354 ± 2.251
GlyGln: 2.419 ± 0.602
GlyArg: 3.87 ± 0.637
GlySer: 5.806 ± 0.871
GlyThr: 6.773 ± 1.149
GlyVal: 5.806 ± 2.724
GlyTrp: 2.903 ± 0.751
GlyTyr: 3.87 ± 1.991
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.451 ± 0.672
HisCys: 0.484 ± 0.734
HisAsp: 1.935 ± 1.566
HisGlu: 1.451 ± 0.82
HisPhe: 2.419 ± 0.873
HisGly: 1.451 ± 1.175
HisHis: 2.419 ± 1.494
HisIle: 0.484 ± 0.734
HisLys: 0.484 ± 0.392
HisLeu: 3.387 ± 0.717
HisMet: 0.484 ± 0.313
HisAsn: 1.451 ± 1.116
HisPro: 2.419 ± 1.434
HisGln: 0.968 ± 0.676
HisArg: 0.968 ± 0.36
HisSer: 2.419 ± 0.953
HisThr: 0.968 ± 0.708
HisVal: 0.484 ± 0.582
HisTrp: 0.484 ± 0.582
HisTyr: 1.451 ± 0.579
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.903 ± 0.571
IleCys: 0.0 ± 0.0
IleAsp: 1.451 ± 0.579
IleGlu: 2.419 ± 1.196
IlePhe: 0.968 ± 0.676
IleGly: 1.935 ± 1.04
IleHis: 0.484 ± 0.392
IleIle: 1.451 ± 0.672
IleLys: 2.903 ± 1.656
IleLeu: 1.451 ± 0.82
IleMet: 0.968 ± 0.673
IleAsn: 0.484 ± 0.582
IlePro: 4.838 ± 1.104
IleGln: 1.451 ± 0.382
IleArg: 0.968 ± 1.029
IleSer: 3.387 ± 2.082
IleThr: 1.451 ± 0.579
IleVal: 1.451 ± 0.672
IleTrp: 0.968 ± 0.593
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.935 ± 0.702
LysCys: 0.0 ± 0.0
LysAsp: 1.935 ± 0.906
LysGlu: 2.903 ± 1.133
LysPhe: 2.419 ± 1.444
LysGly: 3.87 ± 1.152
LysHis: 1.935 ± 0.905
LysIle: 1.451 ± 1.746
LysLys: 6.289 ± 2.754
LysLeu: 3.387 ± 1.845
LysMet: 0.968 ± 0.635
LysAsn: 0.968 ± 0.676
LysPro: 0.968 ± 0.783
LysGln: 3.87 ± 1.16
LysArg: 5.806 ± 1.833
LysSer: 3.387 ± 1.019
LysThr: 1.451 ± 1.108
LysVal: 3.87 ± 0.82
LysTrp: 1.451 ± 0.579
LysTyr: 0.968 ± 0.52
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.322 ± 1.1
LeuCys: 0.968 ± 0.676
LeuAsp: 2.419 ± 1.518
LeuGlu: 4.838 ± 0.854
LeuPhe: 4.354 ± 1.459
LeuGly: 6.773 ± 1.654
LeuHis: 0.484 ± 0.338
LeuIle: 4.354 ± 1.869
LeuLys: 4.838 ± 1.903
LeuLeu: 3.87 ± 1.811
LeuMet: 1.935 ± 0.721
LeuAsn: 3.387 ± 2.218
LeuPro: 4.354 ± 1.295
LeuGln: 2.419 ± 0.461
LeuArg: 4.838 ± 0.778
LeuSer: 5.322 ± 2.451
LeuThr: 7.257 ± 2.763
LeuVal: 0.484 ± 0.582
LeuTrp: 0.484 ± 0.338
LeuTyr: 2.419 ± 0.824
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.451 ± 0.672
MetCys: 0.968 ± 0.52
MetAsp: 0.968 ± 0.676
MetGlu: 1.451 ± 0.656
MetPhe: 0.484 ± 0.338
MetGly: 4.354 ± 1.374
MetHis: 0.484 ± 0.392
MetIle: 0.484 ± 0.338
MetLys: 0.0 ± 0.0
MetLeu: 4.354 ± 1.469
MetMet: 0.0 ± 0.0
MetAsn: 1.935 ± 0.721
MetPro: 2.419 ± 0.648
MetGln: 0.0 ± 0.0
MetArg: 1.451 ± 1.014
MetSer: 3.387 ± 0.717
MetThr: 1.935 ± 0.41
MetVal: 1.935 ± 1.04
MetTrp: 0.484 ± 0.338
MetTyr: 0.968 ± 0.676
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.419 ± 1.201
AsnCys: 0.968 ± 0.52
AsnAsp: 1.451 ± 1.116
AsnGlu: 1.935 ± 1.054
AsnPhe: 0.484 ± 0.338
AsnGly: 1.451 ± 0.82
AsnHis: 0.968 ± 0.676
AsnIle: 0.0 ± 0.0
AsnLys: 2.419 ± 0.461
AsnLeu: 2.903 ± 1.357
AsnMet: 0.968 ± 0.36
AsnAsn: 0.484 ± 0.582
AsnPro: 2.903 ± 2.338
AsnGln: 1.935 ± 0.694
AsnArg: 1.451 ± 0.672
AsnSer: 1.451 ± 1.746
AsnThr: 0.968 ± 0.36
AsnVal: 1.451 ± 1.051
AsnTrp: 0.484 ± 0.734
AsnTyr: 0.484 ± 0.392
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.87 ± 0.991
ProCys: 0.968 ± 0.676
ProAsp: 2.419 ± 1.196
ProGlu: 3.387 ± 2.892
ProPhe: 3.387 ± 1.022
ProGly: 4.838 ± 2.222
ProHis: 1.935 ± 1.381
ProIle: 0.968 ± 0.962
ProLys: 3.87 ± 1.213
ProLeu: 3.87 ± 1.566
ProMet: 1.451 ± 0.382
ProAsn: 2.419 ± 2.227
ProPro: 9.192 ± 3.13
ProGln: 3.87 ± 2.499
ProArg: 4.838 ± 2.282
ProSer: 3.87 ± 1.253
ProThr: 5.806 ± 2.143
ProVal: 7.257 ± 2.016
ProTrp: 1.451 ± 1.051
ProTyr: 0.484 ± 0.582
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.387 ± 0.817
GlnCys: 0.0 ± 0.0
GlnAsp: 0.484 ± 0.338
GlnGlu: 1.451 ± 0.831
GlnPhe: 1.451 ± 0.382
GlnGly: 3.87 ± 1.374
GlnHis: 0.968 ± 0.36
GlnIle: 0.968 ± 0.777
GlnLys: 1.451 ± 0.831
GlnLeu: 1.935 ± 0.694
GlnMet: 1.451 ± 0.672
GlnAsn: 3.387 ± 1.47
GlnPro: 2.419 ± 1.188
GlnGln: 3.387 ± 1.996
GlnArg: 3.87 ± 0.917
GlnSer: 2.419 ± 1.773
GlnThr: 3.87 ± 0.917
GlnVal: 2.903 ± 1.782
GlnTrp: 1.451 ± 0.672
GlnTyr: 1.451 ± 0.382
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.87 ± 0.637
ArgCys: 0.484 ± 0.338
ArgAsp: 5.806 ± 2.135
ArgGlu: 6.289 ± 1.943
ArgPhe: 2.419 ± 1.196
ArgGly: 4.354 ± 1.805
ArgHis: 1.451 ± 0.655
ArgIle: 1.451 ± 0.579
ArgLys: 3.387 ± 1.352
ArgLeu: 4.838 ± 1.356
ArgMet: 1.451 ± 0.831
ArgAsn: 1.451 ± 0.655
ArgPro: 2.419 ± 0.953
ArgGln: 2.903 ± 0.751
ArgArg: 6.773 ± 3.582
ArgSer: 3.87 ± 2.582
ArgThr: 1.935 ± 1.041
ArgVal: 5.806 ± 3.675
ArgTrp: 2.419 ± 0.904
ArgTyr: 0.968 ± 0.52
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.289 ± 1.748
SerCys: 1.935 ± 1.352
SerAsp: 2.903 ± 1.56
SerGlu: 2.419 ± 0.461
SerPhe: 1.451 ± 0.839
SerGly: 8.708 ± 2.088
SerHis: 1.451 ± 0.655
SerIle: 3.87 ± 0.862
SerLys: 2.419 ± 0.953
SerLeu: 4.838 ± 1.338
SerMet: 2.903 ± 1.511
SerAsn: 1.451 ± 0.579
SerPro: 4.838 ± 2.849
SerGln: 3.387 ± 0.911
SerArg: 3.387 ± 0.763
SerSer: 5.806 ± 1.591
SerThr: 1.935 ± 0.41
SerVal: 4.838 ± 0.867
SerTrp: 2.419 ± 1.444
SerTyr: 1.935 ± 0.721
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.903 ± 1.47
ThrCys: 0.968 ± 0.676
ThrAsp: 4.838 ± 1.296
ThrGlu: 3.387 ± 1.352
ThrPhe: 0.968 ± 0.962
ThrGly: 2.419 ± 1.552
ThrHis: 1.451 ± 0.579
ThrIle: 1.451 ± 0.656
ThrLys: 3.387 ± 2.308
ThrLeu: 4.838 ± 1.857
ThrMet: 0.968 ± 0.36
ThrAsn: 0.0 ± 0.0
ThrPro: 5.806 ± 2.671
ThrGln: 2.419 ± 0.648
ThrArg: 3.387 ± 2.193
ThrSer: 2.903 ± 2.338
ThrThr: 4.354 ± 1.487
ThrVal: 4.354 ± 0.916
ThrTrp: 1.451 ± 1.108
ThrTyr: 1.935 ± 0.844
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.741 ± 2.443
ValCys: 0.968 ± 0.708
ValAsp: 3.387 ± 1.707
ValGlu: 5.806 ± 1.73
ValPhe: 0.968 ± 0.676
ValGly: 6.289 ± 2.926
ValHis: 2.903 ± 1.106
ValIle: 2.419 ± 0.904
ValLys: 3.387 ± 1.022
ValLeu: 3.87 ± 1.811
ValMet: 2.419 ± 0.648
ValAsn: 1.935 ± 1.04
ValPro: 3.87 ± 0.886
ValGln: 2.419 ± 1.196
ValArg: 4.354 ± 2.458
ValSer: 6.289 ± 1.471
ValThr: 1.935 ± 0.905
ValVal: 6.773 ± 2.602
ValTrp: 0.484 ± 0.392
ValTyr: 0.484 ± 0.338
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.451 ± 0.82
TrpCys: 0.484 ± 0.734
TrpAsp: 2.419 ± 1.005
TrpGlu: 0.968 ± 0.52
TrpPhe: 0.968 ± 0.676
TrpGly: 3.387 ± 1.639
TrpHis: 0.484 ± 0.392
TrpIle: 0.484 ± 0.582
TrpLys: 0.968 ± 0.708
TrpLeu: 1.451 ± 0.82
TrpMet: 0.484 ± 0.582
TrpAsn: 0.484 ± 0.392
TrpPro: 0.0 ± 0.0
TrpGln: 0.484 ± 0.392
TrpArg: 2.419 ± 1.245
TrpSer: 1.935 ± 1.352
TrpThr: 0.484 ± 0.338
TrpVal: 2.419 ± 1.196
TrpTrp: 0.484 ± 0.734
TrpTyr: 0.484 ± 0.392
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.451 ± 0.672
TyrCys: 0.484 ± 0.338
TyrAsp: 0.484 ± 0.392
TyrGlu: 0.484 ± 0.392
TyrPhe: 0.968 ± 0.52
TyrGly: 1.451 ± 0.82
TyrHis: 1.935 ± 1.139
TyrIle: 1.935 ± 0.41
TyrLys: 1.451 ± 0.656
TyrLeu: 1.451 ± 0.656
TyrMet: 2.419 ± 0.461
TyrAsn: 0.0 ± 0.0
TyrPro: 2.419 ± 1.696
TyrGln: 0.484 ± 0.338
TyrArg: 1.451 ± 0.579
TyrSer: 1.451 ± 0.839
TyrThr: 3.387 ± 1.829
TyrVal: 2.419 ± 0.648
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.451 ± 1.108
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2068 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level
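A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, and the error is taken as the standard deviation of the resampled estimates. The function names, the toy sequences and the choice of standard deviation are hypothetical; the page states only that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of overlapping residue pairs equal to `dipeptide`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the fraction over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_fraction(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome (one-letter codes), not the real sequences.
toy = ["MAAGKAA", "MPLSV", "AAPLAA", "MGGSK", "MKRAA"]
mean = dipeptide_fraction(toy, "AA")
err = bootstrap_error(toy, "AA")
```

With only a handful of proteins, as in this proteome, each resample can differ strongly from the original set, which is one plausible reason why some errors in the table are large relative to the values themselves.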

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski