Amino acid dipepetide frequency for California sea lion polyomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.161AlaAla: 7.161 ± 2.759
0.512AlaCys: 0.512 ± 0.791
0.0AlaAsp: 0.0 ± 0.0
1.023AlaGlu: 1.023 ± 0.813
2.046AlaPhe: 2.046 ± 1.124
4.604AlaGly: 4.604 ± 0.943
0.0AlaHis: 0.0 ± 0.0
2.558AlaIle: 2.558 ± 1.161
3.581AlaLys: 3.581 ± 0.785
8.184AlaLeu: 8.184 ± 2.022
1.023AlaMet: 1.023 ± 0.448
1.535AlaAsn: 1.535 ± 0.749
3.581AlaPro: 3.581 ± 0.76
2.558AlaGln: 2.558 ± 0.896
3.581AlaArg: 3.581 ± 1.225
4.604AlaSer: 4.604 ± 2.546
3.069AlaThr: 3.069 ± 1.354
3.581AlaVal: 3.581 ± 0.976
0.512AlaTrp: 0.512 ± 0.357
1.023AlaTyr: 1.023 ± 0.953
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.023CysCys: 1.023 ± 0.448
0.512CysAsp: 0.512 ± 0.431
0.0CysGlu: 0.0 ± 0.0
0.512CysPhe: 0.512 ± 0.63
0.0CysGly: 0.0 ± 0.0
0.512CysHis: 0.512 ± 0.357
0.0CysIle: 0.0 ± 0.0
3.069CysLys: 3.069 ± 1.167
3.069CysLeu: 3.069 ± 1.697
1.023CysMet: 1.023 ± 0.714
0.512CysAsn: 0.512 ± 0.431
1.535CysPro: 1.535 ± 1.823
0.512CysGln: 0.512 ± 0.357
1.023CysArg: 1.023 ± 0.714
0.512CysSer: 0.512 ± 0.357
0.512CysThr: 0.512 ± 0.357
1.023CysVal: 1.023 ± 0.657
1.023CysTrp: 1.023 ± 0.647
2.046CysTyr: 2.046 ± 1.807
0.0CysXaa: 0.0 ± 0.0
Asp
1.023AspAla: 1.023 ± 0.448
0.0AspCys: 0.0 ± 0.0
4.604AspAsp: 4.604 ± 1.977
5.115AspGlu: 5.115 ± 0.474
0.512AspPhe: 0.512 ± 0.357
2.558AspGly: 2.558 ± 0.783
0.512AspHis: 0.512 ± 0.357
4.092AspIle: 4.092 ± 0.667
6.65AspLys: 6.65 ± 1.254
8.184AspLeu: 8.184 ± 1.558
2.046AspMet: 2.046 ± 0.466
2.046AspAsn: 2.046 ± 0.789
3.581AspPro: 3.581 ± 1.361
1.535AspGln: 1.535 ± 1.071
1.535AspArg: 1.535 ± 0.849
3.069AspSer: 3.069 ± 1.608
1.023AspThr: 1.023 ± 0.448
4.604AspVal: 4.604 ± 1.121
2.558AspTrp: 2.558 ± 1.457
3.069AspTyr: 3.069 ± 0.973
0.0AspXaa: 0.0 ± 0.0
Glu
3.069GluAla: 3.069 ± 1.225
1.023GluCys: 1.023 ± 0.448
10.742GluAsp: 10.742 ± 1.767
9.207GluGlu: 9.207 ± 2.62
2.558GluPhe: 2.558 ± 0.617
3.069GluGly: 3.069 ± 0.872
1.535GluHis: 1.535 ± 0.804
3.069GluIle: 3.069 ± 0.632
5.115GluLys: 5.115 ± 1.686
7.673GluLeu: 7.673 ± 1.298
1.023GluMet: 1.023 ± 0.644
5.627GluAsn: 5.627 ± 1.542
1.535GluPro: 1.535 ± 0.804
0.512GluGln: 0.512 ± 0.357
1.023GluArg: 1.023 ± 0.448
4.604GluSer: 4.604 ± 0.827
5.627GluThr: 5.627 ± 0.641
5.627GluVal: 5.627 ± 1.682
0.512GluTrp: 0.512 ± 0.357
1.023GluTyr: 1.023 ± 0.657
0.0GluXaa: 0.0 ± 0.0
Phe
2.046PheAla: 2.046 ± 1.026
0.512PheCys: 0.512 ± 0.357
1.023PheAsp: 1.023 ± 0.657
3.581PheGlu: 3.581 ± 1.193
1.535PhePhe: 1.535 ± 0.749
4.604PheGly: 4.604 ± 1.242
0.512PheHis: 0.512 ± 0.357
1.535PheIle: 1.535 ± 0.849
1.023PheLys: 1.023 ± 0.448
3.069PheLeu: 3.069 ± 1.344
1.023PheMet: 1.023 ± 1.165
1.535PheAsn: 1.535 ± 0.442
3.581PhePro: 3.581 ± 0.817
1.023PheGln: 1.023 ± 0.714
1.023PheArg: 1.023 ± 0.448
2.558PheSer: 2.558 ± 1.629
1.023PheThr: 1.023 ± 0.49
1.535PheVal: 1.535 ± 0.681
1.535PheTrp: 1.535 ± 1.025
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.115GlyAla: 5.115 ± 2.583
0.512GlyCys: 0.512 ± 0.357
8.184GlyAsp: 8.184 ± 0.755
4.604GlyGlu: 4.604 ± 2.149
3.581GlyPhe: 3.581 ± 1.227
6.138GlyGly: 6.138 ± 0.801
3.581GlyHis: 3.581 ± 0.867
4.092GlyIle: 4.092 ± 0.659
3.069GlyLys: 3.069 ± 1.003
6.138GlyLeu: 6.138 ± 1.15
1.535GlyMet: 1.535 ± 0.849
1.535GlyAsn: 1.535 ± 0.849
7.161GlyPro: 7.161 ± 2.144
2.046GlyGln: 2.046 ± 0.887
1.535GlyArg: 1.535 ± 1.025
2.558GlySer: 2.558 ± 1.106
4.604GlyThr: 4.604 ± 1.15
6.65GlyVal: 6.65 ± 1.056
0.0GlyTrp: 0.0 ± 0.0
0.512GlyTyr: 0.512 ± 0.357
0.0GlyXaa: 0.0 ± 0.0
His
3.069HisAla: 3.069 ± 1.499
2.046HisCys: 2.046 ± 1.428
0.0HisAsp: 0.0 ± 0.0
2.046HisGlu: 2.046 ± 0.997
0.512HisPhe: 0.512 ± 0.431
1.023HisGly: 1.023 ± 0.448
0.512HisHis: 0.512 ± 0.357
0.512HisIle: 0.512 ± 0.63
1.535HisLys: 1.535 ± 0.681
2.558HisLeu: 2.558 ± 1.436
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.535HisPro: 1.535 ± 0.584
0.512HisGln: 0.512 ± 0.357
3.069HisArg: 3.069 ± 1.275
1.535HisSer: 1.535 ± 1.294
0.0HisThr: 0.0 ± 0.0
2.046HisVal: 2.046 ± 1.625
0.0HisTrp: 0.0 ± 0.0
0.512HisTyr: 0.512 ± 0.357
0.0HisXaa: 0.0 ± 0.0
Ile
1.023IleAla: 1.023 ± 0.552
0.512IleCys: 0.512 ± 0.357
3.581IleAsp: 3.581 ± 0.925
3.581IleGlu: 3.581 ± 1.084
1.023IlePhe: 1.023 ± 0.657
3.069IleGly: 3.069 ± 1.482
0.512IleHis: 0.512 ± 0.431
1.023IleIle: 1.023 ± 0.714
1.535IleLys: 1.535 ± 0.686
4.604IleLeu: 4.604 ± 1.464
2.046IleMet: 2.046 ± 1.129
1.023IleAsn: 1.023 ± 0.448
3.069IlePro: 3.069 ± 1.082
4.604IleGln: 4.604 ± 1.695
0.512IleArg: 0.512 ± 0.357
3.069IleSer: 3.069 ± 0.783
1.535IleThr: 1.535 ± 0.863
2.046IleVal: 2.046 ± 0.997
0.512IleTrp: 0.512 ± 0.63
1.535IleTyr: 1.535 ± 0.849
0.0IleXaa: 0.0 ± 0.0
Lys
3.069LysAla: 3.069 ± 1.765
2.046LysCys: 2.046 ± 1.313
2.558LysAsp: 2.558 ± 0.829
5.115LysGlu: 5.115 ± 1.94
2.558LysPhe: 2.558 ± 1.332
4.604LysGly: 4.604 ± 1.229
1.023LysHis: 1.023 ± 0.714
3.069LysIle: 3.069 ± 1.097
7.673LysLys: 7.673 ± 2.262
7.673LysLeu: 7.673 ± 2.627
1.023LysMet: 1.023 ± 0.665
4.092LysAsn: 4.092 ± 1.722
1.535LysPro: 1.535 ± 0.584
1.535LysGln: 1.535 ± 0.681
6.138LysArg: 6.138 ± 0.801
3.581LysSer: 3.581 ± 0.589
3.069LysThr: 3.069 ± 0.497
1.023LysVal: 1.023 ± 0.863
0.0LysTrp: 0.0 ± 0.0
3.581LysTyr: 3.581 ± 1.361
0.0LysXaa: 0.0 ± 0.0
Leu
7.161LeuAla: 7.161 ± 2.117
2.558LeuCys: 2.558 ± 1.096
6.65LeuAsp: 6.65 ± 1.231
8.696LeuGlu: 8.696 ± 1.668
6.138LeuPhe: 6.138 ± 1.474
5.115LeuGly: 5.115 ± 0.474
2.046LeuHis: 2.046 ± 0.723
2.046LeuIle: 2.046 ± 0.724
2.046LeuLys: 2.046 ± 0.719
17.391LeuLeu: 17.391 ± 3.133
2.046LeuMet: 2.046 ± 0.719
7.673LeuAsn: 7.673 ± 0.714
9.207LeuPro: 9.207 ± 1.397
5.627LeuGln: 5.627 ± 0.723
3.581LeuArg: 3.581 ± 0.76
8.184LeuSer: 8.184 ± 2.285
6.65LeuThr: 6.65 ± 1.486
1.535LeuVal: 1.535 ± 0.652
0.512LeuTrp: 0.512 ± 0.63
5.627LeuTyr: 5.627 ± 1.388
0.0LeuXaa: 0.0 ± 0.0
Met
1.535MetAla: 1.535 ± 0.681
0.512MetCys: 0.512 ± 0.63
1.023MetAsp: 1.023 ± 0.657
1.535MetGlu: 1.535 ± 0.804
0.0MetPhe: 0.0 ± 0.0
1.023MetGly: 1.023 ± 0.49
0.512MetHis: 0.512 ± 0.357
0.512MetIle: 0.512 ± 0.63
2.046MetLys: 2.046 ± 1.008
1.023MetLeu: 1.023 ± 0.863
0.0MetMet: 0.0 ± 0.0
1.023MetAsn: 1.023 ± 0.657
0.512MetPro: 0.512 ± 0.357
0.512MetGln: 0.512 ± 0.431
1.023MetArg: 1.023 ± 0.657
2.046MetSer: 2.046 ± 1.124
1.023MetThr: 1.023 ± 0.863
1.023MetVal: 1.023 ± 0.657
1.535MetTrp: 1.535 ± 0.584
0.512MetTyr: 0.512 ± 0.357
0.0MetXaa: 0.0 ± 0.0
Asn
3.069AsnAla: 3.069 ± 1.003
1.023AsnCys: 1.023 ± 0.657
1.023AsnAsp: 1.023 ± 0.714
3.069AsnGlu: 3.069 ± 1.167
1.535AsnPhe: 1.535 ± 1.025
1.023AsnGly: 1.023 ± 0.448
1.535AsnHis: 1.535 ± 0.681
2.558AsnIle: 2.558 ± 1.436
3.581AsnLys: 3.581 ± 1.256
3.581AsnLeu: 3.581 ± 1.068
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
2.558AsnPro: 2.558 ± 0.947
0.0AsnGln: 0.0 ± 0.0
5.115AsnArg: 5.115 ± 1.342
2.558AsnSer: 2.558 ± 1.13
2.046AsnThr: 2.046 ± 0.503
1.535AsnVal: 1.535 ± 0.804
0.0AsnTrp: 0.0 ± 0.0
1.535AsnTyr: 1.535 ± 0.804
0.0AsnXaa: 0.0 ± 0.0
Pro
2.046ProAla: 2.046 ± 1.479
1.023ProCys: 1.023 ± 0.647
6.65ProAsp: 6.65 ± 2.394
1.535ProGlu: 1.535 ± 0.686
1.023ProPhe: 1.023 ± 0.756
5.627ProGly: 5.627 ± 2.152
1.023ProHis: 1.023 ± 0.813
2.046ProIle: 2.046 ± 0.789
4.604ProLys: 4.604 ± 0.804
8.184ProLeu: 8.184 ± 2.772
2.046ProMet: 2.046 ± 0.789
1.535ProAsn: 1.535 ± 1.025
9.719ProPro: 9.719 ± 2.947
2.046ProGln: 2.046 ± 1.137
4.092ProArg: 4.092 ± 1.472
6.138ProSer: 6.138 ± 1.035
3.581ProThr: 3.581 ± 1.117
2.558ProVal: 2.558 ± 1.629
0.0ProTrp: 0.0 ± 0.0
1.535ProTyr: 1.535 ± 0.804
0.0ProXaa: 0.0 ± 0.0
Gln
2.046GlnAla: 2.046 ± 0.814
0.0GlnCys: 0.0 ± 0.0
2.046GlnAsp: 2.046 ± 1.428
3.581GlnGlu: 3.581 ± 1.036
1.535GlnPhe: 1.535 ± 0.686
1.535GlnGly: 1.535 ± 0.686
0.0GlnHis: 0.0 ± 0.0
3.069GlnIle: 3.069 ± 0.632
4.092GlnLys: 4.092 ± 1.495
4.092GlnLeu: 4.092 ± 1.006
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
2.558GlnPro: 2.558 ± 1.988
1.535GlnGln: 1.535 ± 0.686
3.581GlnArg: 3.581 ± 2.259
2.558GlnSer: 2.558 ± 0.915
1.023GlnThr: 1.023 ± 0.448
3.581GlnVal: 3.581 ± 1.361
0.512GlnTrp: 0.512 ± 0.357
1.535GlnTyr: 1.535 ± 0.749
0.0GlnXaa: 0.0 ± 0.0
Arg
1.535ArgAla: 1.535 ± 0.849
0.512ArgCys: 0.512 ± 0.63
2.046ArgAsp: 2.046 ± 0.503
2.558ArgGlu: 2.558 ± 0.853
2.558ArgPhe: 2.558 ± 0.915
4.604ArgGly: 4.604 ± 1.706
2.558ArgHis: 2.558 ± 0.853
3.069ArgIle: 3.069 ± 0.632
6.65ArgLys: 6.65 ± 2.104
3.581ArgLeu: 3.581 ± 1.989
1.023ArgMet: 1.023 ± 0.718
1.535ArgAsn: 1.535 ± 0.681
2.558ArgPro: 2.558 ± 1.503
2.046ArgGln: 2.046 ± 0.914
6.138ArgArg: 6.138 ± 4.246
6.138ArgSer: 6.138 ± 0.812
2.046ArgThr: 2.046 ± 0.896
2.558ArgVal: 2.558 ± 1.228
1.023ArgTrp: 1.023 ± 0.813
4.604ArgTyr: 4.604 ± 1.071
0.0ArgXaa: 0.0 ± 0.0
Ser
3.581SerAla: 3.581 ± 1.523
1.023SerCys: 1.023 ± 0.863
2.558SerAsp: 2.558 ± 1.102
4.604SerGlu: 4.604 ± 1.008
1.023SerPhe: 1.023 ± 0.49
4.604SerGly: 4.604 ± 1.229
2.558SerHis: 2.558 ± 1.457
2.046SerIle: 2.046 ± 0.944
2.558SerLys: 2.558 ± 0.974
8.184SerLeu: 8.184 ± 3.139
1.535SerMet: 1.535 ± 1.228
4.604SerAsn: 4.604 ± 2.119
3.069SerPro: 3.069 ± 0.872
4.092SerGln: 4.092 ± 1.208
6.65SerArg: 6.65 ± 0.542
2.046SerSer: 2.046 ± 0.719
3.581SerThr: 3.581 ± 0.925
4.092SerVal: 4.092 ± 0.659
1.535SerTrp: 1.535 ± 0.849
2.046SerTyr: 2.046 ± 0.769
0.0SerXaa: 0.0 ± 0.0
Thr
0.512ThrAla: 0.512 ± 0.357
1.535ThrCys: 1.535 ± 1.236
0.512ThrAsp: 0.512 ± 0.486
6.138ThrGlu: 6.138 ± 1.529
3.069ThrPhe: 3.069 ± 0.497
5.627ThrGly: 5.627 ± 2.528
1.023ThrHis: 1.023 ± 0.448
1.535ThrIle: 1.535 ± 1.294
1.023ThrLys: 1.023 ± 0.863
3.069ThrLeu: 3.069 ± 1.675
0.0ThrMet: 0.0 ± 0.0
1.023ThrAsn: 1.023 ± 0.448
5.627ThrPro: 5.627 ± 1.092
2.046ThrGln: 2.046 ± 1.006
2.558ThrArg: 2.558 ± 1.315
2.046ThrSer: 2.046 ± 0.896
4.092ThrThr: 4.092 ± 0.879
5.115ThrVal: 5.115 ± 0.893
1.535ThrTrp: 1.535 ± 1.025
2.558ThrTyr: 2.558 ± 0.446
0.0ThrXaa: 0.0 ± 0.0
Val
3.069ValAla: 3.069 ± 1.146
0.512ValCys: 0.512 ± 0.357
2.558ValAsp: 2.558 ± 0.847
4.604ValGlu: 4.604 ± 1.15
0.512ValPhe: 0.512 ± 0.357
6.65ValGly: 6.65 ± 3.317
0.512ValHis: 0.512 ± 0.431
2.046ValIle: 2.046 ± 0.724
1.535ValLys: 1.535 ± 0.686
3.581ValLeu: 3.581 ± 0.973
1.023ValMet: 1.023 ± 0.448
1.023ValAsn: 1.023 ± 0.657
3.069ValPro: 3.069 ± 0.872
4.604ValGln: 4.604 ± 1.121
4.092ValArg: 4.092 ± 1.198
6.138ValSer: 6.138 ± 2.347
4.604ValThr: 4.604 ± 0.79
2.046ValVal: 2.046 ± 1.21
2.046ValTrp: 2.046 ± 0.873
1.023ValTyr: 1.023 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
1.535TrpAla: 1.535 ± 1.196
0.0TrpCys: 0.0 ± 0.0
1.023TrpAsp: 1.023 ± 0.714
0.512TrpGlu: 0.512 ± 0.431
0.512TrpPhe: 0.512 ± 0.63
3.581TrpGly: 3.581 ± 1.875
0.512TrpHis: 0.512 ± 0.431
0.512TrpIle: 0.512 ± 0.63
2.046TrpLys: 2.046 ± 0.873
1.535TrpLeu: 1.535 ± 0.681
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.023TrpGln: 1.023 ± 0.813
1.535TrpArg: 1.535 ± 1.236
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.535TrpVal: 1.535 ± 0.681
0.512TrpTrp: 0.512 ± 0.357
1.023TrpTyr: 1.023 ± 0.813
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.069TyrAla: 3.069 ± 1.275
1.535TyrCys: 1.535 ± 1.236
1.023TyrAsp: 1.023 ± 0.714
3.069TyrGlu: 3.069 ± 1.003
2.046TyrPhe: 2.046 ± 0.896
4.092TyrGly: 4.092 ± 1.132
2.046TyrHis: 2.046 ± 0.719
1.023TyrIle: 1.023 ± 0.647
1.535TyrLys: 1.535 ± 0.804
5.115TyrLeu: 5.115 ± 1.774
0.0TyrMet: 0.0 ± 0.0
1.023TyrAsn: 1.023 ± 0.714
1.023TyrPro: 1.023 ± 0.863
0.512TyrGln: 0.512 ± 0.431
1.535TyrArg: 1.535 ± 0.849
2.046TyrSer: 2.046 ± 0.503
1.535TyrThr: 1.535 ± 0.647
1.535TyrVal: 1.535 ± 0.442
1.535TyrTrp: 1.535 ± 1.196
2.046TyrTyr: 2.046 ± 0.873
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1956 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski