Amino acid dipeptide frequency for Sorex araneus polyomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one should be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
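
As a minimal sketch of how such frequencies could be computed (not necessarily the procedure used by Proteome-pI), the Python snippet below counts overlapping dipeptides within each protein and reports them in per mille. The function names, the one-letter input sequences, and the choice not to count pairs across protein boundaries are assumptions made for illustration.

```python
from collections import Counter

# One-letter to three-letter codes; any other symbol is reported as Xaa.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        codes = [THREE_LETTER.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping windows of length 2, never spanning two proteins
        # (an assumption of this sketch).
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "overrepresented"
    if freq_permille < expected:
        return "underrepresented"
    return "as expected"


if __name__ == "__main__":
    toy = ["MALLK", "GGLL"]  # made-up sequences, not the viral proteome
    freqs = dipeptide_permille(toy)
    for dp, f in sorted(freqs.items(), key=lambda kv: -kv[1]):
        print(f"{dp}: {f:.1f} per mille ({classify(f)})")
```

With this threshold, a table value such as 6.807‰ would be labelled overrepresented and 0.567‰ underrepresented.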

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
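
As a worked example of the unit conversion, the AlaAla value from the table can be written as a fraction and as a percentage, and compared with the uniform expectation for 400 dipeptides. The symbols are chosen here for illustration only.

```latex
\begin{align*}
f_{\mathrm{AlaAla}} &= 6.807\,\text{\textperthousand} = 6.807 \times 10^{-3} = 0.006807 \approx 0.68\,\% \\
f_{\mathrm{uniform}} &= \frac{1}{400} = 0.0025 = 0.25\,\% = 2.5\,\text{\textperthousand} \\
\frac{f_{\mathrm{AlaAla}}}{f_{\mathrm{uniform}}} &= \frac{6.807}{2.5} \approx 2.7
\end{align*}
```

If Xaa is included, the uniform expectation becomes 1/441 ≈ 2.27‰.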

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.807 ± 2.577
AlaCys: 0.567 ± 0.344
AlaAsp: 4.538 ± 1.613
AlaGlu: 6.807 ± 2.149
AlaPhe: 2.269 ± 0.681
AlaGly: 3.403 ± 1.196
AlaHis: 1.134 ± 0.688
AlaIle: 3.971 ± 2.262
AlaLys: 6.239 ± 0.779
AlaLeu: 5.105 ± 2.06
AlaMet: 2.836 ± 0.858
AlaAsn: 0.567 ± 0.344
AlaPro: 1.702 ± 0.64
AlaGln: 2.269 ± 0.876
AlaArg: 5.672 ± 1.378
AlaSer: 2.269 ± 0.961
AlaThr: 2.836 ± 0.863
AlaVal: 4.538 ± 1.001
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.702 ± 0.731
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.702 ± 0.64
CysCys: 1.702 ± 0.705
CysAsp: 1.702 ± 0.64
CysGlu: 0.0 ± 0.0
CysPhe: 1.134 ± 1.312
CysGly: 1.134 ± 0.688
CysHis: 0.0 ± 0.0
CysIle: 1.134 ± 0.603
CysLys: 1.134 ± 0.508
CysLeu: 1.702 ± 0.64
CysMet: 0.567 ± 0.656
CysAsn: 2.269 ± 0.681
CysPro: 1.134 ± 0.508
CysGln: 1.702 ± 1.032
CysArg: 0.567 ± 0.656
CysSer: 2.269 ± 0.681
CysThr: 1.702 ± 0.731
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.702 ± 1.212
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.702 ± 0.681
AspCys: 0.567 ± 0.344
AspAsp: 2.269 ± 0.894
AspGlu: 3.403 ± 1.523
AspPhe: 3.971 ± 1.597
AspGly: 3.971 ± 0.603
AspHis: 0.567 ± 0.344
AspIle: 2.836 ± 1.061
AspLys: 3.403 ± 1.053
AspLeu: 3.971 ± 0.758
AspMet: 1.134 ± 1.169
AspAsn: 2.269 ± 0.894
AspPro: 4.538 ± 1.001
AspGln: 2.269 ± 0.876
AspArg: 1.134 ± 0.508
AspSer: 3.403 ± 1.053
AspThr: 1.134 ± 0.603
AspVal: 3.971 ± 1.474
AspTrp: 3.403 ± 1.661
AspTyr: 1.134 ± 0.508
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.538 ± 1.266
GluCys: 0.567 ± 0.344
GluAsp: 1.134 ± 0.508
GluGlu: 2.836 ± 0.495
GluPhe: 0.567 ± 0.585
GluGly: 3.971 ± 0.764
GluHis: 2.269 ± 1.088
GluIle: 1.702 ± 0.64
GluLys: 2.269 ± 1.377
GluLeu: 3.971 ± 1.339
GluMet: 2.269 ± 0.88
GluAsn: 3.971 ± 1.517
GluPro: 1.702 ± 0.64
GluGln: 3.403 ± 1.663
GluArg: 3.403 ± 0.456
GluSer: 5.105 ± 1.914
GluThr: 5.105 ± 1.302
GluVal: 4.538 ± 1.94
GluTrp: 1.702 ± 1.049
GluTyr: 0.567 ± 0.344
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.672 ± 1.752
PheCys: 0.567 ± 0.656
PheAsp: 1.134 ± 0.688
PheGlu: 3.971 ± 1.597
PhePhe: 0.567 ± 0.344
PheGly: 2.836 ± 1.551
PheHis: 0.567 ± 0.344
PheIle: 0.567 ± 0.344
PheLys: 3.971 ± 1.893
PheLeu: 5.105 ± 0.367
PheMet: 1.702 ± 0.804
PheAsn: 2.269 ± 0.989
PhePro: 1.702 ± 1.032
PheGln: 1.702 ± 0.731
PheArg: 2.269 ± 0.572
PheSer: 1.134 ± 0.508
PheThr: 3.971 ± 1.401
PheVal: 0.0 ± 0.0
PheTrp: 0.567 ± 0.344
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.105 ± 2.429
GlyCys: 0.567 ± 0.344
GlyAsp: 5.672 ± 1.246
GlyGlu: 2.836 ± 0.863
GlyPhe: 3.403 ± 1.854
GlyGly: 9.075 ± 0.632
GlyHis: 3.403 ± 1.385
GlyIle: 3.403 ± 0.885
GlyLys: 2.836 ± 0.817
GlyLeu: 5.672 ± 2.497
GlyMet: 0.567 ± 0.484
GlyAsn: 5.672 ± 0.763
GlyPro: 3.971 ± 1.266
GlyGln: 5.105 ± 1.082
GlyArg: 2.269 ± 0.876
GlySer: 5.672 ± 1.101
GlyThr: 1.702 ± 0.64
GlyVal: 4.538 ± 1.28
GlyTrp: 0.567 ± 0.585
GlyTyr: 1.134 ± 0.688
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.567 ± 0.344
HisAsp: 1.702 ± 0.64
HisGlu: 2.836 ± 1.087
HisPhe: 1.134 ± 0.873
HisGly: 0.567 ± 0.344
HisHis: 0.567 ± 0.344
HisIle: 1.702 ± 0.803
HisLys: 0.0 ± 0.0
HisLeu: 2.269 ± 0.681
HisMet: 2.269 ± 0.53
HisAsn: 0.567 ± 0.344
HisPro: 2.269 ± 1.263
HisGln: 1.702 ± 0.803
HisArg: 2.269 ± 0.97
HisSer: 0.567 ± 0.344
HisThr: 0.0 ± 0.0
HisVal: 1.702 ± 0.804
HisTrp: 0.0 ± 0.0
HisTyr: 2.269 ± 0.876
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.134 ± 0.688
IleCys: 1.134 ± 0.688
IleAsp: 3.971 ± 0.89
IleGlu: 6.239 ± 1.041
IlePhe: 2.836 ± 1.259
IleGly: 2.836 ± 1.241
IleHis: 0.567 ± 0.344
IleIle: 0.567 ± 0.585
IleLys: 1.702 ± 0.5
IleLeu: 9.075 ± 2.271
IleMet: 0.567 ± 0.344
IleAsn: 2.269 ± 1.377
IlePro: 2.836 ± 0.741
IleGln: 3.971 ± 1.351
IleArg: 0.567 ± 0.344
IleSer: 5.672 ± 1.873
IleThr: 3.971 ± 1.355
IleVal: 2.836 ± 1.194
IleTrp: 2.269 ± 1.745
IleTyr: 1.134 ± 0.603
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.538 ± 2.174
LysCys: 1.702 ± 1.032
LysAsp: 2.836 ± 0.817
LysGlu: 1.134 ± 0.508
LysPhe: 1.702 ± 0.731
LysGly: 3.971 ± 1.148
LysHis: 2.269 ± 0.97
LysIle: 3.971 ± 1.729
LysLys: 5.672 ± 1.229
LysLeu: 7.941 ± 0.968
LysMet: 2.836 ± 0.878
LysAsn: 2.836 ± 1.061
LysPro: 1.134 ± 0.603
LysGln: 0.567 ± 0.344
LysArg: 6.239 ± 0.924
LysSer: 2.269 ± 1.015
LysThr: 5.672 ± 2.206
LysVal: 1.134 ± 0.688
LysTrp: 0.0 ± 0.0
LysTyr: 2.269 ± 0.572
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.239 ± 3.162
LeuCys: 1.702 ± 0.705
LeuAsp: 7.374 ± 2.483
LeuGlu: 3.971 ± 0.89
LeuPhe: 5.105 ± 1.065
LeuGly: 5.672 ± 2.894
LeuHis: 4.538 ± 1.334
LeuIle: 8.508 ± 1.248
LeuLys: 3.971 ± 2.394
LeuLeu: 10.777 ± 2.506
LeuMet: 3.971 ± 0.599
LeuAsn: 3.403 ± 1.039
LeuPro: 5.672 ± 1.1
LeuGln: 5.105 ± 2.118
LeuArg: 3.971 ± 1.015
LeuSer: 4.538 ± 1.594
LeuThr: 5.105 ± 1.254
LeuVal: 2.836 ± 1.901
LeuTrp: 2.269 ± 0.894
LeuTyr: 2.269 ± 0.876
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.403 ± 0.456
MetCys: 0.0 ± 0.0
MetAsp: 2.836 ± 0.858
MetGlu: 0.567 ± 0.585
MetPhe: 1.702 ± 0.64
MetGly: 2.269 ± 0.977
MetHis: 0.0 ± 0.0
MetIle: 6.239 ± 2.982
MetLys: 2.836 ± 1.259
MetLeu: 2.269 ± 1.088
MetMet: 1.702 ± 0.752
MetAsn: 2.269 ± 0.572
MetPro: 1.134 ± 1.169
MetGln: 0.0 ± 0.0
MetArg: 2.269 ± 0.894
MetSer: 1.134 ± 0.676
MetThr: 1.702 ± 1.039
MetVal: 2.836 ± 1.48
MetTrp: 1.702 ± 0.64
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.134 ± 0.688
AsnCys: 1.702 ± 0.731
AsnAsp: 2.269 ± 0.894
AsnGlu: 3.971 ± 1.343
AsnPhe: 1.134 ± 0.873
AsnGly: 2.269 ± 1.015
AsnHis: 0.0 ± 0.0
AsnIle: 4.538 ± 2.222
AsnLys: 5.672 ± 2.29
AsnLeu: 3.403 ± 1.809
AsnMet: 2.836 ± 0.863
AsnAsn: 1.134 ± 0.688
AsnPro: 2.836 ± 1.103
AsnGln: 0.0 ± 0.0
AsnArg: 0.567 ± 0.484
AsnSer: 2.269 ± 0.97
AsnThr: 3.403 ± 2.861
AsnVal: 5.672 ± 1.378
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.134 ± 0.508
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.403 ± 0.953
ProCys: 2.269 ± 1.377
ProAsp: 5.672 ± 0.84
ProGlu: 2.269 ± 0.572
ProPhe: 1.702 ± 1.032
ProGly: 3.971 ± 1.662
ProHis: 0.567 ± 0.344
ProIle: 1.134 ± 0.508
ProLys: 5.105 ± 1.526
ProLeu: 4.538 ± 1.668
ProMet: 2.269 ± 1.034
ProAsn: 0.0 ± 0.0
ProPro: 7.941 ± 1.757
ProGln: 0.567 ± 0.585
ProArg: 2.269 ± 1.609
ProSer: 4.538 ± 1.487
ProThr: 0.567 ± 0.344
ProVal: 3.971 ± 2.171
ProTrp: 0.0 ± 0.0
ProTyr: 3.403 ± 0.638
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.269 ± 0.876
GlnCys: 2.836 ± 1.174
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.702 ± 0.803
GlnPhe: 1.134 ± 0.688
GlnGly: 2.269 ± 0.572
GlnHis: 2.269 ± 1.31
GlnIle: 2.836 ± 0.495
GlnLys: 3.403 ± 1.607
GlnLeu: 6.239 ± 1.552
GlnMet: 2.269 ± 0.572
GlnAsn: 0.567 ± 0.344
GlnPro: 2.269 ± 1.101
GlnGln: 4.538 ± 1.214
GlnArg: 1.134 ± 0.508
GlnSer: 1.134 ± 0.508
GlnThr: 1.702 ± 0.681
GlnVal: 3.403 ± 0.638
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.567 ± 0.585
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.836 ± 0.689
ArgCys: 1.134 ± 1.312
ArgAsp: 1.702 ± 1.032
ArgGlu: 2.836 ± 0.817
ArgPhe: 2.269 ± 0.894
ArgGly: 3.403 ± 0.633
ArgHis: 2.269 ± 0.572
ArgIle: 2.269 ± 0.894
ArgLys: 2.836 ± 0.831
ArgLeu: 3.403 ± 1.251
ArgMet: 2.269 ± 1.101
ArgAsn: 4.538 ± 1.158
ArgPro: 1.702 ± 0.803
ArgGln: 2.836 ± 1.347
ArgArg: 6.807 ± 3.322
ArgSer: 0.0 ± 0.0
ArgThr: 2.269 ± 0.572
ArgVal: 3.971 ± 1.266
ArgTrp: 1.134 ± 0.873
ArgTyr: 1.702 ± 1.336
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.672 ± 1.313
SerCys: 1.134 ± 1.169
SerAsp: 1.702 ± 1.032
SerGlu: 2.269 ± 1.089
SerPhe: 2.836 ± 1.721
SerGly: 7.374 ± 1.044
SerHis: 0.567 ± 0.585
SerIle: 2.836 ± 1.174
SerLys: 1.702 ± 0.64
SerLeu: 5.105 ± 1.335
SerMet: 1.134 ± 0.969
SerAsn: 1.702 ± 0.64
SerPro: 3.971 ± 1.266
SerGln: 2.836 ± 0.538
SerArg: 3.971 ± 1.266
SerSer: 6.807 ± 0.598
SerThr: 4.538 ± 1.665
SerVal: 2.836 ± 1.528
SerTrp: 0.0 ± 0.0
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.538 ± 2.185
ThrCys: 2.269 ± 1.868
ThrAsp: 3.971 ± 1.385
ThrGlu: 2.836 ± 0.817
ThrPhe: 3.403 ± 1.182
ThrGly: 2.269 ± 0.53
ThrHis: 1.702 ± 0.64
ThrIle: 2.269 ± 0.894
ThrLys: 0.0 ± 0.0
ThrLeu: 6.239 ± 1.156
ThrMet: 1.134 ± 0.688
ThrAsn: 2.836 ± 1.822
ThrPro: 4.538 ± 1.143
ThrGln: 1.134 ± 0.508
ThrArg: 2.269 ± 1.015
ThrSer: 5.672 ± 1.441
ThrThr: 3.971 ± 1.119
ThrVal: 4.538 ± 1.339
ThrTrp: 1.702 ± 1.336
ThrTyr: 1.134 ± 0.875
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.538 ± 2.018
ValCys: 0.0 ± 0.0
ValAsp: 0.567 ± 0.344
ValGlu: 3.403 ± 1.277
ValPhe: 0.567 ± 0.344
ValGly: 3.971 ± 1.863
ValHis: 1.134 ± 0.508
ValIle: 5.105 ± 1.457
ValLys: 4.538 ± 1.339
ValLeu: 7.941 ± 2.017
ValMet: 1.702 ± 0.64
ValAsn: 3.971 ± 0.831
ValPro: 3.403 ± 1.541
ValGln: 1.702 ± 1.167
ValArg: 1.134 ± 1.169
ValSer: 2.269 ± 0.977
ValThr: 6.807 ± 1.321
ValVal: 3.403 ± 0.456
ValTrp: 0.567 ± 0.656
ValTyr: 1.134 ± 0.508
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.702 ± 0.803
TrpPhe: 1.702 ± 0.705
TrpGly: 3.971 ± 2.648
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.567 ± 0.656
TrpMet: 1.134 ± 0.873
TrpAsn: 1.134 ± 0.603
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.702 ± 1.049
TrpSer: 0.567 ± 0.585
TrpThr: 0.567 ± 0.344
TrpVal: 1.134 ± 0.873
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.702 ± 0.64
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 2.269 ± 1.206
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.567 ± 0.585
TyrPhe: 1.134 ± 1.169
TyrGly: 3.971 ± 1.351
TyrHis: 0.567 ± 0.585
TyrIle: 0.567 ± 0.344
TyrLys: 3.403 ± 1.57
TyrLeu: 1.702 ± 1.032
TyrMet: 1.134 ± 0.508
TyrAsn: 1.702 ± 0.731
TyrPro: 1.702 ± 1.754
TyrGln: 1.134 ± 0.688
TyrArg: 1.702 ± 1.049
TyrSer: 1.702 ± 0.64
TyrThr: 1.702 ± 1.049
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.836 ± 1.241
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1764 amino acids)
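
To work with these values programmatically, the entries can be parsed from the text. The sketch below assumes that each entry contains a fragment of the form `LeuLeu: 10.777 ± 2.506`, as in the table above; the function name `parse_entries` is hypothetical, and the three sample lines are copied from the table.

```python
import re

# Matches e.g. "LeuLeu: 10.777 ± 2.506"; text before the dipeptide name is ignored.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")


def parse_entries(text):
    """Return {(first, second): (frequency_permille, error_permille)}."""
    table = {}
    for line in text.splitlines():
        m = ENTRY.search(line)
        if m:
            first, second, freq, err = m.groups()
            table[(first, second)] = (float(freq), float(err))
    return table


sample = """LeuLeu: 10.777 ± 2.506
GlyGly: 9.075 ± 0.632
AlaTrp: 0.0 ± 0.0"""

table = parse_entries(sample)
# Rank dipeptides from most to least frequent.
ranked = sorted(table.items(), key=lambda kv: kv[1][0], reverse=True)
for (first, second), (freq, err) in ranked:
    print(f"{first}{second}: {freq} ± {err}")
```

Keying the dictionary by the (first, second) residue pair keeps the row and column of the original table recoverable.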

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
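
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error estimate. This is only an illustration under assumptions; the use of the standard deviation as the error, the fixed random seed, and the helper names are not taken from Proteome-pI.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (per mille frequency of `dipeptide`, bootstrap error)."""
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]

    def permille(samples):
        total = sum(sum(c.values()) for c in samples)
        hits = sum(c[dipeptide] for c in samples)
        return 1000.0 * hits / total if total else 0.0

    estimate = permille(per_protein)
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        resample = rng.choices(per_protein, k=len(per_protein))
        replicates.append(permille(resample))
    # Assumption: the error is the standard deviation over the replicates.
    return estimate, statistics.pstdev(replicates)


toy = ["MALLKLL", "GGLLA", "MKKAL"]  # made-up sequences for illustration
freq, err = bootstrap_error(toy, "LL")
print(f"LL: {freq:.3f} ± {err:.3f} per mille")
```

With only a handful of proteins, as here, each resample can differ substantially from the full set, which is one reason the reported errors can be large relative to the frequencies.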

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski