Amino acid dipeptide frequency for Pan troglodytes verus polyomavirus 8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
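As a minimal sketch of the calculation described above (not the Proteome-pI implementation), the Python below counts overlapping adjacent residue pairs, expresses each as per mille of all counted pairs, and compares the result with the uniform expectation of 1/400 = 2.5‰. The function names, the toy sequences, the mapping of any non-standard letter to X, and the choice of denominator (total number of counted pairs) are assumptions made for illustration; the database may normalise differently.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2  # uniform expectation: 2.5 per mille


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: any letter outside the 20 standard ones is treated as X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Each sequence is scanned separately, so pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be shown in red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be shown in blue
    return "expected"


# Toy input, not real polyomavirus sequences.
freqs = dipeptide_per_mille(["MKKLLA", "LLKK"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With the values from the table, `classify(12.395)` (LeuLeu) falls on the overrepresented side and `classify(0.496)` (AlaCys) on the underrepresented side of the 2.5‰ threshold.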

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain plain fractions.
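The unit conversion can be illustrated with a short Python sketch. The helper names are hypothetical; the only input is the LeuLeu value taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


# LeuLeu value from the table: 12.395 per mille.
leu_leu = 12.395
print(per_mille_to_fraction(leu_leu))  # fraction of all dipeptides
print(per_mille_to_percent(leu_leu))   # the same value in percent
```

In these units the uniform expectation of 0.25% corresponds to 2.5‰, which is the value to compare the table entries against.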

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.949AlaAla: 5.949 ± 4.683
0.496AlaCys: 0.496 ± 0.342
1.983AlaAsp: 1.983 ± 0.754
1.983AlaGlu: 1.983 ± 0.911
2.479AlaPhe: 2.479 ± 1.178
2.479AlaGly: 2.479 ± 0.891
2.479AlaHis: 2.479 ± 1.092
4.462AlaIle: 4.462 ± 1.605
1.487AlaLys: 1.487 ± 0.535
4.958AlaLeu: 4.958 ± 2.186
1.487AlaMet: 1.487 ± 0.875
2.975AlaAsn: 2.975 ± 1.07
4.958AlaPro: 4.958 ± 2.386
2.975AlaGln: 2.975 ± 0.908
2.975AlaArg: 2.975 ± 0.648
2.975AlaSer: 2.975 ± 0.954
4.958AlaThr: 4.958 ± 1.398
3.471AlaVal: 3.471 ± 1.262
1.983AlaTrp: 1.983 ± 0.928
3.471AlaTyr: 3.471 ± 0.855
0.0AlaXaa: 0.0 ± 0.0
Cys
0.496CysAla: 0.496 ± 0.556
0.0CysCys: 0.0 ± 0.0
1.487CysAsp: 1.487 ± 0.536
0.496CysGlu: 0.496 ± 0.342
2.975CysPhe: 2.975 ± 1.123
1.487CysGly: 1.487 ± 0.803
0.992CysHis: 0.992 ± 0.642
0.496CysIle: 0.496 ± 0.342
3.966CysLys: 3.966 ± 1.385
0.992CysLeu: 0.992 ± 0.549
0.992CysMet: 0.992 ± 0.549
0.0CysAsn: 0.0 ± 0.0
0.992CysPro: 0.992 ± 0.612
1.487CysGln: 1.487 ± 1.027
0.496CysArg: 0.496 ± 0.623
0.0CysSer: 0.0 ± 0.0
0.496CysThr: 0.496 ± 0.342
0.496CysVal: 0.496 ± 0.342
0.496CysTrp: 0.496 ± 0.415
0.992CysTyr: 0.992 ± 0.612
0.0CysXaa: 0.0 ± 0.0
Asp
1.983AspAla: 1.983 ± 0.937
0.992AspCys: 0.992 ± 1.111
2.479AspAsp: 2.479 ± 1.178
4.462AspGlu: 4.462 ± 1.925
5.454AspPhe: 5.454 ± 2.098
2.975AspGly: 2.975 ± 0.935
1.487AspHis: 1.487 ± 0.489
2.975AspIle: 2.975 ± 1.095
5.949AspLys: 5.949 ± 2.644
7.437AspLeu: 7.437 ± 1.072
1.487AspMet: 1.487 ± 0.656
0.992AspAsn: 0.992 ± 0.831
3.966AspPro: 3.966 ± 1.386
0.992AspGln: 0.992 ± 0.685
0.496AspArg: 0.496 ± 0.415
6.445AspSer: 6.445 ± 1.906
1.983AspThr: 1.983 ± 0.996
2.479AspVal: 2.479 ± 1.092
0.496AspTrp: 0.496 ± 0.488
1.487AspTyr: 1.487 ± 0.535
0.0AspXaa: 0.0 ± 0.0
Glu
4.462GluAla: 4.462 ± 1.571
1.983GluCys: 1.983 ± 1.194
6.941GluAsp: 6.941 ± 1.339
9.42GluGlu: 9.42 ± 2.447
3.471GluPhe: 3.471 ± 1.016
1.487GluGly: 1.487 ± 1.058
0.992GluHis: 0.992 ± 0.457
3.471GluIle: 3.471 ± 1.052
5.454GluLys: 5.454 ± 1.713
3.471GluLeu: 3.471 ± 1.636
0.992GluMet: 0.992 ± 0.457
0.992GluAsn: 0.992 ± 0.831
1.487GluPro: 1.487 ± 0.692
2.975GluGln: 2.975 ± 0.97
5.949GluArg: 5.949 ± 1.449
4.958GluSer: 4.958 ± 2.053
3.966GluThr: 3.966 ± 1.033
4.958GluVal: 4.958 ± 1.773
1.983GluTrp: 1.983 ± 0.928
2.975GluTyr: 2.975 ± 1.074
0.0GluXaa: 0.0 ± 0.0
Phe
3.471PheAla: 3.471 ± 1.963
1.983PheCys: 1.983 ± 0.979
0.992PheAsp: 0.992 ± 0.612
1.487PheGlu: 1.487 ± 1.027
2.479PhePhe: 2.479 ± 0.828
3.966PheGly: 3.966 ± 1.033
2.479PheHis: 2.479 ± 0.832
2.479PheIle: 2.479 ± 1.244
2.479PheLys: 2.479 ± 1.312
4.958PheLeu: 4.958 ± 1.53
0.0PheMet: 0.0 ± 0.0
3.471PheAsn: 3.471 ± 0.98
2.479PhePro: 2.479 ± 0.631
0.496PheGln: 0.496 ± 0.342
3.471PheArg: 3.471 ± 1.358
4.958PheSer: 4.958 ± 1.194
0.496PheThr: 0.496 ± 0.415
1.487PheVal: 1.487 ± 0.535
0.992PheTrp: 0.992 ± 0.682
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.949GlyAla: 5.949 ± 1.341
0.0GlyCys: 0.0 ± 0.0
2.975GlyAsp: 2.975 ± 0.775
4.462GlyGlu: 4.462 ± 1.275
1.983GlyPhe: 1.983 ± 0.666
5.454GlyGly: 5.454 ± 0.75
2.479GlyHis: 2.479 ± 0.794
1.487GlyIle: 1.487 ± 0.696
2.975GlyLys: 2.975 ± 1.303
6.941GlyLeu: 6.941 ± 1.092
1.487GlyMet: 1.487 ± 0.489
2.479GlyAsn: 2.479 ± 0.832
2.975GlyPro: 2.975 ± 1.095
0.992GlyGln: 0.992 ± 0.831
0.0GlyArg: 0.0 ± 0.0
3.471GlySer: 3.471 ± 0.936
5.949GlyThr: 5.949 ± 1.539
5.454GlyVal: 5.454 ± 1.694
0.0GlyTrp: 0.0 ± 0.0
1.487GlyTyr: 1.487 ± 0.839
0.0GlyXaa: 0.0 ± 0.0
His
0.496HisAla: 0.496 ± 0.342
0.992HisCys: 0.992 ± 0.549
0.992HisAsp: 0.992 ± 0.457
1.983HisGlu: 1.983 ± 1.042
1.487HisPhe: 1.487 ± 0.692
1.487HisGly: 1.487 ± 1.027
0.0HisHis: 0.0 ± 0.0
0.992HisIle: 0.992 ± 0.457
0.992HisLys: 0.992 ± 0.561
0.496HisLeu: 0.496 ± 0.342
0.496HisMet: 0.496 ± 0.342
0.496HisAsn: 0.496 ± 0.342
1.487HisPro: 1.487 ± 0.875
1.983HisGln: 1.983 ± 0.824
1.487HisArg: 1.487 ± 0.728
4.958HisSer: 4.958 ± 1.527
0.496HisThr: 0.496 ± 0.415
1.983HisVal: 1.983 ± 0.488
1.487HisTrp: 1.487 ± 0.878
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.966IleAla: 3.966 ± 1.757
2.479IleCys: 2.479 ± 1.312
3.471IleAsp: 3.471 ± 0.497
2.479IleGlu: 2.479 ± 0.879
1.983IlePhe: 1.983 ± 1.183
1.983IleGly: 1.983 ± 0.875
0.496IleHis: 0.496 ± 0.415
0.992IleIle: 0.992 ± 0.549
0.0IleLys: 0.0 ± 0.0
3.966IleLeu: 3.966 ± 1.074
0.496IleMet: 0.496 ± 0.342
1.983IleAsn: 1.983 ± 0.992
2.975IlePro: 2.975 ± 0.459
6.445IleGln: 6.445 ± 2.508
1.487IleArg: 1.487 ± 0.707
2.975IleSer: 2.975 ± 1.095
4.958IleThr: 4.958 ± 1.919
1.983IleVal: 1.983 ± 0.584
0.496IleTrp: 0.496 ± 0.342
1.487IleTyr: 1.487 ± 0.535
0.0IleXaa: 0.0 ± 0.0
Lys
4.958LysAla: 4.958 ± 0.745
2.975LysCys: 2.975 ± 1.325
1.983LysAsp: 1.983 ± 0.992
3.966LysGlu: 3.966 ± 1.741
0.496LysPhe: 0.496 ± 0.342
4.958LysGly: 4.958 ± 0.943
1.487LysHis: 1.487 ± 0.728
2.479LysIle: 2.479 ± 0.832
11.899LysLys: 11.899 ± 2.625
3.471LysLeu: 3.471 ± 1.15
5.454LysMet: 5.454 ± 1.757
1.983LysAsn: 1.983 ± 1.194
2.479LysPro: 2.479 ± 0.826
0.992LysGln: 0.992 ± 0.591
8.924LysArg: 8.924 ± 1.881
1.487LysSer: 1.487 ± 0.692
4.462LysThr: 4.462 ± 1.177
5.949LysVal: 5.949 ± 0.798
0.992LysTrp: 0.992 ± 0.642
1.983LysTyr: 1.983 ± 0.658
0.0LysXaa: 0.0 ± 0.0
Leu
3.966LeuAla: 3.966 ± 1.738
2.479LeuCys: 2.479 ± 1.003
6.941LeuAsp: 6.941 ± 1.292
7.437LeuGlu: 7.437 ± 1.032
5.949LeuPhe: 5.949 ± 1.22
3.966LeuGly: 3.966 ± 0.85
2.975LeuHis: 2.975 ± 0.991
3.471LeuIle: 3.471 ± 1.231
3.471LeuLys: 3.471 ± 2.293
12.395LeuLeu: 12.395 ± 1.564
2.975LeuMet: 2.975 ± 1.222
5.949LeuAsn: 5.949 ± 0.906
5.454LeuPro: 5.454 ± 0.75
2.479LeuGln: 2.479 ± 0.461
2.479LeuArg: 2.479 ± 0.785
3.966LeuSer: 3.966 ± 0.897
2.975LeuThr: 2.975 ± 0.955
1.983LeuVal: 1.983 ± 0.691
0.0LeuTrp: 0.0 ± 0.0
4.958LeuTyr: 4.958 ± 0.558
0.0LeuXaa: 0.0 ± 0.0
Met
1.983MetAla: 1.983 ± 0.488
0.0MetCys: 0.0 ± 0.0
3.966MetAsp: 3.966 ± 1.856
4.462MetGlu: 4.462 ± 1.714
0.992MetPhe: 0.992 ± 0.642
1.487MetGly: 1.487 ± 0.489
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
2.479MetLys: 2.479 ± 0.832
2.975MetLeu: 2.975 ± 0.71
0.496MetMet: 0.496 ± 0.415
2.479MetAsn: 2.479 ± 0.832
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.983MetArg: 1.983 ± 0.824
0.0MetSer: 0.0 ± 0.0
0.496MetThr: 0.496 ± 0.342
1.983MetVal: 1.983 ± 0.727
0.496MetTrp: 0.496 ± 0.415
1.983MetTyr: 1.983 ± 0.658
0.0MetXaa: 0.0 ± 0.0
Asn
3.471AsnAla: 3.471 ± 0.936
0.496AsnCys: 0.496 ± 0.342
0.496AsnAsp: 0.496 ± 0.415
2.975AsnGlu: 2.975 ± 1.069
0.992AsnPhe: 0.992 ± 0.457
0.496AsnGly: 0.496 ± 0.415
0.496AsnHis: 0.496 ± 0.342
3.966AsnIle: 3.966 ± 0.888
1.983AsnLys: 1.983 ± 0.584
4.958AsnLeu: 4.958 ± 0.586
0.0AsnMet: 0.0 ± 0.637
2.975AsnAsn: 2.975 ± 0.648
2.975AsnPro: 2.975 ± 0.654
1.487AsnGln: 1.487 ± 0.535
3.471AsnArg: 3.471 ± 1.227
4.958AsnSer: 4.958 ± 1.397
2.975AsnThr: 2.975 ± 1.24
0.992AsnVal: 0.992 ± 0.457
0.0AsnTrp: 0.0 ± 0.0
1.487AsnTyr: 1.487 ± 0.707
0.0AsnXaa: 0.0 ± 0.0
Pro
3.966ProAla: 3.966 ± 1.16
0.992ProCys: 0.992 ± 0.612
5.454ProAsp: 5.454 ± 2.226
3.471ProGlu: 3.471 ± 0.745
0.0ProPhe: 0.0 ± 0.0
4.462ProGly: 4.462 ± 1.836
0.496ProHis: 0.496 ± 0.342
2.479ProIle: 2.479 ± 0.732
6.445ProLys: 6.445 ± 2.596
3.471ProLeu: 3.471 ± 1.48
0.992ProMet: 0.992 ± 0.457
1.487ProAsn: 1.487 ± 0.707
3.471ProPro: 3.471 ± 1.194
1.487ProGln: 1.487 ± 1.058
0.992ProArg: 0.992 ± 0.682
4.462ProSer: 4.462 ± 1.178
1.983ProThr: 1.983 ± 0.891
2.975ProVal: 2.975 ± 1.085
0.0ProTrp: 0.0 ± 0.0
1.983ProTyr: 1.983 ± 0.658
0.0ProXaa: 0.0 ± 0.0
Gln
4.462GlnAla: 4.462 ± 1.098
0.992GlnCys: 0.992 ± 0.685
2.479GlnAsp: 2.479 ± 0.916
2.479GlnGlu: 2.479 ± 1.471
0.0GlnPhe: 0.0 ± 0.0
2.975GlnGly: 2.975 ± 1.606
0.992GlnHis: 0.992 ± 0.642
3.471GlnIle: 3.471 ± 0.904
3.471GlnLys: 3.471 ± 0.781
2.479GlnLeu: 2.479 ± 0.727
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
2.975GlnPro: 2.975 ± 0.585
1.487GlnGln: 1.487 ± 0.489
2.975GlnArg: 2.975 ± 1.488
2.479GlnSer: 2.479 ± 0.461
1.487GlnThr: 1.487 ± 0.966
2.975GlnVal: 2.975 ± 0.942
1.487GlnTrp: 1.487 ± 0.707
1.983GlnTyr: 1.983 ± 0.584
0.0GlnXaa: 0.0 ± 0.0
Arg
0.496ArgAla: 0.496 ± 0.623
0.0ArgCys: 0.0 ± 0.0
3.471ArgAsp: 3.471 ± 1.535
5.454ArgGlu: 5.454 ± 1.071
3.471ArgPhe: 3.471 ± 1.412
3.471ArgGly: 3.471 ± 1.584
1.983ArgHis: 1.983 ± 0.584
1.983ArgIle: 1.983 ± 0.777
4.462ArgLys: 4.462 ± 1.62
1.983ArgLeu: 1.983 ± 0.584
1.983ArgMet: 1.983 ± 0.928
1.487ArgAsn: 1.487 ± 0.948
1.487ArgPro: 1.487 ± 0.535
2.479ArgGln: 2.479 ± 1.244
4.958ArgArg: 4.958 ± 2.271
5.454ArgSer: 5.454 ± 2.032
4.462ArgThr: 4.462 ± 1.173
2.479ArgVal: 2.479 ± 0.461
0.0ArgTrp: 0.0 ± 0.0
2.479ArgTyr: 2.479 ± 1.598
0.0ArgXaa: 0.0 ± 0.0
Ser
1.487SerAla: 1.487 ± 0.692
0.992SerCys: 0.992 ± 0.549
4.462SerAsp: 4.462 ± 1.198
3.471SerGlu: 3.471 ± 0.967
3.966SerPhe: 3.966 ± 0.885
3.966SerGly: 3.966 ± 1.241
0.496SerHis: 0.496 ± 0.342
4.958SerIle: 4.958 ± 1.445
1.983SerLys: 1.983 ± 0.584
4.462SerLeu: 4.462 ± 1.228
4.462SerMet: 4.462 ± 0.91
1.983SerAsn: 1.983 ± 0.488
1.983SerPro: 1.983 ± 1.165
6.941SerGln: 6.941 ± 1.03
4.958SerArg: 4.958 ± 0.906
7.933SerSer: 7.933 ± 1.777
3.966SerThr: 3.966 ± 0.868
5.454SerVal: 5.454 ± 1.407
3.471SerTrp: 3.471 ± 1.771
0.992SerTyr: 0.992 ± 0.682
0.0SerXaa: 0.0 ± 0.0
Thr
3.471ThrAla: 3.471 ± 0.595
0.992ThrCys: 0.992 ± 0.831
0.992ThrAsp: 0.992 ± 0.78
5.949ThrGlu: 5.949 ± 0.778
1.983ThrPhe: 1.983 ± 1.369
4.462ThrGly: 4.462 ± 1.902
0.992ThrHis: 0.992 ± 0.682
3.471ThrIle: 3.471 ± 1.849
4.958ThrLys: 4.958 ± 0.884
4.462ThrLeu: 4.462 ± 1.451
0.992ThrMet: 0.992 ± 0.549
0.992ThrAsn: 0.992 ± 0.831
4.958ThrPro: 4.958 ± 0.586
3.966ThrGln: 3.966 ± 1.15
0.992ThrArg: 0.992 ± 0.457
1.983ThrSer: 1.983 ± 0.864
3.471ThrThr: 3.471 ± 0.936
4.462ThrVal: 4.462 ± 2.092
1.487ThrTrp: 1.487 ± 0.93
2.975ThrTyr: 2.975 ± 0.783
0.0ThrXaa: 0.0 ± 0.0
Val
4.462ValAla: 4.462 ± 2.184
0.0ValCys: 0.0 ± 0.0
1.983ValAsp: 1.983 ± 0.777
3.471ValGlu: 3.471 ± 1.487
2.479ValPhe: 2.479 ± 1.312
1.983ValGly: 1.983 ± 1.129
1.983ValHis: 1.983 ± 0.835
2.479ValIle: 2.479 ± 0.828
4.958ValLys: 4.958 ± 1.637
5.949ValLeu: 5.949 ± 1.756
0.496ValMet: 0.496 ± 0.415
5.454ValAsn: 5.454 ± 2.224
1.983ValPro: 1.983 ± 1.194
1.487ValGln: 1.487 ± 0.858
2.975ValArg: 2.975 ± 1.095
4.958ValSer: 4.958 ± 1.465
4.462ValThr: 4.462 ± 1.033
1.487ValVal: 1.487 ± 0.535
0.496ValTrp: 0.496 ± 0.556
0.496ValTyr: 0.496 ± 0.342
0.0ValXaa: 0.0 ± 0.0
Trp
0.496TrpAla: 0.496 ± 0.556
0.0TrpCys: 0.0 ± 0.0
1.487TrpAsp: 1.487 ± 0.579
1.487TrpGlu: 1.487 ± 0.656
0.992TrpPhe: 0.992 ± 0.549
2.479TrpGly: 2.479 ± 0.916
0.992TrpHis: 0.992 ± 0.682
0.496TrpIle: 0.496 ± 0.556
0.992TrpLys: 0.992 ± 0.685
0.992TrpLeu: 0.992 ± 0.685
0.992TrpMet: 0.992 ± 0.682
2.479TrpAsn: 2.479 ± 1.431
0.496TrpPro: 0.496 ± 0.556
0.0TrpGln: 0.0 ± 0.0
0.496TrpArg: 0.496 ± 0.415
0.0TrpSer: 0.0 ± 0.0
1.487TrpThr: 1.487 ± 0.93
0.496TrpVal: 0.496 ± 0.415
1.487TrpTrp: 1.487 ± 0.875
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.992TyrAla: 0.992 ± 0.494
0.992TyrCys: 0.992 ± 1.111
1.983TyrAsp: 1.983 ± 0.584
0.992TyrGlu: 0.992 ± 0.682
0.992TyrPhe: 0.992 ± 0.831
2.975TyrGly: 2.975 ± 0.77
0.496TyrHis: 0.496 ± 0.342
0.496TyrIle: 0.496 ± 0.415
2.975TyrLys: 2.975 ± 0.799
5.454TyrLeu: 5.454 ± 1.12
1.487TyrMet: 1.487 ± 0.707
1.487TyrAsn: 1.487 ± 0.535
1.487TyrPro: 1.487 ± 1.246
0.496TyrGln: 0.496 ± 0.488
2.479TyrArg: 2.479 ± 1.09
3.966TyrSer: 3.966 ± 0.957
2.479TyrThr: 2.479 ± 0.461
0.496TyrVal: 0.496 ± 0.415
0.496TyrTrp: 0.496 ± 0.342
1.983TyrTyr: 1.983 ± 1.365
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2018 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
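One way such a protein-level bootstrap could work is sketched below in Python. This is an assumption-laden illustration, not the database's code: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread is reported as a population standard deviation. The function names, the toy proteome, the fixed seed, and the use of the standard deviation as the error measure are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size fixed.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.pstdev(estimates)


# Toy proteome, not real polyomavirus sequences.
proteome = ["MKKLLAKK", "LLKKAA", "MALLKR", "AKKLLK"]
print(pair_per_mille(proteome, "KK"), bootstrap_error(proteome, "KK"))
```

Resampling whole proteins rather than individual residues keeps each dipeptide within its original sequence context, which is why a proteome with few proteins (7 here) can show relatively large errors.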

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski