Amino acid dipepetide frequency for Macaca mulatta papillomavirus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.419AlaAla: 5.419 ± 1.87
0.834AlaCys: 0.834 ± 0.855
1.667AlaAsp: 1.667 ± 0.632
4.585AlaGlu: 4.585 ± 2.36
3.335AlaPhe: 3.335 ± 0.594
4.168AlaGly: 4.168 ± 0.697
0.417AlaHis: 0.417 ± 0.367
3.752AlaIle: 3.752 ± 0.955
3.752AlaLys: 3.752 ± 1.357
6.253AlaLeu: 6.253 ± 2.009
0.0AlaMet: 0.0 ± 0.0
2.084AlaAsn: 2.084 ± 0.919
2.918AlaPro: 2.918 ± 1.008
2.501AlaGln: 2.501 ± 0.46
2.918AlaArg: 2.918 ± 0.684
1.667AlaSer: 1.667 ± 0.568
3.752AlaThr: 3.752 ± 1.203
3.335AlaVal: 3.335 ± 1.071
0.834AlaTrp: 0.834 ± 0.428
0.834AlaTyr: 0.834 ± 0.365
0.0AlaXaa: 0.0 ± 0.0
Cys
0.834CysAla: 0.834 ± 0.415
1.251CysCys: 1.251 ± 0.728
0.834CysAsp: 0.834 ± 0.574
1.251CysGlu: 1.251 ± 1.141
1.251CysPhe: 1.251 ± 0.648
0.417CysGly: 0.417 ± 0.514
0.0CysHis: 0.0 ± 0.0
1.667CysIle: 1.667 ± 1.709
1.251CysLys: 1.251 ± 0.552
2.918CysLeu: 2.918 ± 2.062
0.417CysMet: 0.417 ± 0.367
0.417CysAsn: 0.417 ± 0.514
2.084CysPro: 2.084 ± 0.824
0.417CysGln: 0.417 ± 0.38
0.417CysArg: 0.417 ± 0.427
2.084CysSer: 2.084 ± 1.178
0.417CysThr: 0.417 ± 0.38
0.417CysVal: 0.417 ± 0.38
0.417CysTrp: 0.417 ± 0.367
0.417CysTyr: 0.417 ± 0.367
0.0CysXaa: 0.0 ± 0.0
Asp
2.918AspAla: 2.918 ± 0.944
1.251AspCys: 1.251 ± 0.7
5.836AspAsp: 5.836 ± 2.348
7.086AspGlu: 7.086 ± 0.753
2.501AspPhe: 2.501 ± 0.673
2.918AspGly: 2.918 ± 2.183
0.834AspHis: 0.834 ± 0.365
5.002AspIle: 5.002 ± 1.549
1.667AspLys: 1.667 ± 0.855
6.669AspLeu: 6.669 ± 1.76
0.834AspMet: 0.834 ± 0.415
3.752AspAsn: 3.752 ± 1.589
7.503AspPro: 7.503 ± 2.267
1.667AspGln: 1.667 ± 0.878
2.084AspArg: 2.084 ± 0.701
5.002AspSer: 5.002 ± 1.446
7.086AspThr: 7.086 ± 0.813
2.084AspVal: 2.084 ± 1.819
0.417AspTrp: 0.417 ± 0.367
1.667AspTyr: 1.667 ± 0.747
0.0AspXaa: 0.0 ± 0.0
Glu
5.002GluAla: 5.002 ± 1.455
1.251GluCys: 1.251 ± 0.722
4.168GluAsp: 4.168 ± 0.997
8.337GluGlu: 8.337 ± 1.674
2.918GluPhe: 2.918 ± 1.333
3.752GluGly: 3.752 ± 0.994
1.251GluHis: 1.251 ± 0.513
3.335GluIle: 3.335 ± 1.098
2.918GluLys: 2.918 ± 1.221
5.002GluLeu: 5.002 ± 1.291
1.251GluMet: 1.251 ± 0.776
2.918GluAsn: 2.918 ± 0.515
2.084GluPro: 2.084 ± 0.854
2.501GluGln: 2.501 ± 1.304
3.752GluArg: 3.752 ± 1.299
2.501GluSer: 2.501 ± 1.102
3.752GluThr: 3.752 ± 1.17
1.667GluVal: 1.667 ± 0.855
0.834GluTrp: 0.834 ± 0.761
2.501GluTyr: 2.501 ± 0.837
0.0GluXaa: 0.0 ± 0.0
Phe
1.251PheAla: 1.251 ± 0.331
1.251PheCys: 1.251 ± 0.859
2.501PheAsp: 2.501 ± 1.02
2.918PheGlu: 2.918 ± 0.738
2.918PhePhe: 2.918 ± 1.462
2.501PheGly: 2.501 ± 0.934
0.834PheHis: 0.834 ± 0.569
0.834PheIle: 0.834 ± 0.415
2.918PheLys: 2.918 ± 1.008
3.752PheLeu: 3.752 ± 0.96
1.251PheMet: 1.251 ± 0.4
2.918PheAsn: 2.918 ± 1.266
1.251PhePro: 1.251 ± 0.569
2.501PheGln: 2.501 ± 1.112
1.667PheArg: 1.667 ± 0.569
2.501PheSer: 2.501 ± 1.102
1.667PheThr: 1.667 ± 0.568
2.918PheVal: 2.918 ± 0.83
1.667PheTrp: 1.667 ± 1.033
1.251PheTyr: 1.251 ± 0.331
0.0PheXaa: 0.0 ± 0.0
Gly
2.084GlyAla: 2.084 ± 0.889
0.834GlyCys: 0.834 ± 0.481
7.086GlyAsp: 7.086 ± 1.762
4.168GlyGlu: 4.168 ± 1.192
1.667GlyPhe: 1.667 ± 0.921
3.752GlyGly: 3.752 ± 2.286
1.667GlyHis: 1.667 ± 0.691
4.585GlyIle: 4.585 ± 1.477
2.918GlyLys: 2.918 ± 1.153
4.168GlyLeu: 4.168 ± 1.159
0.0GlyMet: 0.0 ± 0.0
3.335GlyAsn: 3.335 ± 1.084
3.752GlyPro: 3.752 ± 2.039
2.918GlyGln: 2.918 ± 0.883
5.002GlyArg: 5.002 ± 1.575
5.419GlySer: 5.419 ± 0.907
5.836GlyThr: 5.836 ± 2.062
0.834GlyVal: 0.834 ± 0.428
0.417GlyTrp: 0.417 ± 0.38
1.667GlyTyr: 1.667 ± 0.653
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.417HisAsp: 0.417 ± 0.38
0.0HisGlu: 0.0 ± 0.0
1.251HisPhe: 1.251 ± 0.54
1.251HisGly: 1.251 ± 0.694
0.0HisHis: 0.0 ± 0.0
0.834HisIle: 0.834 ± 0.734
0.834HisLys: 0.834 ± 0.365
1.251HisLeu: 1.251 ± 0.752
0.417HisMet: 0.417 ± 0.367
0.417HisAsn: 0.417 ± 0.367
2.084HisPro: 2.084 ± 1.078
1.251HisGln: 1.251 ± 0.648
2.084HisArg: 2.084 ± 1.109
2.084HisSer: 2.084 ± 0.956
0.417HisThr: 0.417 ± 0.367
1.251HisVal: 1.251 ± 0.694
1.251HisTrp: 1.251 ± 0.598
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.084IleAla: 2.084 ± 1.437
1.251IleCys: 1.251 ± 0.728
6.669IleAsp: 6.669 ± 1.732
3.752IleGlu: 3.752 ± 1.565
0.834IlePhe: 0.834 ± 0.504
4.585IleGly: 4.585 ± 2.053
0.834IleHis: 0.834 ± 0.603
4.168IleIle: 4.168 ± 1.36
2.501IleLys: 2.501 ± 1.146
2.918IleLeu: 2.918 ± 1.384
1.251IleMet: 1.251 ± 0.648
0.834IleAsn: 0.834 ± 0.415
4.585IlePro: 4.585 ± 2.323
2.918IleGln: 2.918 ± 0.507
1.667IleArg: 1.667 ± 1.011
5.002IleSer: 5.002 ± 1.273
4.585IleThr: 4.585 ± 1.528
5.002IleVal: 5.002 ± 0.807
0.417IleTrp: 0.417 ± 0.367
4.585IleTyr: 4.585 ± 0.564
0.0IleXaa: 0.0 ± 0.0
Lys
3.752LysAla: 3.752 ± 1.445
1.251LysCys: 1.251 ± 0.648
3.752LysAsp: 3.752 ± 1.134
4.168LysGlu: 4.168 ± 1.497
2.918LysPhe: 2.918 ± 1.465
3.335LysGly: 3.335 ± 1.098
2.501LysHis: 2.501 ± 1.058
2.084LysIle: 2.084 ± 0.554
2.918LysLys: 2.918 ± 1.506
4.585LysLeu: 4.585 ± 2.117
0.834LysMet: 0.834 ± 0.629
4.168LysAsn: 4.168 ± 0.712
1.667LysPro: 1.667 ± 0.732
0.834LysGln: 0.834 ± 0.599
5.002LysArg: 5.002 ± 0.472
4.168LysSer: 4.168 ± 2.045
0.834LysThr: 0.834 ± 0.428
3.335LysVal: 3.335 ± 0.372
1.251LysTrp: 1.251 ± 0.541
2.918LysTyr: 2.918 ± 0.832
0.0LysXaa: 0.0 ± 0.0
Leu
6.669LeuAla: 6.669 ± 0.771
1.251LeuCys: 1.251 ± 1.089
5.419LeuAsp: 5.419 ± 1.475
2.918LeuGlu: 2.918 ± 1.302
4.585LeuPhe: 4.585 ± 1.348
6.669LeuGly: 6.669 ± 3.051
2.501LeuHis: 2.501 ± 0.738
3.335LeuIle: 3.335 ± 1.203
5.836LeuLys: 5.836 ± 1.611
9.17LeuLeu: 9.17 ± 2.899
2.918LeuMet: 2.918 ± 1.671
5.419LeuAsn: 5.419 ± 1.228
6.253LeuPro: 6.253 ± 2.207
8.754LeuGln: 8.754 ± 1.26
1.667LeuArg: 1.667 ± 1.007
5.002LeuSer: 5.002 ± 1.144
4.168LeuThr: 4.168 ± 1.046
2.918LeuVal: 2.918 ± 0.914
0.0LeuTrp: 0.0 ± 0.0
5.419LeuTyr: 5.419 ± 0.817
0.0LeuXaa: 0.0 ± 0.0
Met
0.834MetAla: 0.834 ± 0.428
0.834MetCys: 0.834 ± 0.761
2.084MetAsp: 2.084 ± 0.566
0.417MetGlu: 0.417 ± 0.367
1.251MetPhe: 1.251 ± 0.513
0.834MetGly: 0.834 ± 0.365
0.0MetHis: 0.0 ± 0.0
1.251MetIle: 1.251 ± 0.866
0.417MetLys: 0.417 ± 0.38
1.251MetLeu: 1.251 ± 0.722
0.417MetMet: 0.417 ± 0.367
0.834MetAsn: 0.834 ± 0.428
1.251MetPro: 1.251 ± 1.141
0.417MetGln: 0.417 ± 0.38
0.417MetArg: 0.417 ± 0.38
1.667MetSer: 1.667 ± 0.963
0.834MetThr: 0.834 ± 0.365
1.251MetVal: 1.251 ± 0.875
0.0MetTrp: 0.0 ± 0.0
0.834MetTyr: 0.834 ± 0.51
0.0MetXaa: 0.0 ± 0.0
Asn
1.667AsnAla: 1.667 ± 1.522
1.251AsnCys: 1.251 ± 0.875
2.084AsnAsp: 2.084 ± 0.365
2.501AsnGlu: 2.501 ± 0.409
1.251AsnPhe: 1.251 ± 0.7
2.084AsnGly: 2.084 ± 0.765
0.0AsnHis: 0.0 ± 0.0
4.168AsnIle: 4.168 ± 2.462
3.752AsnLys: 3.752 ± 0.91
3.335AsnLeu: 3.335 ± 1.09
1.667AsnMet: 1.667 ± 0.569
2.918AsnAsn: 2.918 ± 1.516
1.667AsnPro: 1.667 ± 0.83
1.667AsnGln: 1.667 ± 1.467
3.335AsnArg: 3.335 ± 1.271
3.335AsnSer: 3.335 ± 0.735
4.585AsnThr: 4.585 ± 0.865
2.501AsnVal: 2.501 ± 0.46
1.667AsnTrp: 1.667 ± 0.595
2.501AsnTyr: 2.501 ± 0.482
0.0AsnXaa: 0.0 ± 0.0
Pro
5.002ProAla: 5.002 ± 1.757
0.834ProCys: 0.834 ± 0.655
6.253ProAsp: 6.253 ± 1.992
3.752ProGlu: 3.752 ± 1.092
2.501ProPhe: 2.501 ± 1.175
2.918ProGly: 2.918 ± 1.589
0.417ProHis: 0.417 ± 0.367
3.335ProIle: 3.335 ± 2.003
3.752ProLys: 3.752 ± 1.044
5.419ProLeu: 5.419 ± 1.302
0.0ProMet: 0.0 ± 0.0
2.501ProAsn: 2.501 ± 0.709
7.92ProPro: 7.92 ± 2.436
1.667ProGln: 1.667 ± 0.732
2.501ProArg: 2.501 ± 0.877
6.669ProSer: 6.669 ± 3.669
5.002ProThr: 5.002 ± 1.795
3.335ProVal: 3.335 ± 1.36
0.0ProTrp: 0.0 ± 0.0
1.251ProTyr: 1.251 ± 0.773
0.0ProXaa: 0.0 ± 0.0
Gln
1.667GlnAla: 1.667 ± 0.999
0.834GlnCys: 0.834 ± 0.855
2.918GlnAsp: 2.918 ± 0.998
2.084GlnGlu: 2.084 ± 1.092
1.251GlnPhe: 1.251 ± 0.331
1.667GlnGly: 1.667 ± 0.963
0.834GlnHis: 0.834 ± 0.603
3.752GlnIle: 3.752 ± 1.17
2.501GlnLys: 2.501 ± 1.543
5.419GlnLeu: 5.419 ± 1.66
1.667GlnMet: 1.667 ± 0.727
2.501GlnAsn: 2.501 ± 1.104
1.667GlnPro: 1.667 ± 0.912
1.251GlnGln: 1.251 ± 0.384
2.084GlnArg: 2.084 ± 1.058
2.501GlnSer: 2.501 ± 1.739
0.834GlnThr: 0.834 ± 0.628
4.168GlnVal: 4.168 ± 1.213
0.834GlnTrp: 0.834 ± 0.365
1.667GlnTyr: 1.667 ± 0.569
0.0GlnXaa: 0.0 ± 0.0
Arg
4.168ArgAla: 4.168 ± 1.473
1.667ArgCys: 1.667 ± 0.803
2.501ArgAsp: 2.501 ± 0.455
1.251ArgGlu: 1.251 ± 0.774
1.667ArgPhe: 1.667 ± 0.573
4.168ArgGly: 4.168 ± 0.929
0.834ArgHis: 0.834 ± 0.734
2.501ArgIle: 2.501 ± 1.041
4.585ArgLys: 4.585 ± 0.501
6.669ArgLeu: 6.669 ± 1.726
0.834ArgMet: 0.834 ± 0.415
2.084ArgAsn: 2.084 ± 0.923
3.752ArgPro: 3.752 ± 1.684
2.084ArgGln: 2.084 ± 0.854
5.836ArgArg: 5.836 ± 3.438
3.752ArgSer: 3.752 ± 1.119
2.084ArgThr: 2.084 ± 0.889
2.918ArgVal: 2.918 ± 1.105
0.834ArgTrp: 0.834 ± 0.735
0.417ArgTyr: 0.417 ± 0.367
0.0ArgXaa: 0.0 ± 0.0
Ser
3.335SerAla: 3.335 ± 1.009
0.834SerCys: 0.834 ± 0.365
2.918SerAsp: 2.918 ± 0.698
5.419SerGlu: 5.419 ± 1.472
2.084SerPhe: 2.084 ± 1.08
5.419SerGly: 5.419 ± 1.608
1.251SerHis: 1.251 ± 0.689
5.002SerIle: 5.002 ± 1.613
2.501SerLys: 2.501 ± 1.443
9.587SerLeu: 9.587 ± 1.729
0.834SerMet: 0.834 ± 0.365
2.918SerAsn: 2.918 ± 1.164
5.002SerPro: 5.002 ± 0.692
3.335SerGln: 3.335 ± 1.09
2.918SerArg: 2.918 ± 0.821
4.585SerSer: 4.585 ± 2.182
5.836SerThr: 5.836 ± 2.567
5.836SerVal: 5.836 ± 0.683
0.417SerTrp: 0.417 ± 0.38
1.251SerTyr: 1.251 ± 1.141
0.0SerXaa: 0.0 ± 0.0
Thr
2.501ThrAla: 2.501 ± 0.564
1.251ThrCys: 1.251 ± 0.681
3.752ThrAsp: 3.752 ± 1.173
3.335ThrGlu: 3.335 ± 1.098
1.667ThrPhe: 1.667 ± 0.786
5.419ThrGly: 5.419 ± 1.248
0.834ThrHis: 0.834 ± 0.481
4.585ThrIle: 4.585 ± 1.854
2.084ThrLys: 2.084 ± 0.705
4.585ThrLeu: 4.585 ± 0.767
0.0ThrMet: 0.0 ± 0.0
2.918ThrAsn: 2.918 ± 0.882
3.752ThrPro: 3.752 ± 1.048
1.251ThrGln: 1.251 ± 0.365
4.585ThrArg: 4.585 ± 2.063
5.419ThrSer: 5.419 ± 1.384
6.669ThrThr: 6.669 ± 3.536
5.836ThrVal: 5.836 ± 1.321
0.834ThrTrp: 0.834 ± 0.761
1.251ThrTyr: 1.251 ± 0.722
0.0ThrXaa: 0.0 ± 0.0
Val
3.335ValAla: 3.335 ± 1.22
0.0ValCys: 0.0 ± 0.0
5.002ValAsp: 5.002 ± 2.346
2.501ValGlu: 2.501 ± 0.447
2.501ValPhe: 2.501 ± 0.829
3.752ValGly: 3.752 ± 1.551
0.834ValHis: 0.834 ± 0.51
3.335ValIle: 3.335 ± 0.606
2.918ValLys: 2.918 ± 0.944
2.918ValLeu: 2.918 ± 0.541
0.834ValMet: 0.834 ± 0.428
2.918ValAsn: 2.918 ± 0.881
3.752ValPro: 3.752 ± 0.926
2.501ValGln: 2.501 ± 0.907
3.335ValArg: 3.335 ± 0.889
6.253ValSer: 6.253 ± 2.323
2.084ValThr: 2.084 ± 1.462
3.752ValVal: 3.752 ± 1.147
0.417ValTrp: 0.417 ± 0.367
4.168ValTyr: 4.168 ± 1.366
0.0ValXaa: 0.0 ± 0.0
Trp
0.417TrpAla: 0.417 ± 0.38
0.0TrpCys: 0.0 ± 0.0
0.834TrpAsp: 0.834 ± 0.481
0.0TrpGlu: 0.0 ± 0.0
0.417TrpPhe: 0.417 ± 0.364
0.834TrpGly: 0.834 ± 0.533
0.417TrpHis: 0.417 ± 0.367
0.417TrpIle: 0.417 ± 0.38
1.251TrpLys: 1.251 ± 0.722
1.667TrpLeu: 1.667 ± 0.635
0.0TrpMet: 0.0 ± 0.0
0.417TrpAsn: 0.417 ± 0.367
0.417TrpPro: 0.417 ± 0.367
0.417TrpGln: 0.417 ± 0.38
1.251TrpArg: 1.251 ± 0.598
0.0TrpSer: 0.0 ± 0.0
1.251TrpThr: 1.251 ± 0.774
1.667TrpVal: 1.667 ± 0.999
0.0TrpTrp: 0.0 ± 0.0
0.834TrpTyr: 0.834 ± 0.365
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.084TyrAla: 2.084 ± 0.698
1.251TyrCys: 1.251 ± 0.569
2.084TyrAsp: 2.084 ± 0.985
1.667TyrGlu: 1.667 ± 0.573
2.501TyrPhe: 2.501 ± 0.73
1.667TyrGly: 1.667 ± 0.569
0.834TyrHis: 0.834 ± 0.51
2.501TyrIle: 2.501 ± 0.409
5.002TyrLys: 5.002 ± 1.084
3.335TyrLeu: 3.335 ± 0.955
1.251TyrMet: 1.251 ± 0.875
1.667TyrAsn: 1.667 ± 1.11
1.667TyrPro: 1.667 ± 0.638
1.251TyrGln: 1.251 ± 0.774
2.084TyrArg: 2.084 ± 0.857
1.667TyrSer: 1.667 ± 0.653
0.834TyrThr: 0.834 ± 0.428
2.084TyrVal: 2.084 ± 1.124
0.0TyrTrp: 0.0 ± 0.0
1.667TyrTyr: 1.667 ± 1.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2400 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski