Amino acid dipepetide frequency for Gammapapillomavirus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.146AlaAla: 4.146 ± 1.488
0.415AlaCys: 0.415 ± 0.481
2.488AlaAsp: 2.488 ± 0.707
5.39AlaGlu: 5.39 ± 1.937
3.731AlaPhe: 3.731 ± 1.386
2.073AlaGly: 2.073 ± 1.395
1.244AlaHis: 1.244 ± 0.896
2.073AlaIle: 2.073 ± 0.976
2.488AlaLys: 2.488 ± 1.008
4.146AlaLeu: 4.146 ± 1.478
1.244AlaMet: 1.244 ± 1.107
2.488AlaAsn: 2.488 ± 0.916
3.317AlaPro: 3.317 ± 1.843
1.658AlaGln: 1.658 ± 1.18
2.902AlaArg: 2.902 ± 0.81
4.146AlaSer: 4.146 ± 1.161
3.731AlaThr: 3.731 ± 1.009
1.244AlaVal: 1.244 ± 0.588
0.829AlaTrp: 0.829 ± 0.433
2.488AlaTyr: 2.488 ± 1.124
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.829CysCys: 0.829 ± 0.652
1.244CysAsp: 1.244 ± 0.796
2.073CysGlu: 2.073 ± 1.265
0.829CysPhe: 0.829 ± 0.652
0.829CysGly: 0.829 ± 0.825
0.0CysHis: 0.0 ± 0.0
1.244CysIle: 1.244 ± 0.928
2.488CysLys: 2.488 ± 1.036
1.658CysLeu: 1.658 ± 1.2
0.415CysMet: 0.415 ± 0.481
0.829CysAsn: 0.829 ± 0.502
1.244CysPro: 1.244 ± 0.754
1.658CysGln: 1.658 ± 0.633
1.244CysArg: 1.244 ± 0.92
2.073CysSer: 2.073 ± 1.611
1.244CysThr: 1.244 ± 0.518
0.829CysVal: 0.829 ± 0.825
0.829CysTrp: 0.829 ± 0.453
0.415CysTyr: 0.415 ± 0.373
0.0CysXaa: 0.0 ± 0.0
Asp
6.633AspAla: 6.633 ± 0.738
1.658AspCys: 1.658 ± 0.961
4.146AspAsp: 4.146 ± 1.092
3.317AspGlu: 3.317 ± 0.989
2.073AspPhe: 2.073 ± 1.108
3.317AspGly: 3.317 ± 0.845
0.829AspHis: 0.829 ± 0.453
9.121AspIle: 9.121 ± 3.378
2.902AspLys: 2.902 ± 0.696
6.633AspLeu: 6.633 ± 1.753
0.829AspMet: 0.829 ± 0.812
4.146AspAsn: 4.146 ± 0.826
4.561AspPro: 4.561 ± 2.397
2.073AspGln: 2.073 ± 0.821
2.073AspArg: 2.073 ± 1.22
4.561AspSer: 4.561 ± 1.432
2.073AspThr: 2.073 ± 1.0
3.731AspVal: 3.731 ± 1.442
0.829AspTrp: 0.829 ± 0.652
1.244AspTyr: 1.244 ± 0.63
0.0AspXaa: 0.0 ± 0.0
Glu
2.902GluAla: 2.902 ± 1.426
0.829GluCys: 0.829 ± 0.652
6.219GluAsp: 6.219 ± 1.46
4.561GluGlu: 4.561 ± 1.18
2.902GluPhe: 2.902 ± 1.095
2.902GluGly: 2.902 ± 1.162
0.415GluHis: 0.415 ± 0.378
2.488GluIle: 2.488 ± 1.114
2.073GluLys: 2.073 ± 0.784
4.561GluLeu: 4.561 ± 1.648
1.244GluMet: 1.244 ± 0.978
5.39GluAsn: 5.39 ± 1.184
4.146GluPro: 4.146 ± 0.888
2.902GluGln: 2.902 ± 1.116
2.073GluArg: 2.073 ± 1.0
2.902GluSer: 2.902 ± 1.081
4.146GluThr: 4.146 ± 0.928
2.488GluVal: 2.488 ± 0.83
1.244GluTrp: 1.244 ± 0.67
1.658GluTyr: 1.658 ± 0.747
0.0GluXaa: 0.0 ± 0.0
Phe
2.488PheAla: 2.488 ± 0.693
1.244PheCys: 1.244 ± 0.819
1.244PheAsp: 1.244 ± 0.396
1.658PheGlu: 1.658 ± 0.767
2.488PhePhe: 2.488 ± 1.359
2.488PheGly: 2.488 ± 0.546
1.244PheHis: 1.244 ± 0.449
3.317PheIle: 3.317 ± 0.855
3.317PheLys: 3.317 ± 1.517
2.902PheLeu: 2.902 ± 0.948
0.415PheMet: 0.415 ± 0.326
1.244PheAsn: 1.244 ± 0.427
2.488PhePro: 2.488 ± 0.659
2.488PheGln: 2.488 ± 0.704
2.488PheArg: 2.488 ± 0.774
4.146PheSer: 4.146 ± 1.565
1.244PheThr: 1.244 ± 0.793
3.317PheVal: 3.317 ± 1.082
0.829PheTrp: 0.829 ± 0.453
2.073PheTyr: 2.073 ± 1.057
0.0PheXaa: 0.0 ± 0.0
Gly
2.902GlyAla: 2.902 ± 1.067
1.244GlyCys: 1.244 ± 0.831
5.804GlyAsp: 5.804 ± 1.11
6.219GlyGlu: 6.219 ± 2.053
0.829GlyPhe: 0.829 ± 0.453
4.561GlyGly: 4.561 ± 3.181
0.829GlyHis: 0.829 ± 0.812
3.317GlyIle: 3.317 ± 0.795
2.902GlyLys: 2.902 ± 0.597
4.975GlyLeu: 4.975 ± 1.048
0.415GlyMet: 0.415 ± 0.406
3.317GlyAsn: 3.317 ± 0.845
3.317GlyPro: 3.317 ± 1.034
1.244GlyGln: 1.244 ± 0.758
3.317GlyArg: 3.317 ± 0.511
3.317GlySer: 3.317 ± 0.965
6.633GlyThr: 6.633 ± 2.034
3.731GlyVal: 3.731 ± 0.813
0.415GlyTrp: 0.415 ± 0.419
0.829GlyTyr: 0.829 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
0.415HisAla: 0.415 ± 0.406
0.415HisCys: 0.415 ± 0.326
0.415HisAsp: 0.415 ± 0.373
0.415HisGlu: 0.415 ± 0.326
0.829HisPhe: 0.829 ± 0.371
1.658HisGly: 1.658 ± 1.105
0.829HisHis: 0.829 ± 0.517
0.829HisIle: 0.829 ± 0.469
0.829HisLys: 0.829 ± 0.433
2.073HisLeu: 2.073 ± 0.623
0.0HisMet: 0.0 ± 0.0
0.829HisAsn: 0.829 ± 0.453
2.073HisPro: 2.073 ± 0.895
0.415HisGln: 0.415 ± 0.326
0.829HisArg: 0.829 ± 0.469
0.829HisSer: 0.829 ± 0.371
0.415HisThr: 0.415 ± 0.373
1.244HisVal: 1.244 ± 0.635
0.415HisTrp: 0.415 ± 0.406
2.073HisTyr: 2.073 ± 0.856
0.0HisXaa: 0.0 ± 0.0
Ile
1.658IleAla: 1.658 ± 0.633
1.244IleCys: 1.244 ± 0.518
4.975IleAsp: 4.975 ± 2.024
4.561IleGlu: 4.561 ± 0.917
0.0IlePhe: 0.0 ± 0.0
2.902IleGly: 2.902 ± 1.19
0.0IleHis: 0.0 ± 0.0
5.39IleIle: 5.39 ± 1.989
2.902IleLys: 2.902 ± 1.053
5.39IleLeu: 5.39 ± 1.786
0.415IleMet: 0.415 ± 0.326
1.658IleAsn: 1.658 ± 0.844
4.561IlePro: 4.561 ± 2.59
0.829IleGln: 0.829 ± 0.437
2.073IleArg: 2.073 ± 1.391
7.048IleSer: 7.048 ± 1.744
4.561IleThr: 4.561 ± 1.175
3.731IleVal: 3.731 ± 1.161
0.829IleTrp: 0.829 ± 0.502
1.658IleTyr: 1.658 ± 0.935
0.0IleXaa: 0.0 ± 0.0
Lys
1.658LysAla: 1.658 ± 0.764
1.244LysCys: 1.244 ± 0.92
2.902LysAsp: 2.902 ± 1.201
3.731LysGlu: 3.731 ± 1.289
1.244LysPhe: 1.244 ± 0.806
4.146LysGly: 4.146 ± 1.492
0.829LysHis: 0.829 ± 0.502
1.658LysIle: 1.658 ± 0.606
2.488LysLys: 2.488 ± 1.132
3.731LysLeu: 3.731 ± 1.405
2.073LysMet: 2.073 ± 0.894
1.658LysAsn: 1.658 ± 0.555
1.244LysPro: 1.244 ± 0.514
3.317LysGln: 3.317 ± 1.18
5.39LysArg: 5.39 ± 1.54
4.975LysSer: 4.975 ± 1.889
2.073LysThr: 2.073 ± 0.969
5.39LysVal: 5.39 ± 0.734
0.829LysTrp: 0.829 ± 0.468
1.658LysTyr: 1.658 ± 0.961
0.0LysXaa: 0.0 ± 0.0
Leu
5.39LeuAla: 5.39 ± 1.328
2.488LeuCys: 2.488 ± 1.637
6.219LeuAsp: 6.219 ± 1.597
4.146LeuGlu: 4.146 ± 1.425
4.975LeuPhe: 4.975 ± 0.711
5.39LeuGly: 5.39 ± 1.331
1.244LeuHis: 1.244 ± 0.675
2.488LeuIle: 2.488 ± 0.868
7.877LeuLys: 7.877 ± 2.541
12.438LeuLeu: 12.438 ± 3.684
1.658LeuMet: 1.658 ± 0.831
4.146LeuAsn: 4.146 ± 0.998
3.731LeuPro: 3.731 ± 0.733
6.219LeuGln: 6.219 ± 1.462
2.073LeuArg: 2.073 ± 0.903
7.463LeuSer: 7.463 ± 2.565
4.146LeuThr: 4.146 ± 1.136
3.731LeuVal: 3.731 ± 2.122
0.415LeuTrp: 0.415 ± 0.406
2.902LeuTyr: 2.902 ± 0.858
0.0LeuXaa: 0.0 ± 0.0
Met
0.415MetAla: 0.415 ± 0.326
0.829MetCys: 0.829 ± 0.56
0.829MetAsp: 0.829 ± 0.437
0.829MetGlu: 0.829 ± 0.632
0.415MetPhe: 0.415 ± 0.326
0.829MetGly: 0.829 ± 0.652
0.415MetHis: 0.415 ± 0.537
0.0MetIle: 0.0 ± 0.0
0.415MetLys: 0.415 ± 0.481
0.415MetLeu: 0.415 ± 0.373
0.0MetMet: 0.0 ± 0.0
2.073MetAsn: 2.073 ± 1.575
0.415MetPro: 0.415 ± 0.326
1.244MetGln: 1.244 ± 0.449
1.658MetArg: 1.658 ± 0.66
1.658MetSer: 1.658 ± 0.738
1.658MetThr: 1.658 ± 0.657
1.658MetVal: 1.658 ± 0.63
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.073AsnAla: 2.073 ± 1.265
0.415AsnCys: 0.415 ± 0.378
3.317AsnAsp: 3.317 ± 0.93
2.488AsnGlu: 2.488 ± 0.923
2.073AsnPhe: 2.073 ± 0.712
2.902AsnGly: 2.902 ± 0.749
0.415AsnHis: 0.415 ± 0.326
2.073AsnIle: 2.073 ± 0.801
2.073AsnLys: 2.073 ± 0.712
3.317AsnLeu: 3.317 ± 0.743
0.415AsnMet: 0.415 ± 0.481
2.073AsnAsn: 2.073 ± 1.23
3.317AsnPro: 3.317 ± 0.965
2.073AsnGln: 2.073 ± 0.364
2.902AsnArg: 2.902 ± 1.069
6.219AsnSer: 6.219 ± 2.668
4.975AsnThr: 4.975 ± 2.163
2.073AsnVal: 2.073 ± 0.846
0.415AsnTrp: 0.415 ± 0.406
0.829AsnTyr: 0.829 ± 0.549
0.0AsnXaa: 0.0 ± 0.0
Pro
2.488ProAla: 2.488 ± 0.883
2.073ProCys: 2.073 ± 0.846
4.975ProAsp: 4.975 ± 1.577
3.317ProGlu: 3.317 ± 0.915
0.0ProPhe: 0.0 ± 0.0
3.731ProGly: 3.731 ± 1.354
0.415ProHis: 0.415 ± 0.419
2.488ProIle: 2.488 ± 0.785
3.317ProLys: 3.317 ± 0.986
4.975ProLeu: 4.975 ± 1.521
0.0ProMet: 0.0 ± 0.0
4.146ProAsn: 4.146 ± 1.492
7.048ProPro: 7.048 ± 2.44
3.317ProGln: 3.317 ± 1.382
2.902ProArg: 2.902 ± 1.872
6.219ProSer: 6.219 ± 1.992
2.488ProThr: 2.488 ± 1.296
3.731ProVal: 3.731 ± 0.856
0.829ProTrp: 0.829 ± 0.549
2.073ProTyr: 2.073 ± 1.191
0.0ProXaa: 0.0 ± 0.0
Gln
2.902GlnAla: 2.902 ± 0.507
0.829GlnCys: 0.829 ± 0.502
2.073GlnAsp: 2.073 ± 0.374
4.146GlnGlu: 4.146 ± 0.877
2.488GlnPhe: 2.488 ± 0.659
2.073GlnGly: 2.073 ± 1.104
0.415GlnHis: 0.415 ± 0.419
4.146GlnIle: 4.146 ± 0.777
2.073GlnLys: 2.073 ± 0.999
4.975GlnLeu: 4.975 ± 1.106
0.0GlnMet: 0.0 ± 0.0
1.658GlnAsn: 1.658 ± 0.689
2.488GlnPro: 2.488 ± 1.115
2.902GlnGln: 2.902 ± 0.854
3.317GlnArg: 3.317 ± 2.076
2.073GlnSer: 2.073 ± 0.933
2.902GlnThr: 2.902 ± 1.547
1.658GlnVal: 1.658 ± 0.657
0.0GlnTrp: 0.0 ± 0.0
1.244GlnTyr: 1.244 ± 0.518
0.0GlnXaa: 0.0 ± 0.0
Arg
3.731ArgAla: 3.731 ± 1.456
2.073ArgCys: 2.073 ± 0.921
2.073ArgAsp: 2.073 ± 1.233
1.658ArgGlu: 1.658 ± 0.689
4.561ArgPhe: 4.561 ± 1.7
1.658ArgGly: 1.658 ± 0.747
2.902ArgHis: 2.902 ± 0.918
0.415ArgIle: 0.415 ± 0.378
3.317ArgLys: 3.317 ± 0.636
5.804ArgLeu: 5.804 ± 0.968
1.658ArgMet: 1.658 ± 0.817
1.244ArgAsn: 1.244 ± 0.677
3.317ArgPro: 3.317 ± 1.36
2.488ArgGln: 2.488 ± 1.351
8.292ArgArg: 8.292 ± 1.939
4.975ArgSer: 4.975 ± 1.707
2.902ArgThr: 2.902 ± 0.671
4.146ArgVal: 4.146 ± 1.644
0.415ArgTrp: 0.415 ± 0.419
0.415ArgTyr: 0.415 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
3.317SerAla: 3.317 ± 1.614
0.829SerCys: 0.829 ± 0.502
6.219SerAsp: 6.219 ± 0.984
2.488SerGlu: 2.488 ± 1.334
3.317SerPhe: 3.317 ± 1.413
6.219SerGly: 6.219 ± 1.759
2.902SerHis: 2.902 ± 0.998
6.219SerIle: 6.219 ± 1.813
4.561SerLys: 4.561 ± 1.437
7.463SerLeu: 7.463 ± 1.9
0.829SerMet: 0.829 ± 0.453
4.146SerAsn: 4.146 ± 1.572
4.975SerPro: 4.975 ± 1.864
1.244SerGln: 1.244 ± 0.449
5.39SerArg: 5.39 ± 1.171
12.023SerSer: 12.023 ± 2.984
8.292SerThr: 8.292 ± 2.744
5.39SerVal: 5.39 ± 1.369
1.244SerTrp: 1.244 ± 0.67
1.658SerTyr: 1.658 ± 0.867
0.0SerXaa: 0.0 ± 0.0
Thr
2.488ThrAla: 2.488 ± 0.793
2.073ThrCys: 2.073 ± 0.483
2.902ThrAsp: 2.902 ± 0.969
2.488ThrGlu: 2.488 ± 0.521
4.146ThrPhe: 4.146 ± 1.305
6.633ThrGly: 6.633 ± 1.544
1.244ThrHis: 1.244 ± 0.449
2.902ThrIle: 2.902 ± 1.396
1.244ThrLys: 1.244 ± 0.888
4.146ThrLeu: 4.146 ± 1.316
0.829ThrMet: 0.829 ± 0.433
1.658ThrAsn: 1.658 ± 0.689
3.731ThrPro: 3.731 ± 1.248
2.902ThrGln: 2.902 ± 0.605
3.317ThrArg: 3.317 ± 0.935
7.463ThrSer: 7.463 ± 1.832
3.731ThrThr: 3.731 ± 0.71
7.048ThrVal: 7.048 ± 1.623
0.415ThrTrp: 0.415 ± 0.373
1.658ThrTyr: 1.658 ± 1.18
0.0ThrXaa: 0.0 ± 0.0
Val
4.561ValAla: 4.561 ± 1.437
0.415ValCys: 0.415 ± 0.406
4.975ValAsp: 4.975 ± 2.083
2.488ValGlu: 2.488 ± 0.562
2.902ValPhe: 2.902 ± 0.4
3.731ValGly: 3.731 ± 1.46
1.244ValHis: 1.244 ± 0.765
4.146ValIle: 4.146 ± 2.043
1.658ValLys: 1.658 ± 0.595
5.804ValLeu: 5.804 ± 0.931
1.244ValMet: 1.244 ± 0.383
2.073ValAsn: 2.073 ± 1.021
4.146ValPro: 4.146 ± 1.336
3.317ValGln: 3.317 ± 1.023
2.488ValArg: 2.488 ± 0.936
4.975ValSer: 4.975 ± 0.81
3.317ValThr: 3.317 ± 2.07
4.146ValVal: 4.146 ± 0.887
0.415ValTrp: 0.415 ± 0.406
4.146ValTyr: 4.146 ± 1.136
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.244TrpAsp: 1.244 ± 0.514
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.415TrpGly: 0.415 ± 0.419
0.415TrpHis: 0.415 ± 0.373
0.829TrpIle: 0.829 ± 0.652
0.829TrpLys: 0.829 ± 0.453
1.658TrpLeu: 1.658 ± 0.698
0.829TrpMet: 0.829 ± 0.812
1.244TrpAsn: 1.244 ± 0.796
0.415TrpPro: 0.415 ± 0.406
0.829TrpGln: 0.829 ± 0.433
0.415TrpArg: 0.415 ± 0.481
0.415TrpSer: 0.415 ± 0.373
1.244TrpThr: 1.244 ± 0.74
0.829TrpVal: 0.829 ± 0.433
0.0TrpTrp: 0.0 ± 0.0
0.415TrpTyr: 0.415 ± 0.326
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.488TyrAla: 2.488 ± 0.774
0.829TyrCys: 0.829 ± 0.825
2.488TyrAsp: 2.488 ± 0.621
2.073TyrGlu: 2.073 ± 1.191
3.317TyrPhe: 3.317 ± 1.269
1.658TyrGly: 1.658 ± 0.891
0.0TyrHis: 0.0 ± 0.0
1.244TyrIle: 1.244 ± 0.603
2.488TyrLys: 2.488 ± 0.755
2.073TyrLeu: 2.073 ± 1.104
0.829TyrMet: 0.829 ± 0.46
0.415TyrAsn: 0.415 ± 0.378
0.0TyrPro: 0.0 ± 0.0
1.244TyrGln: 1.244 ± 0.449
2.902TyrArg: 2.902 ± 0.674
1.244TyrSer: 1.244 ± 0.449
1.244TyrThr: 1.244 ± 0.742
2.073TyrVal: 2.073 ± 1.237
0.829TyrTrp: 0.829 ± 0.468
1.658TyrTyr: 1.658 ± 1.099
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2413 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski