Amino acid dipepetide frequency for Miniopterus schreibersii papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.954AlaAla: 9.954 ± 1.525
0.415AlaCys: 0.415 ± 0.402
4.562AlaAsp: 4.562 ± 0.687
5.392AlaGlu: 5.392 ± 1.554
2.903AlaPhe: 2.903 ± 0.779
3.733AlaGly: 3.733 ± 1.972
0.83AlaHis: 0.83 ± 0.631
2.489AlaIle: 2.489 ± 1.196
2.489AlaLys: 2.489 ± 0.818
6.636AlaLeu: 6.636 ± 1.958
0.83AlaMet: 0.83 ± 0.631
2.074AlaAsn: 2.074 ± 0.872
4.977AlaPro: 4.977 ± 1.47
1.244AlaGln: 1.244 ± 0.729
4.977AlaArg: 4.977 ± 1.208
6.636AlaSer: 6.636 ± 1.58
3.733AlaThr: 3.733 ± 1.193
4.562AlaVal: 4.562 ± 1.226
0.415AlaTrp: 0.415 ± 0.402
0.83AlaTyr: 0.83 ± 0.465
0.0AlaXaa: 0.0 ± 0.0
Cys
0.83CysAla: 0.83 ± 0.402
1.244CysCys: 1.244 ± 1.591
1.244CysAsp: 1.244 ± 0.947
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.83CysGly: 0.83 ± 0.764
0.0CysHis: 0.0 ± 0.0
1.244CysIle: 1.244 ± 0.947
1.659CysLys: 1.659 ± 1.284
1.659CysLeu: 1.659 ± 1.473
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.244CysPro: 1.244 ± 0.729
0.0CysGln: 0.0 ± 0.0
2.903CysArg: 2.903 ± 1.285
2.489CysSer: 2.489 ± 0.66
2.074CysThr: 2.074 ± 1.091
1.244CysVal: 1.244 ± 0.617
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.489AspAla: 2.489 ± 0.737
0.83AspCys: 0.83 ± 0.631
6.221AspAsp: 6.221 ± 2.513
3.318AspGlu: 3.318 ± 0.934
4.148AspPhe: 4.148 ± 0.896
4.148AspGly: 4.148 ± 1.266
1.244AspHis: 1.244 ± 1.101
3.733AspIle: 3.733 ± 0.661
2.074AspLys: 2.074 ± 0.367
4.148AspLeu: 4.148 ± 1.153
1.244AspMet: 1.244 ± 0.341
4.562AspAsn: 4.562 ± 1.016
3.318AspPro: 3.318 ± 1.221
2.903AspGln: 2.903 ± 1.028
2.489AspArg: 2.489 ± 1.248
2.489AspSer: 2.489 ± 1.029
4.562AspThr: 4.562 ± 2.461
2.074AspVal: 2.074 ± 0.7
0.83AspTrp: 0.83 ± 0.631
1.659AspTyr: 1.659 ± 0.664
0.0AspXaa: 0.0 ± 0.0
Glu
2.903GluAla: 2.903 ± 0.942
2.074GluCys: 2.074 ± 1.165
5.392GluAsp: 5.392 ± 0.871
6.221GluGlu: 6.221 ± 1.54
1.659GluPhe: 1.659 ± 1.043
2.074GluGly: 2.074 ± 0.88
1.244GluHis: 1.244 ± 0.74
2.903GluIle: 2.903 ± 0.948
1.659GluLys: 1.659 ± 1.137
7.466GluLeu: 7.466 ± 1.129
0.83GluMet: 0.83 ± 0.413
4.148GluAsn: 4.148 ± 0.969
4.148GluPro: 4.148 ± 1.07
2.074GluGln: 2.074 ± 0.969
4.562GluArg: 4.562 ± 0.682
4.977GluSer: 4.977 ± 1.362
5.392GluThr: 5.392 ± 0.925
4.148GluVal: 4.148 ± 1.462
1.244GluTrp: 1.244 ± 0.409
0.415GluTyr: 0.415 ± 0.439
0.0GluXaa: 0.0 ± 0.0
Phe
2.903PheAla: 2.903 ± 1.042
1.244PheCys: 1.244 ± 1.163
3.318PheAsp: 3.318 ± 0.755
1.659PheGlu: 1.659 ± 0.681
2.903PhePhe: 2.903 ± 0.815
0.83PheGly: 0.83 ± 0.402
1.244PheHis: 1.244 ± 0.576
2.074PheIle: 2.074 ± 0.463
2.489PheLys: 2.489 ± 1.029
6.636PheLeu: 6.636 ± 1.411
0.415PheMet: 0.415 ± 0.343
0.83PheAsn: 0.83 ± 0.403
1.244PhePro: 1.244 ± 0.65
2.489PheGln: 2.489 ± 1.12
2.489PheArg: 2.489 ± 0.472
1.244PheSer: 1.244 ± 0.632
0.83PheThr: 0.83 ± 0.618
2.489PheVal: 2.489 ± 1.137
1.244PheTrp: 1.244 ± 0.601
2.074PheTyr: 2.074 ± 1.071
0.0PheXaa: 0.0 ± 0.0
Gly
5.392GlyAla: 5.392 ± 0.847
0.83GlyCys: 0.83 ± 0.618
3.733GlyAsp: 3.733 ± 0.909
3.318GlyGlu: 3.318 ± 0.723
1.659GlyPhe: 1.659 ± 0.54
5.807GlyGly: 5.807 ± 1.683
1.244GlyHis: 1.244 ± 0.729
2.903GlyIle: 2.903 ± 1.556
3.318GlyLys: 3.318 ± 0.866
3.733GlyLeu: 3.733 ± 1.004
0.0GlyMet: 0.0 ± 0.0
2.074GlyAsn: 2.074 ± 0.426
4.977GlyPro: 4.977 ± 1.48
2.489GlyGln: 2.489 ± 0.768
8.71GlyArg: 8.71 ± 2.86
2.489GlySer: 2.489 ± 0.461
6.221GlyThr: 6.221 ± 1.182
2.074GlyVal: 2.074 ± 0.643
0.83GlyTrp: 0.83 ± 0.631
2.074GlyTyr: 2.074 ± 1.311
0.0GlyXaa: 0.0 ± 0.0
His
1.244HisAla: 1.244 ± 0.409
0.0HisCys: 0.0 ± 0.0
0.415HisAsp: 0.415 ± 0.343
1.244HisGlu: 1.244 ± 0.66
1.244HisPhe: 1.244 ± 0.514
2.489HisGly: 2.489 ± 1.575
0.415HisHis: 0.415 ± 0.343
0.83HisIle: 0.83 ± 0.604
0.415HisLys: 0.415 ± 0.316
2.074HisLeu: 2.074 ± 0.669
0.83HisMet: 0.83 ± 0.365
0.415HisAsn: 0.415 ± 0.402
2.074HisPro: 2.074 ± 0.976
2.074HisGln: 2.074 ± 0.937
2.903HisArg: 2.903 ± 1.04
2.074HisSer: 2.074 ± 0.88
1.659HisThr: 1.659 ± 0.807
1.244HisVal: 1.244 ± 1.029
0.83HisTrp: 0.83 ± 0.402
1.244HisTyr: 1.244 ± 0.423
0.0HisXaa: 0.0 ± 0.0
Ile
1.659IleAla: 1.659 ± 0.554
0.83IleCys: 0.83 ± 0.402
3.733IleAsp: 3.733 ± 1.045
4.148IleGlu: 4.148 ± 0.607
1.659IlePhe: 1.659 ± 0.989
3.318IleGly: 3.318 ± 0.754
0.415IleHis: 0.415 ± 0.402
2.903IleIle: 2.903 ± 0.786
0.83IleLys: 0.83 ± 0.413
5.807IleLeu: 5.807 ± 1.526
0.83IleMet: 0.83 ± 0.409
0.415IleAsn: 0.415 ± 0.402
4.562IlePro: 4.562 ± 1.581
2.903IleGln: 2.903 ± 1.063
2.489IleArg: 2.489 ± 0.871
3.318IleSer: 3.318 ± 1.725
2.903IleThr: 2.903 ± 1.543
2.074IleVal: 2.074 ± 0.498
0.0IleTrp: 0.0 ± 0.0
2.074IleTyr: 2.074 ± 0.914
0.0IleXaa: 0.0 ± 0.0
Lys
2.903LysAla: 2.903 ± 1.114
1.244LysCys: 1.244 ± 0.785
2.074LysAsp: 2.074 ± 0.832
2.489LysGlu: 2.489 ± 1.143
2.489LysPhe: 2.489 ± 1.479
0.83LysGly: 0.83 ± 0.631
0.83LysHis: 0.83 ± 0.631
3.318LysIle: 3.318 ± 1.035
4.977LysLys: 4.977 ± 0.945
2.074LysLeu: 2.074 ± 0.761
0.415LysMet: 0.415 ± 0.368
2.489LysAsn: 2.489 ± 0.641
1.244LysPro: 1.244 ± 0.626
2.074LysGln: 2.074 ± 1.048
3.318LysArg: 3.318 ± 0.857
4.148LysSer: 4.148 ± 2.435
4.148LysThr: 4.148 ± 1.045
3.318LysVal: 3.318 ± 0.843
0.415LysTrp: 0.415 ± 0.483
3.318LysTyr: 3.318 ± 1.079
0.0LysXaa: 0.0 ± 0.0
Leu
7.466LeuAla: 7.466 ± 1.469
1.659LeuCys: 1.659 ± 1.236
3.733LeuAsp: 3.733 ± 0.756
6.221LeuGlu: 6.221 ± 1.646
6.221LeuPhe: 6.221 ± 1.801
5.807LeuGly: 5.807 ± 2.997
3.733LeuHis: 3.733 ± 1.255
3.733LeuIle: 3.733 ± 0.754
6.221LeuLys: 6.221 ± 2.495
6.636LeuLeu: 6.636 ± 2.252
0.83LeuMet: 0.83 ± 0.631
3.733LeuAsn: 3.733 ± 1.133
2.903LeuPro: 2.903 ± 1.337
5.392LeuGln: 5.392 ± 1.018
5.392LeuArg: 5.392 ± 1.556
6.221LeuSer: 6.221 ± 0.976
4.562LeuThr: 4.562 ± 1.564
4.562LeuVal: 4.562 ± 0.788
0.0LeuTrp: 0.0 ± 0.0
3.318LeuTyr: 3.318 ± 1.253
0.0LeuXaa: 0.0 ± 0.0
Met
3.318MetAla: 3.318 ± 1.066
0.0MetCys: 0.0 ± 0.0
0.83MetAsp: 0.83 ± 0.804
0.83MetGlu: 0.83 ± 0.403
0.83MetPhe: 0.83 ± 0.402
0.415MetGly: 0.415 ± 0.343
0.0MetHis: 0.0 ± 0.0
0.415MetIle: 0.415 ± 0.402
0.415MetLys: 0.415 ± 0.316
2.903MetLeu: 2.903 ± 0.974
0.0MetMet: 0.0 ± 0.0
1.244MetAsn: 1.244 ± 0.663
0.0MetPro: 0.0 ± 0.0
1.244MetGln: 1.244 ± 0.471
0.83MetArg: 0.83 ± 0.604
2.074MetSer: 2.074 ± 0.771
0.83MetThr: 0.83 ± 0.413
1.244MetVal: 1.244 ± 0.601
0.83MetTrp: 0.83 ± 0.495
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.659AsnAla: 1.659 ± 0.647
1.659AsnCys: 1.659 ± 0.709
0.415AsnAsp: 0.415 ± 0.316
2.074AsnGlu: 2.074 ± 1.155
0.415AsnPhe: 0.415 ± 0.402
2.074AsnGly: 2.074 ± 0.832
0.415AsnHis: 0.415 ± 0.343
2.903AsnIle: 2.903 ± 0.699
3.318AsnLys: 3.318 ± 0.807
2.489AsnLeu: 2.489 ± 1.312
0.83AsnMet: 0.83 ± 0.402
1.244AsnAsn: 1.244 ± 0.74
2.489AsnPro: 2.489 ± 0.89
2.074AsnGln: 2.074 ± 0.784
2.489AsnArg: 2.489 ± 0.439
2.903AsnSer: 2.903 ± 0.962
2.489AsnThr: 2.489 ± 1.093
0.83AsnVal: 0.83 ± 0.631
0.0AsnTrp: 0.0 ± 0.0
2.074AsnTyr: 2.074 ± 0.878
0.0AsnXaa: 0.0 ± 0.0
Pro
5.807ProAla: 5.807 ± 2.101
0.415ProCys: 0.415 ± 0.53
3.733ProAsp: 3.733 ± 1.313
3.733ProGlu: 3.733 ± 1.256
2.074ProPhe: 2.074 ± 0.788
3.733ProGly: 3.733 ± 1.254
1.659ProHis: 1.659 ± 1.208
2.074ProIle: 2.074 ± 0.988
4.562ProLys: 4.562 ± 0.899
5.392ProLeu: 5.392 ± 0.647
1.659ProMet: 1.659 ± 0.935
1.659ProAsn: 1.659 ± 1.607
7.051ProPro: 7.051 ± 3.317
2.489ProGln: 2.489 ± 1.178
4.562ProArg: 4.562 ± 1.542
4.148ProSer: 4.148 ± 1.016
3.318ProThr: 3.318 ± 0.843
3.733ProVal: 3.733 ± 1.447
0.415ProTrp: 0.415 ± 0.439
0.83ProTyr: 0.83 ± 0.804
0.0ProXaa: 0.0 ± 0.0
Gln
2.903GlnAla: 2.903 ± 0.708
0.0GlnCys: 0.0 ± 0.0
1.244GlnAsp: 1.244 ± 0.73
4.148GlnGlu: 4.148 ± 0.93
0.83GlnPhe: 0.83 ± 0.402
4.148GlnGly: 4.148 ± 1.199
1.659GlnHis: 1.659 ± 0.632
2.074GlnIle: 2.074 ± 0.77
0.83GlnLys: 0.83 ± 0.465
3.733GlnLeu: 3.733 ± 1.909
2.489GlnMet: 2.489 ± 1.02
0.83GlnAsn: 0.83 ± 0.402
3.318GlnPro: 3.318 ± 0.909
1.659GlnGln: 1.659 ± 0.7
3.733GlnArg: 3.733 ± 0.863
2.074GlnSer: 2.074 ± 0.426
1.659GlnThr: 1.659 ± 1.12
2.074GlnVal: 2.074 ± 1.235
0.415GlnTrp: 0.415 ± 0.316
0.83GlnTyr: 0.83 ± 0.686
0.0GlnXaa: 0.0 ± 0.0
Arg
4.148ArgAla: 4.148 ± 1.02
1.659ArgCys: 1.659 ± 1.228
3.733ArgAsp: 3.733 ± 1.125
5.392ArgGlu: 5.392 ± 1.479
2.903ArgPhe: 2.903 ± 1.107
4.562ArgGly: 4.562 ± 1.388
2.489ArgHis: 2.489 ± 0.673
4.562ArgIle: 4.562 ± 0.936
2.903ArgLys: 2.903 ± 0.766
8.71ArgLeu: 8.71 ± 2.287
1.659ArgMet: 1.659 ± 0.952
1.659ArgAsn: 1.659 ± 0.647
5.807ArgPro: 5.807 ± 1.224
2.074ArgGln: 2.074 ± 0.875
8.295ArgArg: 8.295 ± 3.197
2.903ArgSer: 2.903 ± 1.253
6.221ArgThr: 6.221 ± 1.978
4.562ArgVal: 4.562 ± 1.398
0.0ArgTrp: 0.0 ± 0.0
2.489ArgTyr: 2.489 ± 1.341
0.0ArgXaa: 0.0 ± 0.0
Ser
1.659SerAla: 1.659 ± 0.681
1.659SerCys: 1.659 ± 1.242
3.318SerAsp: 3.318 ± 1.76
4.562SerGlu: 4.562 ± 1.237
2.074SerPhe: 2.074 ± 0.872
6.636SerGly: 6.636 ± 0.859
2.489SerHis: 2.489 ± 1.261
1.659SerIle: 1.659 ± 0.703
2.489SerLys: 2.489 ± 1.098
5.392SerLeu: 5.392 ± 1.317
1.659SerMet: 1.659 ± 0.83
3.733SerAsn: 3.733 ± 2.041
3.318SerPro: 3.318 ± 1.038
2.489SerGln: 2.489 ± 0.815
5.392SerArg: 5.392 ± 1.299
7.881SerSer: 7.881 ± 2.753
7.466SerThr: 7.466 ± 0.895
6.221SerVal: 6.221 ± 2.116
0.0SerTrp: 0.0 ± 0.0
0.83SerTyr: 0.83 ± 0.402
0.0SerXaa: 0.0 ± 0.0
Thr
5.392ThrAla: 5.392 ± 1.259
0.83ThrCys: 0.83 ± 0.804
3.733ThrAsp: 3.733 ± 1.511
5.392ThrGlu: 5.392 ± 1.27
3.318ThrPhe: 3.318 ± 1.15
7.051ThrGly: 7.051 ± 1.37
1.659ThrHis: 1.659 ± 0.977
2.903ThrIle: 2.903 ± 1.524
1.244ThrLys: 1.244 ± 0.409
4.977ThrLeu: 4.977 ± 1.301
2.074ThrMet: 2.074 ± 0.618
2.074ThrAsn: 2.074 ± 1.31
5.392ThrPro: 5.392 ± 0.757
1.244ThrGln: 1.244 ± 0.409
3.733ThrArg: 3.733 ± 1.538
6.636ThrSer: 6.636 ± 1.573
4.977ThrThr: 4.977 ± 1.456
4.977ThrVal: 4.977 ± 1.467
0.83ThrTrp: 0.83 ± 0.675
1.244ThrTyr: 1.244 ± 0.947
0.0ThrXaa: 0.0 ± 0.0
Val
4.977ValAla: 4.977 ± 1.654
1.244ValCys: 1.244 ± 0.65
4.977ValAsp: 4.977 ± 1.049
3.733ValGlu: 3.733 ± 0.845
1.244ValPhe: 1.244 ± 0.632
3.318ValGly: 3.318 ± 1.037
2.489ValHis: 2.489 ± 1.381
2.074ValIle: 2.074 ± 0.914
2.489ValLys: 2.489 ± 0.961
5.392ValLeu: 5.392 ± 1.648
0.415ValMet: 0.415 ± 0.44
0.83ValAsn: 0.83 ± 0.403
4.148ValPro: 4.148 ± 1.584
2.074ValGln: 2.074 ± 0.727
4.562ValArg: 4.562 ± 0.743
4.977ValSer: 4.977 ± 1.698
3.318ValThr: 3.318 ± 0.742
5.807ValVal: 5.807 ± 2.112
0.83ValTrp: 0.83 ± 0.804
0.83ValTyr: 0.83 ± 0.402
0.0ValXaa: 0.0 ± 0.0
Trp
0.83TrpAla: 0.83 ± 0.402
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.415TrpGlu: 0.415 ± 0.439
0.0TrpPhe: 0.0 ± 0.0
0.83TrpGly: 0.83 ± 0.495
0.83TrpHis: 0.83 ± 0.804
1.244TrpIle: 1.244 ± 0.733
0.83TrpLys: 0.83 ± 0.402
1.244TrpLeu: 1.244 ± 0.74
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.415TrpGln: 0.415 ± 0.439
0.415TrpArg: 0.415 ± 0.316
0.415TrpSer: 0.415 ± 0.316
1.659TrpThr: 1.659 ± 0.889
0.83TrpVal: 0.83 ± 0.402
0.83TrpTrp: 0.83 ± 0.877
0.415TrpTyr: 0.415 ± 0.316
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.244TyrAla: 1.244 ± 0.601
0.83TyrCys: 0.83 ± 0.631
2.074TyrAsp: 2.074 ± 0.893
1.244TyrGlu: 1.244 ± 0.576
2.074TyrPhe: 2.074 ± 0.662
1.244TyrGly: 1.244 ± 0.632
1.244TyrHis: 1.244 ± 0.341
0.83TyrIle: 0.83 ± 0.402
2.903TyrLys: 2.903 ± 0.904
1.244TyrLeu: 1.244 ± 0.601
0.83TyrMet: 0.83 ± 0.631
0.83TyrAsn: 0.83 ± 0.621
0.83TyrPro: 0.83 ± 0.804
1.244TyrGln: 1.244 ± 0.409
2.489TyrArg: 2.489 ± 1.151
0.83TyrSer: 0.83 ± 0.495
1.659TyrThr: 1.659 ± 1.063
1.659TyrVal: 1.659 ± 0.933
1.244TyrTrp: 1.244 ± 0.409
2.489TyrTyr: 2.489 ± 1.301
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2412 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski