Amino acid dipeptide frequency for Mastomys coucha papillomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
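As an illustration of what a dipeptide frequency is, the following minimal Python sketch counts overlapping pairs of adjacent residues and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the one-letter alphabet, the mapping of every non-standard letter to X (Xaa), and the choice to count pairs only within a single protein are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X.
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, kept within one protein.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        # No sequence has two residues, so every frequency is zero.
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


# Made-up toy sequences, not taken from the proteome.
freqs = dipeptide_permille(["MAAR", "ARRU"])
```

The result always has 441 keys (21 × 21), so dipeptides that never occur appear with a frequency of 0.0, as in the table.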

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
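The comparison against the uniform expectation can be sketched as below. The page does not state whether the colouring uses the 400- or the 441-dipeptide baseline, or whether any tolerance is applied, so the plain greater-than/less-than test and the helper names `expected_permille` and `classify` are assumptions for illustration only.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
observed = {"ArgArg": 9.111, "SerGly": 9.544, "HisAla": 0.868, "TrpTrp": 0.0}
labels = {name: classify(value) for name, value in observed.items()}
```

With the 20-letter baseline the expectation is 2.5‰, so ArgArg and SerGly come out as overrepresented and HisAla and TrpTrp as underrepresented.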

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.772AlaAla: 4.772 ± 2.409
1.735AlaCys: 1.735 ± 0.769
5.206AlaAsp: 5.206 ± 2.065
3.471AlaGlu: 3.471 ± 1.641
3.471AlaPhe: 3.471 ± 1.53
3.905AlaGly: 3.905 ± 1.062
0.868AlaHis: 0.868 ± 0.631
1.302AlaIle: 1.302 ± 0.487
2.169AlaLys: 2.169 ± 1.119
4.338AlaLeu: 4.338 ± 2.329
2.169AlaMet: 2.169 ± 0.9
1.302AlaAsn: 1.302 ± 0.746
3.037AlaPro: 3.037 ± 1.122
1.735AlaGln: 1.735 ± 1.041
6.508AlaArg: 6.508 ± 1.937
2.169AlaSer: 2.169 ± 0.991
3.471AlaThr: 3.471 ± 0.739
3.037AlaVal: 3.037 ± 0.484
0.868AlaTrp: 0.868 ± 0.778
0.868AlaTyr: 0.868 ± 0.778
0.0AlaXaa: 0.0 ± 0.0
Cys
1.302CysAla: 1.302 ± 0.644
0.868CysCys: 0.868 ± 0.778
3.037CysAsp: 3.037 ± 1.028
1.735CysGlu: 1.735 ± 1.041
0.868CysPhe: 0.868 ± 0.778
0.434CysGly: 0.434 ± 0.39
0.868CysHis: 0.868 ± 0.532
0.434CysIle: 0.434 ± 0.512
1.735CysLys: 1.735 ± 1.363
0.868CysLeu: 0.868 ± 0.564
0.434CysMet: 0.434 ± 0.389
0.434CysAsn: 0.434 ± 0.389
1.735CysPro: 1.735 ± 0.893
0.868CysGln: 0.868 ± 0.526
0.434CysArg: 0.434 ± 0.512
1.735CysSer: 1.735 ± 1.141
1.302CysThr: 1.302 ± 0.688
2.169CysVal: 2.169 ± 1.215
1.302CysTrp: 1.302 ± 0.447
0.434CysTyr: 0.434 ± 0.442
0.0CysXaa: 0.0 ± 0.0
Asp
2.603AspAla: 2.603 ± 1.882
1.302AspCys: 1.302 ± 0.686
3.037AspAsp: 3.037 ± 1.184
3.905AspGlu: 3.905 ± 1.13
6.508AspPhe: 6.508 ± 2.043
3.471AspGly: 3.471 ± 0.515
1.302AspHis: 1.302 ± 1.032
6.508AspIle: 6.508 ± 3.042
1.302AspLys: 1.302 ± 0.688
3.471AspLeu: 3.471 ± 0.557
1.302AspMet: 1.302 ± 0.686
3.905AspAsn: 3.905 ± 0.99
3.471AspPro: 3.471 ± 1.008
2.603AspGln: 2.603 ± 1.034
2.603AspArg: 2.603 ± 0.728
7.809AspSer: 7.809 ± 1.923
3.471AspThr: 3.471 ± 0.782
4.772AspVal: 4.772 ± 1.549
0.868AspTrp: 0.868 ± 0.4
1.302AspTyr: 1.302 ± 0.447
0.0AspXaa: 0.0 ± 0.0
Glu
4.772GluAla: 4.772 ± 1.569
0.434GluCys: 0.434 ± 0.389
5.206GluAsp: 5.206 ± 1.156
6.074GluGlu: 6.074 ± 1.913
0.434GluPhe: 0.434 ± 0.389
4.772GluGly: 4.772 ± 1.852
0.868GluHis: 0.868 ± 0.4
3.905GluIle: 3.905 ± 0.626
2.603GluLys: 2.603 ± 0.806
5.206GluLeu: 5.206 ± 1.568
1.302GluMet: 1.302 ± 0.481
2.169GluAsn: 2.169 ± 0.411
3.037GluPro: 3.037 ± 0.943
3.037GluGln: 3.037 ± 0.646
3.905GluArg: 3.905 ± 1.675
3.037GluSer: 3.037 ± 1.233
2.603GluThr: 2.603 ± 1.383
6.074GluVal: 6.074 ± 1.534
0.0GluTrp: 0.0 ± 0.0
1.302GluTyr: 1.302 ± 0.917
0.0GluXaa: 0.0 ± 0.0
Phe
3.471PheAla: 3.471 ± 2.007
1.302PheCys: 1.302 ± 1.005
3.037PheAsp: 3.037 ± 1.159
5.64PheGlu: 5.64 ± 1.272
1.735PhePhe: 1.735 ± 0.8
4.338PheGly: 4.338 ± 0.847
0.0PheHis: 0.0 ± 0.0
3.037PheIle: 3.037 ± 0.606
3.037PheLys: 3.037 ± 1.251
4.772PheLeu: 4.772 ± 1.333
1.302PheMet: 1.302 ± 0.746
1.302PheAsn: 1.302 ± 0.985
1.302PhePro: 1.302 ± 0.688
0.868PheGln: 0.868 ± 0.4
1.735PheArg: 1.735 ± 0.218
3.905PheSer: 3.905 ± 1.156
3.471PheThr: 3.471 ± 0.39
1.735PheVal: 1.735 ± 1.335
1.735PheTrp: 1.735 ± 0.655
1.735PheTyr: 1.735 ± 0.559
0.0PheXaa: 0.0 ± 0.0
Gly
3.905GlyAla: 3.905 ± 0.701
1.735GlyCys: 1.735 ± 1.067
3.905GlyAsp: 3.905 ± 1.159
5.206GlyGlu: 5.206 ± 0.83
2.169GlyPhe: 2.169 ± 0.611
5.64GlyGly: 5.64 ± 3.749
1.735GlyHis: 1.735 ± 0.559
2.603GlyIle: 2.603 ± 0.988
2.603GlyLys: 2.603 ± 1.089
5.206GlyLeu: 5.206 ± 1.314
0.434GlyMet: 0.434 ± 0.404
5.206GlyAsn: 5.206 ± 0.816
5.64GlyPro: 5.64 ± 2.7
2.169GlyGln: 2.169 ± 0.644
5.64GlyArg: 5.64 ± 1.805
7.809GlySer: 7.809 ± 1.954
7.375GlyThr: 7.375 ± 3.229
3.037GlyVal: 3.037 ± 1.284
0.434GlyTrp: 0.434 ± 0.389
0.868GlyTyr: 0.868 ± 0.457
0.0GlyXaa: 0.0 ± 0.0
His
0.868HisAla: 0.868 ± 0.4
0.868HisCys: 0.868 ± 0.523
0.434HisAsp: 0.434 ± 0.404
0.434HisGlu: 0.434 ± 0.404
0.434HisPhe: 0.434 ± 0.462
0.434HisGly: 0.434 ± 0.389
0.0HisHis: 0.0 ± 0.0
1.735HisIle: 1.735 ± 0.528
0.868HisLys: 0.868 ± 0.924
0.868HisLeu: 0.868 ± 0.457
1.302HisMet: 1.302 ± 0.907
0.868HisAsn: 0.868 ± 0.457
2.169HisPro: 2.169 ± 0.691
1.302HisGln: 1.302 ± 0.644
0.868HisArg: 0.868 ± 0.778
1.735HisSer: 1.735 ± 1.047
0.434HisThr: 0.434 ± 0.512
2.603HisVal: 2.603 ± 0.511
0.434HisTrp: 0.434 ± 0.39
0.868HisTyr: 0.868 ± 0.454
0.0HisXaa: 0.0 ± 0.0
Ile
2.169IleAla: 2.169 ± 1.518
0.434IleCys: 0.434 ± 0.39
2.169IleAsp: 2.169 ± 2.019
4.772IleGlu: 4.772 ± 1.795
0.0IlePhe: 0.0 ± 0.0
3.037IleGly: 3.037 ± 0.791
0.434IleHis: 0.434 ± 0.462
1.735IleIle: 1.735 ± 1.141
0.868IleLys: 0.868 ± 0.532
4.338IleLeu: 4.338 ± 0.599
0.434IleMet: 0.434 ± 0.389
2.603IleAsn: 2.603 ± 1.531
1.302IlePro: 1.302 ± 0.816
0.868IleGln: 0.868 ± 0.681
3.037IleArg: 3.037 ± 0.943
3.905IleSer: 3.905 ± 0.542
2.169IleThr: 2.169 ± 0.991
3.471IleVal: 3.471 ± 1.594
0.434IleTrp: 0.434 ± 0.404
0.868IleTyr: 0.868 ± 0.457
0.0IleXaa: 0.0 ± 0.0
Lys
2.169LysAla: 2.169 ± 0.911
1.735LysCys: 1.735 ± 0.647
2.169LysAsp: 2.169 ± 0.96
2.603LysGlu: 2.603 ± 1.622
1.302LysPhe: 1.302 ± 0.985
4.338LysGly: 4.338 ± 2.259
2.603LysHis: 2.603 ± 0.977
0.434LysIle: 0.434 ± 0.462
2.169LysLys: 2.169 ± 1.407
3.471LysLeu: 3.471 ± 1.014
2.169LysMet: 2.169 ± 0.89
0.0LysAsn: 0.0 ± 0.0
0.868LysPro: 0.868 ± 0.778
2.169LysGln: 2.169 ± 1.053
3.905LysArg: 3.905 ± 0.648
2.603LysSer: 2.603 ± 1.794
2.169LysThr: 2.169 ± 0.891
3.905LysVal: 3.905 ± 1.195
0.434LysTrp: 0.434 ± 0.39
2.169LysTyr: 2.169 ± 0.717
0.0LysXaa: 0.0 ± 0.0
Leu
5.64LeuAla: 5.64 ± 0.883
0.868LeuCys: 0.868 ± 0.625
5.64LeuAsp: 5.64 ± 1.155
6.074LeuGlu: 6.074 ± 1.066
6.508LeuPhe: 6.508 ± 2.899
7.375LeuGly: 7.375 ± 1.623
2.603LeuHis: 2.603 ± 1.383
3.037LeuIle: 3.037 ± 0.966
5.206LeuLys: 5.206 ± 1.117
6.074LeuLeu: 6.074 ± 1.371
0.434LeuMet: 0.434 ± 0.57
1.302LeuAsn: 1.302 ± 0.823
3.905LeuPro: 3.905 ± 1.602
5.206LeuGln: 5.206 ± 1.009
5.206LeuArg: 5.206 ± 1.421
4.338LeuSer: 4.338 ± 0.734
2.603LeuThr: 2.603 ± 0.725
4.772LeuVal: 4.772 ± 1.426
0.0LeuTrp: 0.0 ± 0.0
3.471LeuTyr: 3.471 ± 0.912
0.0LeuXaa: 0.0 ± 0.0
Met
2.169MetAla: 2.169 ± 0.97
0.868MetCys: 0.868 ± 0.4
2.169MetAsp: 2.169 ± 0.623
0.868MetGlu: 0.868 ± 0.807
0.868MetPhe: 0.868 ± 0.4
0.434MetGly: 0.434 ± 0.389
0.434MetHis: 0.434 ± 0.389
0.868MetIle: 0.868 ± 0.454
0.434MetLys: 0.434 ± 0.389
1.735MetLeu: 1.735 ± 0.622
0.0MetMet: 0.0 ± 0.0
1.735MetAsn: 1.735 ± 0.655
0.868MetPro: 0.868 ± 0.4
0.868MetGln: 0.868 ± 0.523
1.302MetArg: 1.302 ± 0.746
1.302MetSer: 1.302 ± 0.798
1.735MetThr: 1.735 ± 0.86
3.037MetVal: 3.037 ± 0.873
0.434MetTrp: 0.434 ± 0.39
0.434MetTyr: 0.434 ± 0.462
0.0MetXaa: 0.0 ± 0.0
Asn
2.603AsnAla: 2.603 ± 0.858
0.434AsnCys: 0.434 ± 0.404
1.302AsnAsp: 1.302 ± 0.686
3.037AsnGlu: 3.037 ± 0.641
2.169AsnPhe: 2.169 ± 0.982
2.603AsnGly: 2.603 ± 1.268
0.868AsnHis: 0.868 ± 0.778
0.0AsnIle: 0.0 ± 0.0
2.169AsnLys: 2.169 ± 0.411
1.302AsnLeu: 1.302 ± 0.687
1.302AsnMet: 1.302 ± 0.447
2.169AsnAsn: 2.169 ± 0.714
3.905AsnPro: 3.905 ± 1.159
1.735AsnGln: 1.735 ± 0.559
3.037AsnArg: 3.037 ± 0.892
4.338AsnSer: 4.338 ± 0.461
4.772AsnThr: 4.772 ± 1.125
2.603AsnVal: 2.603 ± 1.045
0.868AsnTrp: 0.868 ± 0.778
0.434AsnTyr: 0.434 ± 0.389
0.0AsnXaa: 0.0 ± 0.0
Pro
2.603ProAla: 2.603 ± 1.51
1.302ProCys: 1.302 ± 0.917
7.375ProAsp: 7.375 ± 2.273
3.905ProGlu: 3.905 ± 1.583
1.735ProPhe: 1.735 ± 0.691
3.905ProGly: 3.905 ± 2.151
0.0ProHis: 0.0 ± 0.0
3.471ProIle: 3.471 ± 1.772
2.603ProLys: 2.603 ± 0.947
6.941ProLeu: 6.941 ± 1.874
2.169ProMet: 2.169 ± 0.906
3.471ProAsn: 3.471 ± 1.201
6.074ProPro: 6.074 ± 1.176
2.169ProGln: 2.169 ± 0.711
3.037ProArg: 3.037 ± 1.34
4.772ProSer: 4.772 ± 1.867
2.603ProThr: 2.603 ± 1.046
3.037ProVal: 3.037 ± 1.055
0.434ProTrp: 0.434 ± 0.462
1.302ProTyr: 1.302 ± 0.951
0.0ProXaa: 0.0 ± 0.0
Gln
1.302GlnAla: 1.302 ± 0.688
1.735GlnCys: 1.735 ± 0.764
2.169GlnAsp: 2.169 ± 1.314
1.735GlnGlu: 1.735 ± 0.699
2.603GlnPhe: 2.603 ± 0.372
2.603GlnGly: 2.603 ± 0.699
0.434GlnHis: 0.434 ± 0.389
0.868GlnIle: 0.868 ± 0.526
1.302GlnLys: 1.302 ± 0.688
3.905GlnLeu: 3.905 ± 1.21
1.735GlnMet: 1.735 ± 0.655
2.603GlnAsn: 2.603 ± 1.8
3.037GlnPro: 3.037 ± 0.581
2.603GlnGln: 2.603 ± 0.372
2.169GlnArg: 2.169 ± 0.935
1.302GlnSer: 1.302 ± 0.798
3.037GlnThr: 3.037 ± 0.641
2.169GlnVal: 2.169 ± 0.694
1.302GlnTrp: 1.302 ± 0.798
1.302GlnTyr: 1.302 ± 0.688
0.0GlnXaa: 0.0 ± 0.0
Arg
3.471ArgAla: 3.471 ± 0.736
1.302ArgCys: 1.302 ± 0.614
3.905ArgAsp: 3.905 ± 0.515
1.302ArgGlu: 1.302 ± 0.686
4.338ArgPhe: 4.338 ± 1.024
3.471ArgGly: 3.471 ± 0.853
2.603ArgHis: 2.603 ± 0.603
0.434ArgIle: 0.434 ± 0.404
4.772ArgLys: 4.772 ± 1.319
6.941ArgLeu: 6.941 ± 0.56
0.868ArgMet: 0.868 ± 0.479
3.037ArgAsn: 3.037 ± 0.988
3.905ArgPro: 3.905 ± 2.082
2.603ArgGln: 2.603 ± 0.48
9.111ArgArg: 9.111 ± 2.639
3.905ArgSer: 3.905 ± 0.796
4.772ArgThr: 4.772 ± 1.292
5.64ArgVal: 5.64 ± 1.165
0.0ArgTrp: 0.0 ± 0.0
2.169ArgTyr: 2.169 ± 1.119
0.0ArgXaa: 0.0 ± 0.0
Ser
2.603SerAla: 2.603 ± 1.373
1.735SerCys: 1.735 ± 0.697
3.471SerAsp: 3.471 ± 0.911
2.603SerGlu: 2.603 ± 1.815
4.338SerPhe: 4.338 ± 2.093
9.544SerGly: 9.544 ± 2.71
0.434SerHis: 0.434 ± 0.404
2.169SerIle: 2.169 ± 0.694
2.169SerLys: 2.169 ± 0.891
7.809SerLeu: 7.809 ± 2.035
0.868SerMet: 0.868 ± 0.781
2.169SerAsn: 2.169 ± 0.991
5.206SerPro: 5.206 ± 1.383
3.037SerGln: 3.037 ± 0.916
3.905SerArg: 3.905 ± 1.575
6.941SerSer: 6.941 ± 1.954
7.375SerThr: 7.375 ± 1.309
6.508SerVal: 6.508 ± 1.631
0.434SerTrp: 0.434 ± 0.462
1.735SerTyr: 1.735 ± 0.571
0.0SerXaa: 0.0 ± 0.0
Thr
3.037ThrAla: 3.037 ± 0.313
1.302ThrCys: 1.302 ± 0.634
3.905ThrAsp: 3.905 ± 0.667
3.037ThrGlu: 3.037 ± 1.214
3.905ThrPhe: 3.905 ± 1.705
6.508ThrGly: 6.508 ± 1.912
1.302ThrHis: 1.302 ± 1.211
3.471ThrIle: 3.471 ± 0.619
1.302ThrLys: 1.302 ± 0.487
4.338ThrLeu: 4.338 ± 1.414
1.735ThrMet: 1.735 ± 0.655
3.037ThrAsn: 3.037 ± 1.042
5.64ThrPro: 5.64 ± 0.557
3.037ThrGln: 3.037 ± 0.641
3.905ThrArg: 3.905 ± 0.669
4.772ThrSer: 4.772 ± 1.583
6.508ThrThr: 6.508 ± 1.366
6.508ThrVal: 6.508 ± 1.223
0.434ThrTrp: 0.434 ± 0.512
0.434ThrTyr: 0.434 ± 0.404
0.0ThrXaa: 0.0 ± 0.0
Val
4.338ValAla: 4.338 ± 1.424
1.735ValCys: 1.735 ± 1.335
6.074ValAsp: 6.074 ± 1.648
2.603ValGlu: 2.603 ± 0.879
2.603ValPhe: 2.603 ± 0.484
4.338ValGly: 4.338 ± 1.259
1.302ValHis: 1.302 ± 0.684
2.603ValIle: 2.603 ± 1.082
2.603ValLys: 2.603 ± 0.916
5.64ValLeu: 5.64 ± 0.976
0.868ValMet: 0.868 ± 0.457
3.037ValAsn: 3.037 ± 0.979
6.941ValPro: 6.941 ± 1.885
2.169ValGln: 2.169 ± 0.471
4.772ValArg: 4.772 ± 1.007
7.375ValSer: 7.375 ± 2.514
4.772ValThr: 4.772 ± 0.908
3.471ValVal: 3.471 ± 1.437
0.868ValTrp: 0.868 ± 0.532
3.037ValTyr: 3.037 ± 1.947
0.0ValXaa: 0.0 ± 0.0
Trp
0.434TrpAla: 0.434 ± 0.389
0.434TrpCys: 0.434 ± 0.389
0.434TrpAsp: 0.434 ± 0.39
0.868TrpGlu: 0.868 ± 0.532
0.0TrpPhe: 0.0 ± 0.0
0.434TrpGly: 0.434 ± 0.462
0.434TrpHis: 0.434 ± 0.462
0.434TrpIle: 0.434 ± 0.389
0.868TrpLys: 0.868 ± 0.778
1.302TrpLeu: 1.302 ± 0.686
0.0TrpMet: 0.0 ± 0.0
0.434TrpAsn: 0.434 ± 0.39
0.434TrpPro: 0.434 ± 0.39
0.0TrpGln: 0.0 ± 0.0
1.302TrpArg: 1.302 ± 0.754
0.434TrpSer: 0.434 ± 0.39
1.735TrpThr: 1.735 ± 0.816
1.302TrpVal: 1.302 ± 0.798
0.0TrpTrp: 0.0 ± 0.0
0.434TrpTyr: 0.434 ± 0.404
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.169TyrAla: 2.169 ± 1.053
0.868TyrCys: 0.868 ± 0.532
0.868TyrAsp: 0.868 ± 0.4
0.868TyrGlu: 0.868 ± 0.924
3.471TyrPhe: 3.471 ± 0.982
1.735TyrGly: 1.735 ± 0.218
0.434TyrHis: 0.434 ± 0.462
0.0TyrIle: 0.0 ± 0.0
2.603TyrLys: 2.603 ± 1.107
2.169TyrLeu: 2.169 ± 0.73
1.302TyrMet: 1.302 ± 0.785
0.868TyrAsn: 0.868 ± 0.454
0.434TyrPro: 0.434 ± 0.404
0.868TyrGln: 0.868 ± 0.4
2.169TyrArg: 2.169 ± 0.714
0.868TyrSer: 0.868 ± 0.625
1.735TyrThr: 1.735 ± 0.991
1.302TyrVal: 1.302 ± 0.487
0.434TyrTrp: 0.434 ± 0.462
2.603TyrTyr: 2.603 ± 1.697
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2306 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
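A protein-level bootstrap of this kind might look like the sketch below. The page only states that bootstrapping was done 100 times at the protein level; resampling as many proteins as the proteome contains, reporting the standard deviation as the error, and the names `dipeptide_permille` and `bootstrap_error` are assumptions of this example, not a description of the database's actual procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs, counted within each protein separately.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    values = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, as many as the proteome has.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(values)


# Made-up toy sequences, not taken from the proteome.
proteins = ["MARRKLS", "MSGRRAA", "MKLLSGT"]
error = bootstrap_error(proteins, "RR")
```

Because whole proteins are resampled, a dipeptide that is concentrated in one protein gets a larger error than one spread evenly across all proteins, which is consistent with the small proteome (6 proteins) showing sizeable errors for many entries.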

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski