Amino acid dipepetide frequency for Francolinus leucoscepus papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.013AlaAla: 5.013 ± 1.022
0.386AlaCys: 0.386 ± 0.574
3.471AlaAsp: 3.471 ± 0.599
4.628AlaGlu: 4.628 ± 0.976
2.7AlaPhe: 2.7 ± 0.783
6.17AlaGly: 6.17 ± 1.372
1.543AlaHis: 1.543 ± 0.732
2.314AlaIle: 2.314 ± 0.904
3.085AlaLys: 3.085 ± 0.718
5.399AlaLeu: 5.399 ± 1.99
1.928AlaMet: 1.928 ± 0.749
1.543AlaAsn: 1.543 ± 0.68
5.785AlaPro: 5.785 ± 1.028
1.928AlaGln: 1.928 ± 0.608
6.17AlaArg: 6.17 ± 1.608
6.556AlaSer: 6.556 ± 1.628
1.543AlaThr: 1.543 ± 0.558
4.242AlaVal: 4.242 ± 1.736
0.771AlaTrp: 0.771 ± 0.364
1.543AlaTyr: 1.543 ± 0.617
0.0AlaXaa: 0.0 ± 0.0
Cys
3.085CysAla: 3.085 ± 1.328
0.0CysCys: 0.0 ± 0.0
0.386CysAsp: 0.386 ± 0.309
0.386CysGlu: 0.386 ± 0.549
0.386CysPhe: 0.386 ± 0.338
1.928CysGly: 1.928 ± 0.846
0.386CysHis: 0.386 ± 0.309
0.0CysIle: 0.0 ± 0.0
1.928CysLys: 1.928 ± 0.78
1.928CysLeu: 1.928 ± 1.158
0.386CysMet: 0.386 ± 0.309
0.771CysAsn: 0.771 ± 0.684
1.543CysPro: 1.543 ± 0.524
0.771CysGln: 0.771 ± 0.625
1.928CysArg: 1.928 ± 0.893
1.928CysSer: 1.928 ± 0.994
0.771CysThr: 0.771 ± 0.366
0.771CysVal: 0.771 ± 0.632
0.386CysTrp: 0.386 ± 0.338
0.771CysTyr: 0.771 ± 0.786
0.0CysXaa: 0.0 ± 0.0
Asp
4.242AspAla: 4.242 ± 1.462
2.7AspCys: 2.7 ± 1.07
3.857AspAsp: 3.857 ± 2.025
3.471AspGlu: 3.471 ± 1.207
3.471AspPhe: 3.471 ± 0.599
5.013AspGly: 5.013 ± 1.714
0.771AspHis: 0.771 ± 0.404
5.399AspIle: 5.399 ± 2.627
1.928AspLys: 1.928 ± 0.929
3.085AspLeu: 3.085 ± 1.026
1.543AspMet: 1.543 ± 0.844
2.7AspAsn: 2.7 ± 0.838
6.17AspPro: 6.17 ± 1.357
1.928AspGln: 1.928 ± 0.809
4.242AspArg: 4.242 ± 1.311
5.013AspSer: 5.013 ± 1.441
5.013AspThr: 5.013 ± 1.649
2.7AspVal: 2.7 ± 1.01
1.157AspTrp: 1.157 ± 0.587
0.771AspTyr: 0.771 ± 0.366
0.0AspXaa: 0.0 ± 0.0
Glu
6.556GluAla: 6.556 ± 1.174
0.771GluCys: 0.771 ± 0.599
4.628GluAsp: 4.628 ± 1.841
3.857GluGlu: 3.857 ± 1.358
1.928GluPhe: 1.928 ± 0.504
4.242GluGly: 4.242 ± 1.265
1.157GluHis: 1.157 ± 0.587
2.314GluIle: 2.314 ± 1.065
1.157GluLys: 1.157 ± 0.633
5.013GluLeu: 5.013 ± 1.42
0.771GluMet: 0.771 ± 0.445
1.157GluAsn: 1.157 ± 0.404
2.314GluPro: 2.314 ± 0.818
1.157GluGln: 1.157 ± 0.701
2.314GluArg: 2.314 ± 0.906
2.314GluSer: 2.314 ± 1.097
4.242GluThr: 4.242 ± 1.355
3.857GluVal: 3.857 ± 0.986
0.386GluTrp: 0.386 ± 0.309
0.771GluTyr: 0.771 ± 0.632
0.0GluXaa: 0.0 ± 0.0
Phe
1.157PheAla: 1.157 ± 0.61
0.771PheCys: 0.771 ± 0.404
2.7PheAsp: 2.7 ± 1.116
2.7PheGlu: 2.7 ± 1.046
1.157PhePhe: 1.157 ± 0.596
3.471PheGly: 3.471 ± 0.88
1.543PheHis: 1.543 ± 0.703
2.314PheIle: 2.314 ± 0.818
1.157PheLys: 1.157 ± 0.616
3.471PheLeu: 3.471 ± 1.402
0.0PheMet: 0.0 ± 0.0
2.314PheAsn: 2.314 ± 0.903
1.543PhePro: 1.543 ± 0.526
2.314PheGln: 2.314 ± 0.951
0.386PheArg: 0.386 ± 0.338
1.543PheSer: 1.543 ± 0.304
1.928PheThr: 1.928 ± 0.787
2.314PheVal: 2.314 ± 0.915
1.543PheTrp: 1.543 ± 0.632
1.157PheTyr: 1.157 ± 0.323
0.0PheXaa: 0.0 ± 0.0
Gly
6.942GlyAla: 6.942 ± 1.665
1.928GlyCys: 1.928 ± 0.901
6.556GlyAsp: 6.556 ± 1.883
5.399GlyGlu: 5.399 ± 1.355
2.314GlyPhe: 2.314 ± 0.659
4.628GlyGly: 4.628 ± 1.508
0.771GlyHis: 0.771 ± 0.676
3.857GlyIle: 3.857 ± 1.184
3.085GlyLys: 3.085 ± 1.92
6.942GlyLeu: 6.942 ± 1.435
0.386GlyMet: 0.386 ± 0.309
2.314GlyAsn: 2.314 ± 0.878
6.17GlyPro: 6.17 ± 2.14
4.242GlyGln: 4.242 ± 0.963
10.027GlyArg: 10.027 ± 1.526
5.785GlySer: 5.785 ± 2.05
2.314GlyThr: 2.314 ± 0.904
4.628GlyVal: 4.628 ± 0.668
1.157GlyTrp: 1.157 ± 1.107
2.314GlyTyr: 2.314 ± 0.915
0.0GlyXaa: 0.0 ± 0.0
His
0.771HisAla: 0.771 ± 0.618
0.386HisCys: 0.386 ± 0.309
2.314HisAsp: 2.314 ± 1.193
0.771HisGlu: 0.771 ± 0.445
2.314HisPhe: 2.314 ± 0.661
1.157HisGly: 1.157 ± 0.731
0.386HisHis: 0.386 ± 0.318
0.386HisIle: 0.386 ± 0.338
0.0HisLys: 0.0 ± 0.0
0.386HisLeu: 0.386 ± 0.549
0.0HisMet: 0.0 ± 0.0
0.386HisAsn: 0.386 ± 0.338
2.314HisPro: 2.314 ± 0.415
1.157HisGln: 1.157 ± 0.805
1.543HisArg: 1.543 ± 0.685
0.771HisSer: 0.771 ± 0.385
2.314HisThr: 2.314 ± 0.642
2.7HisVal: 2.7 ± 0.488
0.771HisTrp: 0.771 ± 0.463
1.157HisTyr: 1.157 ± 0.596
0.0HisXaa: 0.0 ± 0.0
Ile
3.857IleAla: 3.857 ± 1.317
0.386IleCys: 0.386 ± 0.549
2.7IleAsp: 2.7 ± 1.255
2.314IleGlu: 2.314 ± 0.608
0.771IlePhe: 0.771 ± 0.385
4.242IleGly: 4.242 ± 1.443
0.0IleHis: 0.0 ± 0.0
0.386IleIle: 0.386 ± 0.318
0.386IleLys: 0.386 ± 0.309
4.242IleLeu: 4.242 ± 1.099
0.771IleMet: 0.771 ± 0.615
0.386IleAsn: 0.386 ± 0.318
3.857IlePro: 3.857 ± 1.085
1.543IleGln: 1.543 ± 0.652
1.928IleArg: 1.928 ± 0.969
4.242IleSer: 4.242 ± 1.182
2.7IleThr: 2.7 ± 0.793
3.085IleVal: 3.085 ± 1.285
0.771IleTrp: 0.771 ± 0.385
0.386IleTyr: 0.386 ± 0.309
0.0IleXaa: 0.0 ± 0.0
Lys
0.771LysAla: 0.771 ± 0.618
1.543LysCys: 1.543 ± 1.309
2.314LysAsp: 2.314 ± 0.603
1.157LysGlu: 1.157 ± 0.616
0.386LysPhe: 0.386 ± 0.309
1.928LysGly: 1.928 ± 0.734
1.157LysHis: 1.157 ± 0.928
0.386LysIle: 0.386 ± 0.309
0.771LysLys: 0.771 ± 0.471
1.928LysLeu: 1.928 ± 0.699
0.386LysMet: 0.386 ± 0.309
1.543LysAsn: 1.543 ± 0.655
0.771LysPro: 0.771 ± 0.592
1.543LysGln: 1.543 ± 0.551
3.471LysArg: 3.471 ± 0.744
2.7LysSer: 2.7 ± 1.428
3.857LysThr: 3.857 ± 1.342
1.928LysVal: 1.928 ± 0.681
0.386LysTrp: 0.386 ± 0.372
1.543LysTyr: 1.543 ± 0.732
0.0LysXaa: 0.0 ± 0.0
Leu
5.399LeuAla: 5.399 ± 1.146
1.543LeuCys: 1.543 ± 0.747
5.013LeuAsp: 5.013 ± 2.086
2.7LeuGlu: 2.7 ± 1.261
4.242LeuPhe: 4.242 ± 0.828
8.099LeuGly: 8.099 ± 2.91
2.314LeuHis: 2.314 ± 1.028
2.7LeuIle: 2.7 ± 0.784
3.857LeuLys: 3.857 ± 1.007
6.942LeuLeu: 6.942 ± 1.476
1.543LeuMet: 1.543 ± 1.09
1.157LeuAsn: 1.157 ± 0.589
6.556LeuPro: 6.556 ± 2.432
5.399LeuGln: 5.399 ± 1.415
5.785LeuArg: 5.785 ± 1.768
6.556LeuSer: 6.556 ± 2.213
6.556LeuThr: 6.556 ± 1.229
5.013LeuVal: 5.013 ± 1.164
0.771LeuTrp: 0.771 ± 0.404
4.242LeuTyr: 4.242 ± 1.646
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.928MetAsp: 1.928 ± 0.793
1.928MetGlu: 1.928 ± 0.617
0.386MetPhe: 0.386 ± 0.309
0.386MetGly: 0.386 ± 0.318
0.0MetHis: 0.0 ± 0.0
0.771MetIle: 0.771 ± 0.592
0.386MetLys: 0.386 ± 0.338
1.928MetLeu: 1.928 ± 1.05
0.0MetMet: 0.0 ± 0.0
0.386MetAsn: 0.386 ± 0.309
1.543MetPro: 1.543 ± 1.183
1.157MetGln: 1.157 ± 0.676
0.771MetArg: 0.771 ± 0.452
2.314MetSer: 2.314 ± 0.971
1.157MetThr: 1.157 ± 0.806
0.386MetVal: 0.386 ± 0.309
0.386MetTrp: 0.386 ± 0.549
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.242AsnAla: 4.242 ± 0.672
0.386AsnCys: 0.386 ± 0.574
1.928AsnAsp: 1.928 ± 0.809
1.543AsnGlu: 1.543 ± 0.626
0.0AsnPhe: 0.0 ± 0.0
0.771AsnGly: 0.771 ± 0.676
0.0AsnHis: 0.0 ± 0.0
1.157AsnIle: 1.157 ± 0.633
0.771AsnLys: 0.771 ± 0.366
1.928AsnLeu: 1.928 ± 0.666
0.386AsnMet: 0.386 ± 0.338
1.543AsnAsn: 1.543 ± 0.732
2.314AsnPro: 2.314 ± 1.109
1.928AsnGln: 1.928 ± 0.977
3.857AsnArg: 3.857 ± 0.614
1.543AsnSer: 1.543 ± 0.714
0.771AsnThr: 0.771 ± 0.601
2.314AsnVal: 2.314 ± 0.486
0.0AsnTrp: 0.0 ± 0.0
0.771AsnTyr: 0.771 ± 0.366
0.0AsnXaa: 0.0 ± 0.0
Pro
6.17ProAla: 6.17 ± 1.092
1.157ProCys: 1.157 ± 0.587
6.556ProAsp: 6.556 ± 0.748
5.399ProGlu: 5.399 ± 2.589
1.543ProPhe: 1.543 ± 1.273
5.399ProGly: 5.399 ± 1.076
1.157ProHis: 1.157 ± 0.673
2.7ProIle: 2.7 ± 0.788
1.543ProLys: 1.543 ± 0.732
7.327ProLeu: 7.327 ± 1.814
1.157ProMet: 1.157 ± 0.596
2.7ProAsn: 2.7 ± 0.838
7.327ProPro: 7.327 ± 1.648
2.314ProGln: 2.314 ± 1.008
5.013ProArg: 5.013 ± 1.236
5.399ProSer: 5.399 ± 1.642
5.399ProThr: 5.399 ± 1.854
5.785ProVal: 5.785 ± 1.655
1.157ProTrp: 1.157 ± 0.621
5.785ProTyr: 5.785 ± 2.077
0.0ProXaa: 0.0 ± 0.0
Gln
5.399GlnAla: 5.399 ± 1.052
0.0GlnCys: 0.0 ± 0.0
1.157GlnAsp: 1.157 ± 0.806
1.543GlnGlu: 1.543 ± 0.737
1.543GlnPhe: 1.543 ± 0.501
1.157GlnGly: 1.157 ± 0.685
1.157GlnHis: 1.157 ± 0.587
1.157GlnIle: 1.157 ± 0.69
1.928GlnLys: 1.928 ± 0.856
6.556GlnLeu: 6.556 ± 0.623
2.314GlnMet: 2.314 ± 0.66
0.386GlnAsn: 0.386 ± 0.338
3.857GlnPro: 3.857 ± 1.022
1.157GlnGln: 1.157 ± 0.756
3.471GlnArg: 3.471 ± 1.485
1.157GlnSer: 1.157 ± 0.75
1.928GlnThr: 1.928 ± 0.681
1.928GlnVal: 1.928 ± 0.999
1.543GlnTrp: 1.543 ± 0.655
0.771GlnTyr: 0.771 ± 0.463
0.0GlnXaa: 0.0 ± 0.0
Arg
3.471ArgAla: 3.471 ± 1.27
1.928ArgCys: 1.928 ± 0.742
3.471ArgAsp: 3.471 ± 0.862
1.928ArgGlu: 1.928 ± 0.439
2.314ArgPhe: 2.314 ± 0.795
9.256ArgGly: 9.256 ± 2.455
3.857ArgHis: 3.857 ± 1.185
2.314ArgIle: 2.314 ± 0.915
1.543ArgLys: 1.543 ± 0.68
8.099ArgLeu: 8.099 ± 2.285
1.928ArgMet: 1.928 ± 1.053
1.157ArgAsn: 1.157 ± 0.587
6.556ArgPro: 6.556 ± 1.629
3.471ArgGln: 3.471 ± 1.142
10.413ArgArg: 10.413 ± 2.061
5.013ArgSer: 5.013 ± 1.188
3.471ArgThr: 3.471 ± 0.858
6.942ArgVal: 6.942 ± 1.917
0.386ArgTrp: 0.386 ± 0.338
2.314ArgTyr: 2.314 ± 0.935
0.0ArgXaa: 0.0 ± 0.0
Ser
2.314SerAla: 2.314 ± 0.946
1.157SerCys: 1.157 ± 0.651
5.013SerAsp: 5.013 ± 1.678
3.471SerGlu: 3.471 ± 1.769
2.7SerPhe: 2.7 ± 0.663
6.942SerGly: 6.942 ± 0.853
0.771SerHis: 0.771 ± 0.404
1.543SerIle: 1.543 ± 0.517
2.314SerLys: 2.314 ± 0.895
5.785SerLeu: 5.785 ± 1.238
1.157SerMet: 1.157 ± 0.769
3.085SerAsn: 3.085 ± 1.768
5.013SerPro: 5.013 ± 1.255
3.085SerGln: 3.085 ± 0.695
3.085SerArg: 3.085 ± 1.005
4.628SerSer: 4.628 ± 1.586
11.184SerThr: 11.184 ± 1.924
5.399SerVal: 5.399 ± 1.51
0.0SerTrp: 0.0 ± 0.0
1.543SerTyr: 1.543 ± 0.524
0.0SerXaa: 0.0 ± 0.0
Thr
4.242ThrAla: 4.242 ± 0.7
1.543ThrCys: 1.543 ± 1.234
5.013ThrAsp: 5.013 ± 1.17
3.085ThrGlu: 3.085 ± 0.73
3.085ThrPhe: 3.085 ± 0.518
6.942ThrGly: 6.942 ± 1.389
1.543ThrHis: 1.543 ± 0.95
4.628ThrIle: 4.628 ± 1.098
0.0ThrLys: 0.0 ± 0.0
3.857ThrLeu: 3.857 ± 1.027
0.0ThrMet: 0.0 ± 0.0
1.543ThrAsn: 1.543 ± 0.622
4.242ThrPro: 4.242 ± 1.232
2.7ThrGln: 2.7 ± 0.754
5.399ThrArg: 5.399 ± 1.636
5.013ThrSer: 5.013 ± 0.669
8.484ThrThr: 8.484 ± 1.849
6.556ThrVal: 6.556 ± 1.62
1.543ThrTrp: 1.543 ± 1.105
1.543ThrTyr: 1.543 ± 0.925
0.0ThrXaa: 0.0 ± 0.0
Val
1.543ValAla: 1.543 ± 1.062
1.157ValCys: 1.157 ± 0.483
3.857ValAsp: 3.857 ± 0.722
3.471ValGlu: 3.471 ± 0.897
3.085ValPhe: 3.085 ± 1.131
6.17ValGly: 6.17 ± 0.955
1.543ValHis: 1.543 ± 0.875
3.085ValIle: 3.085 ± 0.752
1.543ValLys: 1.543 ± 0.888
6.556ValLeu: 6.556 ± 1.016
0.771ValMet: 0.771 ± 1.147
1.543ValAsn: 1.543 ± 0.963
10.027ValPro: 10.027 ± 2.142
0.771ValGln: 0.771 ± 0.636
6.942ValArg: 6.942 ± 1.514
5.013ValSer: 5.013 ± 1.048
4.242ValThr: 4.242 ± 0.716
3.857ValVal: 3.857 ± 0.983
0.386ValTrp: 0.386 ± 0.338
2.7ValTyr: 2.7 ± 0.683
0.0ValXaa: 0.0 ± 0.0
Trp
0.771TrpAla: 0.771 ± 0.471
0.386TrpCys: 0.386 ± 0.309
1.157TrpAsp: 1.157 ± 0.323
0.386TrpGlu: 0.386 ± 0.318
0.386TrpPhe: 0.386 ± 0.318
1.543TrpGly: 1.543 ± 0.963
0.386TrpHis: 0.386 ± 0.372
0.0TrpIle: 0.0 ± 0.0
1.157TrpLys: 1.157 ± 0.616
2.7TrpLeu: 2.7 ± 0.908
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.386TrpPro: 0.386 ± 0.318
1.543TrpGln: 1.543 ± 0.699
1.157TrpArg: 1.157 ± 0.756
0.386TrpSer: 0.386 ± 0.372
1.157TrpThr: 1.157 ± 1.101
1.157TrpVal: 1.157 ± 0.519
0.386TrpTrp: 0.386 ± 0.318
0.386TrpTyr: 0.386 ± 0.372
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.771TyrAla: 0.771 ± 0.404
2.314TyrCys: 2.314 ± 0.781
1.157TyrAsp: 1.157 ± 0.399
0.771TyrGlu: 0.771 ± 0.364
0.771TyrPhe: 0.771 ± 0.632
2.314TyrGly: 2.314 ± 0.608
1.543TyrHis: 1.543 ± 0.737
1.543TyrIle: 1.543 ± 0.768
1.928TyrLys: 1.928 ± 0.608
2.314TyrLeu: 2.314 ± 0.792
0.0TyrMet: 0.0 ± 0.0
1.543TyrAsn: 1.543 ± 0.635
2.7TyrPro: 2.7 ± 1.289
0.0TyrGln: 0.0 ± 0.0
2.314TyrArg: 2.314 ± 1.267
2.314TyrSer: 2.314 ± 0.705
1.928TyrThr: 1.928 ± 0.741
2.7TyrVal: 2.7 ± 1.028
1.543TyrTrp: 1.543 ± 0.304
1.543TyrTyr: 1.543 ± 1.115
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2594 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski