Amino acid dipeptide frequency for Caretta caretta papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
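As an illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the database's own code: the function name `dipeptide_permille`, the toy sequences, the use of one-letter codes (with `X` standing in for Xaa), and the choice to count overlapping pairs within each protein separately are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Sliding window of width 2; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the proteome described here.
toy = ["MAAPL", "AAPU"]
freqs = dipeptide_permille(toy)
```

With these toy sequences there are seven overlapping pairs in total, so the frequencies sum to 1000‰, and the non-standard `U` is counted under `X`.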

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present with a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
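The comparison against the uniform expectation, together with the per mille conversion, can be sketched in Python as follows. The names `EXPECTED_PERMILLE`, `permille_to_fraction` and `classify` are hypothetical, and using exactly 1/400 as the cut-off between overrepresented and underrepresented dipeptides is an assumption; the exact threshold behind the table's colouring is not stated here. The four sample values are copied from the table.

```python
N_STANDARD = 20
# Uniform expectation over 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)

def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"

# A few values from the table, in per mille.
sample = {"ProPro": 11.54, "AlaAla": 6.214, "TrpTrp": 0.0, "AlaCys": 0.888}
labels = {pair: classify(value) for pair, value in sample.items()}
```

Under this assumed cut-off, ProPro and AlaAla come out as overrepresented, while TrpTrp and AlaCys come out as underrepresented.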

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.214AlaAla: 6.214 ± 1.752
0.888AlaCys: 0.888 ± 0.899
4.439AlaAsp: 4.439 ± 1.327
3.995AlaGlu: 3.995 ± 0.701
3.551AlaPhe: 3.551 ± 0.736
5.326AlaGly: 5.326 ± 1.776
0.888AlaHis: 0.888 ± 0.83
1.332AlaIle: 1.332 ± 0.468
3.551AlaLys: 3.551 ± 1.326
7.102AlaLeu: 7.102 ± 3.248
1.332AlaMet: 1.332 ± 0.878
0.888AlaAsn: 0.888 ± 0.428
3.107AlaPro: 3.107 ± 0.94
1.775AlaGln: 1.775 ± 0.614
3.107AlaArg: 3.107 ± 1.112
5.326AlaSer: 5.326 ± 1.739
4.882AlaThr: 4.882 ± 2.261
5.77AlaVal: 5.77 ± 1.067
0.444AlaTrp: 0.444 ± 0.359
2.219AlaTyr: 2.219 ± 0.463
0.0AlaXaa: 0.0 ± 0.0
Cys
1.332CysAla: 1.332 ± 0.859
1.332CysCys: 1.332 ± 0.893
0.444CysAsp: 0.444 ± 0.359
1.775CysGlu: 1.775 ± 1.15
1.332CysPhe: 1.332 ± 0.791
0.888CysGly: 0.888 ± 0.64
0.0CysHis: 0.0 ± 0.0
0.888CysIle: 0.888 ± 0.644
2.663CysLys: 2.663 ± 1.549
1.332CysLeu: 1.332 ± 1.076
1.332CysMet: 1.332 ± 0.859
0.0CysAsn: 0.0 ± 0.0
1.775CysPro: 1.775 ± 0.649
0.0CysGln: 0.0 ± 0.0
2.219CysArg: 2.219 ± 1.507
1.332CysSer: 1.332 ± 0.693
0.888CysThr: 0.888 ± 0.489
2.219CysVal: 2.219 ± 0.832
0.888CysTrp: 0.888 ± 0.489
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.77AspAla: 5.77 ± 1.02
1.775AspCys: 1.775 ± 0.596
3.551AspAsp: 3.551 ± 0.963
3.551AspGlu: 3.551 ± 0.724
1.332AspPhe: 1.332 ± 0.775
4.439AspGly: 4.439 ± 1.419
0.444AspHis: 0.444 ± 0.359
4.439AspIle: 4.439 ± 1.413
1.332AspLys: 1.332 ± 0.913
4.882AspLeu: 4.882 ± 1.134
1.775AspMet: 1.775 ± 1.469
3.551AspAsn: 3.551 ± 1.116
4.882AspPro: 4.882 ± 1.431
1.332AspGln: 1.332 ± 0.45
1.332AspArg: 1.332 ± 0.787
4.882AspSer: 4.882 ± 1.604
2.663AspThr: 2.663 ± 0.906
4.439AspVal: 4.439 ± 1.206
1.332AspTrp: 1.332 ± 0.787
2.663AspTyr: 2.663 ± 0.768
0.0AspXaa: 0.0 ± 0.0
Glu
4.439GluAla: 4.439 ± 1.257
0.888GluCys: 0.888 ± 0.717
4.439GluAsp: 4.439 ± 1.625
3.995GluGlu: 3.995 ± 1.499
1.775GluPhe: 1.775 ± 0.929
3.995GluGly: 3.995 ± 1.673
1.775GluHis: 1.775 ± 0.682
1.775GluIle: 1.775 ± 0.652
2.663GluLys: 2.663 ± 1.08
3.995GluLeu: 3.995 ± 1.826
1.332GluMet: 1.332 ± 0.468
1.332GluAsn: 1.332 ± 0.794
2.219GluPro: 2.219 ± 0.589
1.775GluGln: 1.775 ± 0.896
2.663GluArg: 2.663 ± 1.301
3.995GluSer: 3.995 ± 1.348
4.439GluThr: 4.439 ± 0.901
4.882GluVal: 4.882 ± 1.552
0.0GluTrp: 0.0 ± 0.0
2.219GluTyr: 2.219 ± 0.801
0.0GluXaa: 0.0 ± 0.0
Phe
4.882PheAla: 4.882 ± 1.477
0.444PheCys: 0.444 ± 0.359
3.107PheAsp: 3.107 ± 1.153
2.663PheGlu: 2.663 ± 1.54
2.663PhePhe: 2.663 ± 0.877
2.219PheGly: 2.219 ± 1.022
0.444PheHis: 0.444 ± 0.367
1.332PheIle: 1.332 ± 0.388
3.551PheLys: 3.551 ± 1.139
4.439PheLeu: 4.439 ± 1.151
1.332PheMet: 1.332 ± 0.658
2.663PheAsn: 2.663 ± 1.119
2.663PhePro: 2.663 ± 1.006
1.775PheGln: 1.775 ± 0.649
1.775PheArg: 1.775 ± 0.855
3.551PheSer: 3.551 ± 1.569
2.219PheThr: 2.219 ± 1.155
1.775PheVal: 1.775 ± 0.649
0.888PheTrp: 0.888 ± 0.489
0.888PheTyr: 0.888 ± 0.577
0.0PheXaa: 0.0 ± 0.0
Gly
2.663GlyAla: 2.663 ± 0.861
1.775GlyCys: 1.775 ± 0.596
3.107GlyAsp: 3.107 ± 0.646
3.995GlyGlu: 3.995 ± 1.851
1.332GlyPhe: 1.332 ± 0.794
7.545GlyGly: 7.545 ± 3.444
0.0GlyHis: 0.0 ± 0.0
2.219GlyIle: 2.219 ± 0.732
3.995GlyLys: 3.995 ± 1.37
4.439GlyLeu: 4.439 ± 1.899
2.663GlyMet: 2.663 ± 0.902
5.77GlyAsn: 5.77 ± 1.505
2.663GlyPro: 2.663 ± 1.386
2.663GlyGln: 2.663 ± 0.852
5.326GlyArg: 5.326 ± 1.035
3.995GlySer: 3.995 ± 1.127
5.77GlyThr: 5.77 ± 1.549
4.882GlyVal: 4.882 ± 1.539
1.332GlyTrp: 1.332 ± 0.614
1.332GlyTyr: 1.332 ± 0.954
0.0GlyXaa: 0.0 ± 0.0
His
1.332HisAla: 1.332 ± 1.113
1.332HisCys: 1.332 ± 0.388
0.444HisAsp: 0.444 ± 0.415
0.444HisGlu: 0.444 ± 0.359
0.888HisPhe: 0.888 ± 0.717
0.888HisGly: 0.888 ± 0.489
0.444HisHis: 0.444 ± 0.367
1.775HisIle: 1.775 ± 1.295
0.0HisLys: 0.0 ± 0.0
2.663HisLeu: 2.663 ± 1.276
0.0HisMet: 0.0 ± 0.0
2.219HisAsn: 2.219 ± 1.031
1.775HisPro: 1.775 ± 1.136
0.0HisGln: 0.0 ± 0.0
0.888HisArg: 0.888 ± 0.647
0.444HisSer: 0.444 ± 0.367
2.663HisThr: 2.663 ± 1.558
1.775HisVal: 1.775 ± 1.041
0.888HisTrp: 0.888 ± 0.428
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.775IleAla: 1.775 ± 0.682
0.888IleCys: 0.888 ± 0.644
1.775IleAsp: 1.775 ± 0.701
3.107IleGlu: 3.107 ± 0.871
1.775IlePhe: 1.775 ± 0.647
3.551IleGly: 3.551 ± 0.793
0.444IleHis: 0.444 ± 0.423
2.663IleIle: 2.663 ± 1.732
3.107IleLys: 3.107 ± 0.659
3.995IleLeu: 3.995 ± 0.619
1.332IleMet: 1.332 ± 0.954
0.444IleAsn: 0.444 ± 0.367
3.551IlePro: 3.551 ± 1.295
2.219IleGln: 2.219 ± 0.69
2.663IleArg: 2.663 ± 1.002
3.995IleSer: 3.995 ± 0.963
3.107IleThr: 3.107 ± 0.703
2.219IleVal: 2.219 ± 1.078
0.444IleTrp: 0.444 ± 0.423
0.888IleTyr: 0.888 ± 0.83
0.0IleXaa: 0.0 ± 0.0
Lys
3.107LysAla: 3.107 ± 1.058
1.775LysCys: 1.775 ± 0.596
4.439LysAsp: 4.439 ± 1.517
2.219LysGlu: 2.219 ± 0.795
0.888LysPhe: 0.888 ± 0.734
4.882LysGly: 4.882 ± 1.537
0.444LysHis: 0.444 ± 0.415
0.444LysIle: 0.444 ± 0.359
4.882LysLys: 4.882 ± 2.492
3.107LysLeu: 3.107 ± 1.428
0.888LysMet: 0.888 ± 0.428
1.332LysAsn: 1.332 ± 0.718
3.107LysPro: 3.107 ± 1.119
3.995LysGln: 3.995 ± 1.212
3.995LysArg: 3.995 ± 1.097
3.995LysSer: 3.995 ± 1.419
2.219LysThr: 2.219 ± 0.827
2.663LysVal: 2.663 ± 0.588
0.888LysTrp: 0.888 ± 0.577
3.995LysTyr: 3.995 ± 1.216
0.0LysXaa: 0.0 ± 0.0
Leu
3.995LeuAla: 3.995 ± 2.236
2.663LeuCys: 2.663 ± 1.031
4.439LeuAsp: 4.439 ± 0.944
6.214LeuGlu: 6.214 ± 1.568
6.214LeuPhe: 6.214 ± 1.509
4.882LeuGly: 4.882 ± 1.334
3.995LeuHis: 3.995 ± 2.271
4.439LeuIle: 4.439 ± 1.764
4.439LeuLys: 4.439 ± 1.142
4.882LeuLeu: 4.882 ± 1.748
0.888LeuMet: 0.888 ± 0.757
3.551LeuAsn: 3.551 ± 1.369
3.995LeuPro: 3.995 ± 1.031
4.882LeuGln: 4.882 ± 1.692
2.219LeuArg: 2.219 ± 1.359
6.658LeuSer: 6.658 ± 1.511
5.77LeuThr: 5.77 ± 0.929
2.663LeuVal: 2.663 ± 1.201
0.444LeuTrp: 0.444 ± 0.359
2.219LeuTyr: 2.219 ± 0.762
0.0LeuXaa: 0.0 ± 0.0
Met
1.332MetAla: 1.332 ± 0.775
0.888MetCys: 0.888 ± 0.644
1.775MetAsp: 1.775 ± 0.677
1.332MetGlu: 1.332 ± 0.484
1.775MetPhe: 1.775 ± 1.031
0.888MetGly: 0.888 ± 0.734
0.444MetHis: 0.444 ± 0.582
1.775MetIle: 1.775 ± 0.714
0.444MetLys: 0.444 ± 0.367
2.663MetLeu: 2.663 ± 1.43
0.444MetMet: 0.444 ± 0.423
0.888MetAsn: 0.888 ± 0.644
0.888MetPro: 0.888 ± 0.819
1.332MetGln: 1.332 ± 0.913
0.444MetArg: 0.444 ± 0.359
2.663MetSer: 2.663 ± 0.926
0.888MetThr: 0.888 ± 0.464
1.775MetVal: 1.775 ± 0.682
0.0MetTrp: 0.0 ± 0.0
0.444MetTyr: 0.444 ± 0.367
0.0MetXaa: 0.0 ± 0.0
Asn
5.326AsnAla: 5.326 ± 1.811
0.444AsnCys: 0.444 ± 0.582
0.888AsnAsp: 0.888 ± 0.452
2.219AsnGlu: 2.219 ± 0.853
0.888AsnPhe: 0.888 ± 0.428
2.663AsnGly: 2.663 ± 0.622
0.888AsnHis: 0.888 ± 0.734
1.332AsnIle: 1.332 ± 0.388
2.219AsnLys: 2.219 ± 1.172
1.332AsnLeu: 1.332 ± 0.484
0.888AsnMet: 0.888 ± 0.489
1.775AsnAsn: 1.775 ± 0.692
4.439AsnPro: 4.439 ± 1.178
1.332AsnGln: 1.332 ± 0.872
1.332AsnArg: 1.332 ± 0.388
3.551AsnSer: 3.551 ± 0.828
3.107AsnThr: 3.107 ± 0.94
2.219AsnVal: 2.219 ± 0.589
0.888AsnTrp: 0.888 ± 0.428
0.888AsnTyr: 0.888 ± 0.489
0.0AsnXaa: 0.0 ± 0.0
Pro
4.439ProAla: 4.439 ± 0.869
0.888ProCys: 0.888 ± 0.713
5.77ProAsp: 5.77 ± 1.366
2.219ProGlu: 2.219 ± 0.733
3.107ProPhe: 3.107 ± 0.709
3.995ProGly: 3.995 ± 1.054
0.888ProHis: 0.888 ± 1.155
6.658ProIle: 6.658 ± 3.211
2.663ProLys: 2.663 ± 0.944
5.326ProLeu: 5.326 ± 1.525
0.444ProMet: 0.444 ± 0.423
2.663ProAsn: 2.663 ± 0.62
11.54ProPro: 11.54 ± 3.146
2.663ProGln: 2.663 ± 0.506
4.439ProArg: 4.439 ± 2.166
5.77ProSer: 5.77 ± 1.666
4.882ProThr: 4.882 ± 1.802
6.214ProVal: 6.214 ± 1.633
0.888ProTrp: 0.888 ± 0.452
1.775ProTyr: 1.775 ± 1.031
0.0ProXaa: 0.0 ± 0.0
Gln
3.107GlnAla: 3.107 ± 1.41
0.444GlnCys: 0.444 ± 0.578
1.775GlnAsp: 1.775 ± 0.614
1.775GlnGlu: 1.775 ± 1.029
3.107GlnPhe: 3.107 ± 0.725
1.332GlnGly: 1.332 ± 0.468
1.775GlnHis: 1.775 ± 1.007
2.663GlnIle: 2.663 ± 0.877
1.332GlnLys: 1.332 ± 0.649
4.439GlnLeu: 4.439 ± 1.293
0.888GlnMet: 0.888 ± 0.428
1.775GlnAsn: 1.775 ± 0.781
2.219GlnPro: 2.219 ± 0.79
2.219GlnGln: 2.219 ± 1.151
2.219GlnArg: 2.219 ± 0.988
3.107GlnSer: 3.107 ± 1.449
0.888GlnThr: 0.888 ± 0.489
1.332GlnVal: 1.332 ± 0.681
1.332GlnTrp: 1.332 ± 0.804
1.332GlnTyr: 1.332 ± 0.45
0.0GlnXaa: 0.0 ± 0.0
Arg
1.775ArgAla: 1.775 ± 1.314
2.219ArgCys: 2.219 ± 1.407
2.663ArgAsp: 2.663 ± 1.472
3.107ArgGlu: 3.107 ± 0.646
2.663ArgPhe: 2.663 ± 0.776
3.107ArgGly: 3.107 ± 1.481
2.219ArgHis: 2.219 ± 0.839
2.219ArgIle: 2.219 ± 1.169
4.882ArgLys: 4.882 ± 0.847
6.658ArgLeu: 6.658 ± 1.567
0.0ArgMet: 0.0 ± 0.333
1.775ArgAsn: 1.775 ± 0.714
4.439ArgPro: 4.439 ± 1.513
0.888ArgGln: 0.888 ± 0.717
7.102ArgArg: 7.102 ± 1.154
3.995ArgSer: 3.995 ± 1.448
1.775ArgThr: 1.775 ± 0.781
4.882ArgVal: 4.882 ± 1.764
0.0ArgTrp: 0.0 ± 0.0
0.444ArgTyr: 0.444 ± 0.68
0.0ArgXaa: 0.0 ± 0.0
Ser
4.439SerAla: 4.439 ± 0.878
0.888SerCys: 0.888 ± 0.899
5.326SerAsp: 5.326 ± 1.875
3.995SerGlu: 3.995 ± 2.013
5.326SerPhe: 5.326 ± 1.124
7.102SerGly: 7.102 ± 1.907
1.332SerHis: 1.332 ± 0.572
2.663SerIle: 2.663 ± 0.966
4.439SerLys: 4.439 ± 1.654
7.102SerLeu: 7.102 ± 1.057
0.888SerMet: 0.888 ± 0.489
3.551SerAsn: 3.551 ± 1.567
6.658SerPro: 6.658 ± 3.69
2.663SerGln: 2.663 ± 1.101
4.882SerArg: 4.882 ± 1.464
3.551SerSer: 3.551 ± 0.775
9.321SerThr: 9.321 ± 2.063
1.332SerVal: 1.332 ± 0.388
0.444SerTrp: 0.444 ± 0.367
1.332SerTyr: 1.332 ± 0.679
0.0SerXaa: 0.0 ± 0.0
Thr
3.995ThrAla: 3.995 ± 1.086
1.332ThrCys: 1.332 ± 0.775
4.882ThrAsp: 4.882 ± 2.13
2.663ThrGlu: 2.663 ± 0.877
0.888ThrPhe: 0.888 ± 0.666
2.663ThrGly: 2.663 ± 1.197
2.663ThrHis: 2.663 ± 1.469
3.107ThrIle: 3.107 ± 0.955
3.107ThrLys: 3.107 ± 0.882
4.439ThrLeu: 4.439 ± 0.805
1.775ThrMet: 1.775 ± 1.007
0.888ThrAsn: 0.888 ± 0.387
9.321ThrPro: 9.321 ± 2.51
0.444ThrGln: 0.444 ± 0.359
6.214ThrArg: 6.214 ± 2.284
6.214ThrSer: 6.214 ± 1.49
4.439ThrThr: 4.439 ± 1.374
6.658ThrVal: 6.658 ± 1.468
0.444ThrTrp: 0.444 ± 0.415
2.219ThrTyr: 2.219 ± 1.374
0.0ThrXaa: 0.0 ± 0.0
Val
2.663ValAla: 2.663 ± 1.295
0.888ValCys: 0.888 ± 0.64
4.882ValAsp: 4.882 ± 1.108
3.995ValGlu: 3.995 ± 0.741
3.551ValPhe: 3.551 ± 0.619
5.326ValGly: 5.326 ± 1.277
0.888ValHis: 0.888 ± 0.387
0.888ValIle: 0.888 ± 0.452
0.888ValLys: 0.888 ± 0.464
3.551ValLeu: 3.551 ± 1.193
1.775ValMet: 1.775 ± 0.62
2.663ValAsn: 2.663 ± 1.018
5.77ValPro: 5.77 ± 0.894
3.551ValGln: 3.551 ± 1.165
2.219ValArg: 2.219 ± 0.843
6.214ValSer: 6.214 ± 2.271
7.102ValThr: 7.102 ± 1.798
2.219ValVal: 2.219 ± 1.394
0.888ValTrp: 0.888 ± 0.734
3.107ValTyr: 3.107 ± 0.709
0.0ValXaa: 0.0 ± 0.0
Trp
1.332TrpAla: 1.332 ± 0.388
0.0TrpCys: 0.0 ± 0.0
0.888TrpAsp: 0.888 ± 0.734
0.444TrpGlu: 0.444 ± 0.415
0.888TrpPhe: 0.888 ± 0.428
0.888TrpGly: 0.888 ± 0.734
0.888TrpHis: 0.888 ± 0.428
0.888TrpIle: 0.888 ± 0.464
1.332TrpLys: 1.332 ± 0.718
1.775TrpLeu: 1.775 ± 0.979
0.444TrpMet: 0.444 ± 0.423
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.888TrpGln: 0.888 ± 0.387
0.888TrpArg: 0.888 ± 0.577
0.444TrpSer: 0.444 ± 0.367
0.444TrpThr: 0.444 ± 0.367
0.444TrpVal: 0.444 ± 0.359
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.219TyrAla: 2.219 ± 0.783
0.888TyrCys: 0.888 ± 0.713
1.332TyrAsp: 1.332 ± 0.775
0.888TyrGlu: 0.888 ± 0.489
1.332TyrPhe: 1.332 ± 0.45
0.888TyrGly: 0.888 ± 0.428
0.0TyrHis: 0.0 ± 0.0
0.444TyrIle: 0.444 ± 0.359
1.775TyrLys: 1.775 ± 0.979
1.332TyrLeu: 1.332 ± 0.468
2.219TyrMet: 2.219 ± 0.743
0.888TyrAsn: 0.888 ± 0.845
2.219TyrPro: 2.219 ± 0.762
2.663TyrGln: 2.663 ± 1.081
1.332TyrArg: 1.332 ± 0.45
3.551TyrSer: 3.551 ± 1.073
0.888TyrThr: 0.888 ± 0.387
2.663TyrVal: 2.663 ± 1.081
0.444TyrTrp: 0.444 ± 0.367
1.775TyrTyr: 1.775 ± 1.295
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2254 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
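A protein-level bootstrap of this kind can be sketched in Python as below. This is a minimal illustration under stated assumptions, not the procedure actually used for the table: the function names, the toy sequences, the fixed random seed, and reporting the sample standard deviation of the resampled frequencies as the error are all choices made for this example.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample at the protein level: draw len(proteins) proteins with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    # Assumption: the error is summarised as the sample standard deviation.
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from the proteome described here.
toy = ["MAAPL", "AAPKAA", "MKPPLT"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects how much the estimate depends on which proteins are included. With only 7 proteins, as in this proteome, such estimates are necessarily coarse.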

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski