Amino acid dipepetide frequency for Corvus monedula polyomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.047AlaAla: 8.047 ± 2.456
2.146AlaCys: 2.146 ± 1.129
4.292AlaAsp: 4.292 ± 1.007
6.438AlaGlu: 6.438 ± 2.033
0.536AlaPhe: 0.536 ± 0.571
4.828AlaGly: 4.828 ± 1.122
1.609AlaHis: 1.609 ± 0.792
5.901AlaIle: 5.901 ± 1.102
2.146AlaLys: 2.146 ± 1.014
9.657AlaLeu: 9.657 ± 4.437
0.536AlaMet: 0.536 ± 0.571
1.609AlaAsn: 1.609 ± 0.799
7.511AlaPro: 7.511 ± 3.051
2.146AlaGln: 2.146 ± 0.834
4.292AlaArg: 4.292 ± 1.202
4.828AlaSer: 4.828 ± 1.298
7.511AlaThr: 7.511 ± 2.76
5.365AlaVal: 5.365 ± 0.655
0.0AlaTrp: 0.0 ± 0.0
2.146AlaTyr: 2.146 ± 1.063
0.0AlaXaa: 0.0 ± 0.0
Cys
1.609CysAla: 1.609 ± 0.732
0.536CysCys: 0.536 ± 0.394
0.0CysAsp: 0.0 ± 0.0
1.609CysGlu: 1.609 ± 0.799
0.536CysPhe: 0.536 ± 0.394
0.0CysGly: 0.0 ± 0.0
0.536CysHis: 0.536 ± 0.544
0.0CysIle: 0.0 ± 0.0
4.828CysLys: 4.828 ± 2.477
2.682CysLeu: 2.682 ± 1.49
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.609CysPro: 1.609 ± 0.563
0.536CysGln: 0.536 ± 0.394
0.0CysArg: 0.0 ± 0.0
0.536CysSer: 0.536 ± 0.394
2.146CysThr: 2.146 ± 1.188
1.609CysVal: 1.609 ± 1.181
0.536CysTrp: 0.536 ± 0.537
0.536CysTyr: 0.536 ± 0.544
0.0CysXaa: 0.0 ± 0.0
Asp
6.438AspAla: 6.438 ± 2.112
0.536AspCys: 0.536 ± 0.394
1.609AspAsp: 1.609 ± 0.792
4.292AspGlu: 4.292 ± 1.466
3.219AspPhe: 3.219 ± 0.879
3.755AspGly: 3.755 ± 2.238
2.146AspHis: 2.146 ± 1.406
2.682AspIle: 2.682 ± 0.668
2.146AspLys: 2.146 ± 0.733
5.365AspLeu: 5.365 ± 1.626
0.536AspMet: 0.536 ± 0.537
1.073AspAsn: 1.073 ± 0.787
2.682AspPro: 2.682 ± 1.449
0.536AspGln: 0.536 ± 0.394
1.609AspArg: 1.609 ± 0.732
2.682AspSer: 2.682 ± 0.776
2.682AspThr: 2.682 ± 1.49
3.755AspVal: 3.755 ± 1.436
2.146AspTrp: 2.146 ± 1.464
3.755AspTyr: 3.755 ± 0.886
0.0AspXaa: 0.0 ± 0.0
Glu
6.974GluAla: 6.974 ± 2.535
2.146GluCys: 2.146 ± 1.06
4.828GluAsp: 4.828 ± 1.583
5.365GluGlu: 5.365 ± 1.438
1.609GluPhe: 1.609 ± 0.563
4.292GluGly: 4.292 ± 1.21
1.073GluHis: 1.073 ± 0.732
1.609GluIle: 1.609 ± 1.181
3.219GluLys: 3.219 ± 1.099
3.219GluLeu: 3.219 ± 1.321
1.073GluMet: 1.073 ± 0.5
3.219GluAsn: 3.219 ± 0.954
4.828GluPro: 4.828 ± 1.248
3.219GluGln: 3.219 ± 1.011
2.146GluArg: 2.146 ± 1.575
2.682GluSer: 2.682 ± 0.715
8.584GluThr: 8.584 ± 3.132
4.292GluVal: 4.292 ± 1.656
0.0GluTrp: 0.0 ± 0.0
0.536GluTyr: 0.536 ± 0.544
0.0GluXaa: 0.0 ± 0.0
Phe
1.073PheAla: 1.073 ± 0.507
1.073PheCys: 1.073 ± 0.559
2.146PheAsp: 2.146 ± 0.638
2.146PheGlu: 2.146 ± 1.575
1.073PhePhe: 1.073 ± 0.753
1.073PheGly: 1.073 ± 1.075
0.0PheHis: 0.0 ± 0.0
1.073PheIle: 1.073 ± 0.732
1.609PheLys: 1.609 ± 1.181
2.682PheLeu: 2.682 ± 0.898
0.536PheMet: 0.536 ± 0.521
1.073PheAsn: 1.073 ± 0.787
3.755PhePro: 3.755 ± 1.963
1.073PheGln: 1.073 ± 0.777
2.146PheArg: 2.146 ± 0.47
4.828PheSer: 4.828 ± 0.856
2.682PheThr: 2.682 ± 1.064
1.609PheVal: 1.609 ± 0.563
0.0PheTrp: 0.0 ± 0.0
0.536PheTyr: 0.536 ± 0.571
0.0PheXaa: 0.0 ± 0.0
Gly
5.365GlyAla: 5.365 ± 1.844
1.073GlyCys: 1.073 ± 0.787
3.219GlyAsp: 3.219 ± 0.954
2.682GlyGlu: 2.682 ± 0.815
1.073GlyPhe: 1.073 ± 0.658
5.365GlyGly: 5.365 ± 0.872
1.609GlyHis: 1.609 ± 1.181
2.682GlyIle: 2.682 ± 2.007
2.146GlyLys: 2.146 ± 1.06
9.657GlyLeu: 9.657 ± 3.995
2.682GlyMet: 2.682 ± 0.353
2.682GlyAsn: 2.682 ± 0.841
6.974GlyPro: 6.974 ± 1.25
3.755GlyGln: 3.755 ± 1.885
4.292GlyArg: 4.292 ± 2.179
3.219GlySer: 3.219 ± 1.489
3.755GlyThr: 3.755 ± 0.668
3.755GlyVal: 3.755 ± 1.848
1.073GlyTrp: 1.073 ± 0.732
2.146GlyTyr: 2.146 ± 0.565
0.0GlyXaa: 0.0 ± 0.0
His
1.073HisAla: 1.073 ± 0.559
1.073HisCys: 1.073 ± 0.787
0.536HisAsp: 0.536 ± 0.593
0.0HisGlu: 0.0 ± 0.0
1.609HisPhe: 1.609 ± 0.63
1.609HisGly: 1.609 ± 0.887
0.536HisHis: 0.536 ± 0.394
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
3.755HisLeu: 3.755 ± 1.16
0.536HisMet: 0.536 ± 0.394
0.0HisAsn: 0.0 ± 0.0
1.073HisPro: 1.073 ± 0.559
2.146HisGln: 2.146 ± 0.638
1.609HisArg: 1.609 ± 0.792
1.073HisSer: 1.073 ± 0.721
0.536HisThr: 0.536 ± 0.537
1.073HisVal: 1.073 ± 0.732
0.0HisTrp: 0.0 ± 0.0
1.073HisTyr: 1.073 ± 0.527
0.0HisXaa: 0.0 ± 0.0
Ile
2.682IleAla: 2.682 ± 0.506
1.073IleCys: 1.073 ± 0.787
2.682IleAsp: 2.682 ± 0.796
3.219IleGlu: 3.219 ± 1.414
1.073IlePhe: 1.073 ± 0.787
3.219IleGly: 3.219 ± 2.089
0.0IleHis: 0.0 ± 0.0
2.682IleIle: 2.682 ± 0.898
2.682IleLys: 2.682 ± 0.993
7.511IleLeu: 7.511 ± 2.167
1.073IleMet: 1.073 ± 0.787
1.073IleAsn: 1.073 ± 0.787
2.146IlePro: 2.146 ± 0.47
1.609IleGln: 1.609 ± 0.563
0.0IleArg: 0.0 ± 0.0
3.755IleSer: 3.755 ± 0.888
2.146IleThr: 2.146 ± 1.029
0.536IleVal: 0.536 ± 0.537
1.073IleTrp: 1.073 ± 0.732
1.073IleTyr: 1.073 ± 0.507
0.0IleXaa: 0.0 ± 0.0
Lys
6.438LysAla: 6.438 ± 1.215
0.0LysCys: 0.0 ± 0.0
2.146LysAsp: 2.146 ± 1.014
1.609LysGlu: 1.609 ± 1.181
0.536LysPhe: 0.536 ± 0.394
2.682LysGly: 2.682 ± 1.006
1.609LysHis: 1.609 ± 1.181
2.682LysIle: 2.682 ± 1.322
3.755LysLys: 3.755 ± 1.029
4.828LysLeu: 4.828 ± 1.023
2.146LysMet: 2.146 ± 0.735
2.146LysAsn: 2.146 ± 0.47
0.0LysPro: 0.0 ± 0.0
2.146LysGln: 2.146 ± 0.733
5.901LysArg: 5.901 ± 2.355
1.609LysSer: 1.609 ± 1.181
4.828LysThr: 4.828 ± 0.759
2.146LysVal: 2.146 ± 1.464
0.536LysTrp: 0.536 ± 0.394
1.073LysTyr: 1.073 ± 0.559
0.0LysXaa: 0.0 ± 0.0
Leu
10.193LeuAla: 10.193 ± 3.505
1.609LeuCys: 1.609 ± 0.732
5.901LeuAsp: 5.901 ± 2.028
8.047LeuGlu: 8.047 ± 3.248
4.828LeuPhe: 4.828 ± 1.587
5.901LeuGly: 5.901 ± 2.579
1.609LeuHis: 1.609 ± 1.125
3.219LeuIle: 3.219 ± 0.582
2.682LeuLys: 2.682 ± 0.993
10.193LeuLeu: 10.193 ± 2.148
3.219LeuMet: 3.219 ± 1.522
6.438LeuAsn: 6.438 ± 2.499
8.584LeuPro: 8.584 ± 2.795
8.047LeuGln: 8.047 ± 2.173
5.365LeuArg: 5.365 ± 0.716
2.146LeuSer: 2.146 ± 2.284
6.438LeuThr: 6.438 ± 1.263
2.682LeuVal: 2.682 ± 1.422
0.0LeuTrp: 0.0 ± 0.0
4.828LeuTyr: 4.828 ± 1.018
0.0LeuXaa: 0.0 ± 0.0
Met
2.146MetAla: 2.146 ± 0.748
0.0MetCys: 0.0 ± 0.0
2.682MetAsp: 2.682 ± 1.064
1.073MetGlu: 1.073 ± 1.186
1.609MetPhe: 1.609 ± 1.031
2.682MetGly: 2.682 ± 1.052
0.0MetHis: 0.0 ± 0.0
0.536MetIle: 0.536 ± 0.394
2.146MetLys: 2.146 ± 0.733
2.146MetLeu: 2.146 ± 0.638
0.536MetMet: 0.536 ± 0.394
1.073MetAsn: 1.073 ± 0.787
0.0MetPro: 0.0 ± 0.0
0.536MetGln: 0.536 ± 0.537
2.146MetArg: 2.146 ± 0.748
0.536MetSer: 0.536 ± 0.394
1.073MetThr: 1.073 ± 0.507
0.536MetVal: 0.536 ± 0.394
0.536MetTrp: 0.536 ± 0.537
1.073MetTyr: 1.073 ± 0.559
0.0MetXaa: 0.0 ± 0.0
Asn
1.073AsnAla: 1.073 ± 0.787
0.536AsnCys: 0.536 ± 0.544
1.073AsnAsp: 1.073 ± 0.507
2.146AsnGlu: 2.146 ± 1.014
1.073AsnPhe: 1.073 ± 0.787
1.073AsnGly: 1.073 ± 0.507
1.073AsnHis: 1.073 ± 0.787
3.755AsnIle: 3.755 ± 0.659
1.073AsnLys: 1.073 ± 0.559
3.219AsnLeu: 3.219 ± 1.464
1.073AsnMet: 1.073 ± 0.559
1.073AsnAsn: 1.073 ± 0.559
5.901AsnPro: 5.901 ± 0.792
2.146AsnGln: 2.146 ± 0.748
1.073AsnArg: 1.073 ± 0.732
3.219AsnSer: 3.219 ± 0.482
1.609AsnThr: 1.609 ± 0.69
0.0AsnVal: 0.0 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
1.609AsnTyr: 1.609 ± 0.732
0.0AsnXaa: 0.0 ± 0.0
Pro
2.682ProAla: 2.682 ± 1.926
0.0ProCys: 0.0 ± 0.0
6.438ProAsp: 6.438 ± 0.797
5.365ProGlu: 5.365 ± 1.631
0.536ProPhe: 0.536 ± 0.394
8.047ProGly: 8.047 ± 1.753
0.536ProHis: 0.536 ± 0.537
4.292ProIle: 4.292 ± 2.138
6.438ProLys: 6.438 ± 0.49
4.828ProLeu: 4.828 ± 1.248
1.073ProMet: 1.073 ± 0.721
1.609ProAsn: 1.609 ± 0.563
5.365ProPro: 5.365 ± 2.33
4.828ProGln: 4.828 ± 2.276
5.901ProArg: 5.901 ± 2.323
6.438ProSer: 6.438 ± 2.255
4.828ProThr: 4.828 ± 1.316
5.901ProVal: 5.901 ± 1.567
0.536ProTrp: 0.536 ± 0.544
2.146ProTyr: 2.146 ± 0.47
0.0ProXaa: 0.0 ± 0.0
Gln
3.219GlnAla: 3.219 ± 1.706
1.609GlnCys: 1.609 ± 1.031
0.0GlnAsp: 0.0 ± 0.0
3.219GlnGlu: 3.219 ± 1.011
1.609GlnPhe: 1.609 ± 0.799
3.755GlnGly: 3.755 ± 1.802
0.536GlnHis: 0.536 ± 0.544
3.755GlnIle: 3.755 ± 0.771
4.292GlnLys: 4.292 ± 1.276
4.292GlnLeu: 4.292 ± 0.91
0.536GlnMet: 0.536 ± 0.394
2.682GlnAsn: 2.682 ± 1.006
1.609GlnPro: 1.609 ± 1.206
1.073GlnGln: 1.073 ± 0.732
4.292GlnArg: 4.292 ± 1.159
2.682GlnSer: 2.682 ± 0.796
2.146GlnThr: 2.146 ± 0.47
1.609GlnVal: 1.609 ± 0.792
0.0GlnTrp: 0.0 ± 0.0
1.073GlnTyr: 1.073 ± 1.075
0.0GlnXaa: 0.0 ± 0.0
Arg
5.365ArgAla: 5.365 ± 1.18
0.536ArgCys: 0.536 ± 0.394
4.292ArgAsp: 4.292 ± 1.201
2.682ArgGlu: 2.682 ± 1.422
1.609ArgPhe: 1.609 ± 0.563
3.219ArgGly: 3.219 ± 0.582
2.146ArgHis: 2.146 ± 1.464
2.146ArgIle: 2.146 ± 1.014
3.755ArgLys: 3.755 ± 1.578
4.292ArgLeu: 4.292 ± 0.819
2.682ArgMet: 2.682 ± 0.827
1.609ArgAsn: 1.609 ± 0.69
4.828ArgPro: 4.828 ± 0.947
2.146ArgGln: 2.146 ± 0.748
8.047ArgArg: 8.047 ± 2.333
6.438ArgSer: 6.438 ± 3.716
3.219ArgThr: 3.219 ± 1.422
1.073ArgVal: 1.073 ± 0.507
0.536ArgTrp: 0.536 ± 0.394
2.682ArgTyr: 2.682 ± 1.449
0.0ArgXaa: 0.0 ± 0.0
Ser
6.974SerAla: 6.974 ± 2.142
4.292SerCys: 4.292 ± 0.65
1.609SerAsp: 1.609 ± 0.799
2.682SerGlu: 2.682 ± 1.725
4.292SerPhe: 4.292 ± 0.91
5.365SerGly: 5.365 ± 1.523
0.0SerHis: 0.0 ± 0.0
1.073SerIle: 1.073 ± 0.527
1.073SerLys: 1.073 ± 0.732
5.901SerLeu: 5.901 ± 0.883
0.536SerMet: 0.536 ± 0.394
1.073SerAsn: 1.073 ± 1.186
6.974SerPro: 6.974 ± 3.051
2.682SerGln: 2.682 ± 0.715
3.219SerArg: 3.219 ± 1.769
5.365SerSer: 5.365 ± 2.878
4.292SerThr: 4.292 ± 1.491
2.146SerVal: 2.146 ± 1.029
0.0SerTrp: 0.0 ± 0.0
3.219SerTyr: 3.219 ± 1.655
0.0SerXaa: 0.0 ± 0.0
Thr
4.828ThrAla: 4.828 ± 1.124
1.073ThrCys: 1.073 ± 0.703
4.292ThrAsp: 4.292 ± 2.194
2.682ThrGlu: 2.682 ± 0.668
1.073ThrPhe: 1.073 ± 0.703
5.901ThrGly: 5.901 ± 0.853
1.609ThrHis: 1.609 ± 0.799
0.536ThrIle: 0.536 ± 0.537
1.073ThrLys: 1.073 ± 0.753
9.657ThrLeu: 9.657 ± 0.996
2.146ThrMet: 2.146 ± 1.138
1.609ThrAsn: 1.609 ± 0.732
8.584ThrPro: 8.584 ± 4.079
1.073ThrGln: 1.073 ± 1.075
3.755ThrArg: 3.755 ± 0.668
3.755ThrSer: 3.755 ± 2.957
3.755ThrThr: 3.755 ± 1.512
6.438ThrVal: 6.438 ± 1.355
1.073ThrTrp: 1.073 ± 0.732
2.682ThrTyr: 2.682 ± 1.594
0.0ThrXaa: 0.0 ± 0.0
Val
4.292ValAla: 4.292 ± 0.979
0.536ValCys: 0.536 ± 0.394
2.682ValAsp: 2.682 ± 0.838
5.365ValGlu: 5.365 ± 1.012
1.609ValPhe: 1.609 ± 0.563
1.609ValGly: 1.609 ± 1.125
0.0ValHis: 0.0 ± 0.0
2.146ValIle: 2.146 ± 0.889
1.609ValLys: 1.609 ± 0.968
5.365ValLeu: 5.365 ± 1.829
0.0ValMet: 0.0 ± 0.0
3.755ValAsn: 3.755 ± 1.136
3.755ValPro: 3.755 ± 1.126
2.146ValGln: 2.146 ± 0.47
2.682ValArg: 2.682 ± 1.422
4.828ValSer: 4.828 ± 0.924
2.146ValThr: 2.146 ± 1.014
4.292ValVal: 4.292 ± 0.885
1.073ValTrp: 1.073 ± 0.732
0.536ValTyr: 0.536 ± 0.394
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.073TrpAsp: 1.073 ± 0.732
0.536TrpGlu: 0.536 ± 0.537
0.0TrpPhe: 0.0 ± 0.0
1.609TrpGly: 1.609 ± 0.563
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.073TrpLys: 1.073 ± 0.732
0.0TrpLeu: 0.0 ± 0.0
1.073TrpMet: 1.073 ± 0.732
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.536TrpArg: 0.536 ± 0.394
0.536TrpSer: 0.536 ± 0.537
1.609TrpThr: 1.609 ± 0.958
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.073TrpTyr: 1.073 ± 0.732
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.073TyrAla: 1.073 ± 0.787
0.0TyrCys: 0.0 ± 0.0
2.146TyrAsp: 2.146 ± 0.689
3.755TyrGlu: 3.755 ± 1.605
2.682TyrPhe: 2.682 ± 1.33
3.219TyrGly: 3.219 ± 1.327
2.682TyrHis: 2.682 ± 1.133
0.536TyrIle: 0.536 ± 0.537
1.073TyrLys: 1.073 ± 0.787
2.682TyrLeu: 2.682 ± 0.353
0.536TyrMet: 0.536 ± 0.544
0.0TyrAsn: 0.0 ± 0.0
1.609TyrPro: 1.609 ± 1.612
2.146TyrGln: 2.146 ± 0.995
4.292TyrArg: 4.292 ± 1.629
1.609TyrSer: 1.609 ± 0.63
2.146TyrThr: 2.146 ± 1.542
1.609TyrVal: 1.609 ± 0.563
0.0TyrTrp: 0.0 ± 0.0
1.609TyrTyr: 1.609 ± 0.563
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1865 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski