Amino acid dipeptide frequency for Raccoon-associated polyomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
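
The counting behind such a table can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the names `dipeptide_per_mille` and `classify`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made here. Single-letter codes are used, with X standing in for Xaa.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
ALPHABET = STANDARD + "X"          # X plays the role of Xaa


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is mapped to X.
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (sequences too short?)")
    return {a + b: 1000 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


# Expected value if all 400 standard dipeptides were equally likely.
EXPECTED = 1000 / 400  # 2.5 per mille, i.e. 0.25%


def classify(freq, expected=EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not the real proteome.
toy = ["MKLLA", "LLPP"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), classify(freqs["LL"]), classify(freqs["AA"]))
```

With only two short toy sequences, most dipeptides never occur and the few that do are far above 2.5‰; on a real proteome the split between "over" and "under" is less extreme.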

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
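
As a small worked example of this unit conversion, the following Python sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the LeuLeu value is taken from the table on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


leu_leu = 21.171  # LeuLeu frequency in per mille, from the table
print(per_mille_to_fraction(leu_leu))  # roughly 0.021
print(per_mille_to_percent(leu_leu))   # roughly 2.1 percent
```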

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency (‰) ± error. Entries are grouped by first residue; within each group the second residue follows the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa.

Ala
AlaAla: 4.054 ± 1.131
AlaCys: 2.703 ± 1.253
AlaAsp: 2.252 ± 0.69
AlaGlu: 4.505 ± 1.811
AlaPhe: 1.802 ± 0.587
AlaGly: 3.153 ± 0.701
AlaHis: 0.0 ± 0.0
AlaIle: 2.252 ± 0.682
AlaLys: 0.901 ± 0.807
AlaLeu: 3.604 ± 1.551
AlaMet: 0.901 ± 0.392
AlaAsn: 0.0 ± 0.0
AlaPro: 4.505 ± 0.772
AlaGln: 0.901 ± 0.516
AlaArg: 5.405 ± 2.723
AlaSer: 8.108 ± 1.098
AlaThr: 0.901 ± 0.414
AlaVal: 3.604 ± 1.906
AlaTrp: 0.45 ± 0.309
AlaTyr: 1.802 ± 0.882
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.901 ± 0.414
CysCys: 2.703 ± 1.796
CysAsp: 0.901 ± 0.414
CysGlu: 0.45 ± 0.309
CysPhe: 1.351 ± 0.732
CysGly: 0.45 ± 0.403
CysHis: 0.0 ± 0.0
CysIle: 1.351 ± 0.885
CysLys: 3.604 ± 1.38
CysLeu: 1.802 ± 1.115
CysMet: 0.0 ± 0.0
CysAsn: 0.45 ± 0.309
CysPro: 1.351 ± 0.609
CysGln: 0.901 ± 0.618
CysArg: 1.802 ± 1.431
CysSer: 1.351 ± 0.926
CysThr: 0.45 ± 0.309
CysVal: 0.901 ± 0.716
CysTrp: 0.0 ± 0.0
CysTyr: 2.252 ± 1.171
CysXaa: 0.0 ± 0.0

Asp
AspAla: 0.901 ± 0.392
AspCys: 0.45 ± 0.309
AspAsp: 1.802 ± 1.235
AspGlu: 4.054 ± 0.728
AspPhe: 2.252 ± 1.01
AspGly: 2.252 ± 0.69
AspHis: 0.45 ± 0.309
AspIle: 2.703 ± 1.467
AspLys: 4.054 ± 1.966
AspLeu: 4.955 ± 1.0
AspMet: 1.351 ± 0.757
AspAsn: 0.901 ± 0.618
AspPro: 3.604 ± 1.597
AspGln: 2.252 ± 0.715
AspArg: 0.901 ± 0.817
AspSer: 2.703 ± 1.219
AspThr: 3.153 ± 0.493
AspVal: 3.604 ± 0.937
AspTrp: 1.802 ± 1.635
AspTyr: 2.703 ± 0.621
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.505 ± 1.786
GluCys: 0.901 ± 0.538
GluAsp: 4.505 ± 1.412
GluGlu: 9.91 ± 1.989
GluPhe: 0.901 ± 0.618
GluGly: 2.703 ± 1.48
GluHis: 0.45 ± 0.309
GluIle: 0.45 ± 0.309
GluLys: 9.459 ± 2.419
GluLeu: 7.207 ± 2.177
GluMet: 0.0 ± 0.0
GluAsn: 1.802 ± 0.829
GluPro: 4.054 ± 0.91
GluGln: 3.153 ± 2.331
GluArg: 4.505 ± 1.618
GluSer: 4.054 ± 1.896
GluThr: 2.252 ± 0.876
GluVal: 3.153 ± 1.831
GluTrp: 0.45 ± 0.309
GluTyr: 0.901 ± 0.618
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.252 ± 0.564
PheCys: 0.45 ± 0.309
PheAsp: 2.703 ± 1.4
PheGlu: 0.901 ± 0.618
PhePhe: 2.252 ± 0.93
PheGly: 1.802 ± 0.483
PheHis: 1.802 ± 0.774
PheIle: 1.802 ± 0.829
PheLys: 1.351 ± 0.926
PheLeu: 3.153 ± 1.473
PheMet: 0.45 ± 0.309
PheAsn: 0.901 ± 0.414
PhePro: 3.604 ± 0.54
PheGln: 2.252 ± 1.416
PheArg: 1.351 ± 0.734
PheSer: 5.856 ± 1.495
PheThr: 2.252 ± 1.25
PheVal: 0.901 ± 0.414
PheTrp: 0.0 ± 0.0
PheTyr: 0.45 ± 0.309
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.703 ± 0.917
GlyCys: 0.901 ± 0.618
GlyAsp: 5.405 ± 0.917
GlyGlu: 1.802 ± 0.483
GlyPhe: 1.802 ± 0.704
GlyGly: 5.856 ± 1.145
GlyHis: 0.0 ± 0.0
GlyIle: 3.604 ± 1.115
GlyLys: 3.153 ± 1.155
GlyLeu: 7.207 ± 1.562
GlyMet: 0.0 ± 0.0
GlyAsn: 1.802 ± 0.655
GlyPro: 3.604 ± 0.841
GlyGln: 4.955 ± 1.629
GlyArg: 1.351 ± 0.734
GlySer: 3.604 ± 2.281
GlyThr: 1.802 ± 0.873
GlyVal: 5.405 ± 1.868
GlyTrp: 0.45 ± 0.484
GlyTyr: 0.45 ± 0.403
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.351 ± 0.749
HisCys: 0.0 ± 0.0
HisAsp: 0.901 ± 0.817
HisGlu: 1.351 ± 0.554
HisPhe: 0.45 ± 0.403
HisGly: 0.0 ± 0.0
HisHis: 2.252 ± 1.008
HisIle: 0.901 ± 0.618
HisLys: 0.45 ± 0.309
HisLeu: 3.153 ± 1.766
HisMet: 0.901 ± 0.762
HisAsn: 0.45 ± 0.309
HisPro: 4.054 ± 2.501
HisGln: 0.901 ± 0.779
HisArg: 2.252 ± 1.468
HisSer: 0.0 ± 0.0
HisThr: 1.351 ± 0.734
HisVal: 0.901 ± 0.414
HisTrp: 0.901 ± 0.968
HisTyr: 0.45 ± 0.309
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.351 ± 0.734
IleCys: 0.45 ± 0.309
IleAsp: 1.351 ± 0.609
IleGlu: 5.405 ± 3.282
IlePhe: 0.45 ± 0.309
IleGly: 0.901 ± 0.817
IleHis: 1.351 ± 0.734
IleIle: 1.802 ± 0.873
IleLys: 1.802 ± 0.873
IleLeu: 4.054 ± 1.183
IleMet: 0.45 ± 0.309
IleAsn: 2.252 ± 0.564
IlePro: 3.604 ± 0.868
IleGln: 2.703 ± 0.744
IleArg: 0.45 ± 0.403
IleSer: 2.252 ± 0.69
IleThr: 4.505 ± 1.93
IleVal: 1.351 ± 0.749
IleTrp: 0.901 ± 0.538
IleTyr: 1.802 ± 0.587
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.153 ± 1.485
LysCys: 0.901 ± 0.618
LysAsp: 1.351 ± 0.733
LysGlu: 6.757 ± 1.908
LysPhe: 0.45 ± 0.309
LysGly: 4.505 ± 1.437
LysHis: 0.901 ± 0.618
LysIle: 0.45 ± 0.309
LysLys: 8.108 ± 1.522
LysLeu: 5.405 ± 1.613
LysMet: 3.604 ± 1.31
LysAsn: 2.703 ± 1.454
LysPro: 1.802 ± 0.942
LysGln: 3.153 ± 1.117
LysArg: 5.405 ± 0.782
LysSer: 2.252 ± 1.01
LysThr: 2.703 ± 1.219
LysVal: 2.252 ± 1.01
LysTrp: 0.901 ± 0.779
LysTyr: 4.505 ± 1.318
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.955 ± 1.436
LeuCys: 4.054 ± 1.249
LeuAsp: 6.306 ± 1.658
LeuGlu: 7.207 ± 2.178
LeuPhe: 5.856 ± 1.477
LeuGly: 5.405 ± 1.521
LeuHis: 5.405 ± 2.484
LeuIle: 5.405 ± 1.531
LeuLys: 4.054 ± 1.806
LeuLeu: 21.171 ± 8.61
LeuMet: 5.405 ± 1.33
LeuAsn: 4.505 ± 1.567
LeuPro: 9.459 ± 3.474
LeuGln: 4.505 ± 1.196
LeuArg: 4.505 ± 0.739
LeuSer: 8.108 ± 2.092
LeuThr: 1.351 ± 0.609
LeuVal: 5.856 ± 1.406
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.252 ± 1.522
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.153 ± 1.309
MetCys: 0.45 ± 0.309
MetAsp: 2.252 ± 1.01
MetGlu: 0.45 ± 0.403
MetPhe: 0.901 ± 0.414
MetGly: 1.802 ± 0.483
MetHis: 0.901 ± 0.779
MetIle: 0.0 ± 0.0
MetLys: 1.802 ± 0.812
MetLeu: 2.252 ± 0.893
MetMet: 0.901 ± 0.39
MetAsn: 0.901 ± 0.414
MetPro: 1.351 ± 0.377
MetGln: 2.252 ± 0.715
MetArg: 0.901 ± 0.779
MetSer: 2.252 ± 0.832
MetThr: 0.901 ± 0.807
MetVal: 1.802 ± 0.812
MetTrp: 0.45 ± 0.403
MetTyr: 1.351 ± 0.757
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.802 ± 0.646
AsnCys: 0.45 ± 0.309
AsnAsp: 1.802 ± 1.141
AsnGlu: 1.351 ± 1.21
AsnPhe: 1.802 ± 0.812
AsnGly: 1.351 ± 0.609
AsnHis: 0.0 ± 0.0
AsnIle: 1.351 ± 0.926
AsnLys: 2.252 ± 1.159
AsnLeu: 4.054 ± 1.296
AsnMet: 0.45 ± 0.438
AsnAsn: 1.351 ± 0.609
AsnPro: 2.703 ± 0.966
AsnGln: 0.901 ± 0.414
AsnArg: 0.45 ± 0.438
AsnSer: 4.054 ± 0.883
AsnThr: 2.703 ± 0.889
AsnVal: 3.153 ± 1.586
AsnTrp: 0.45 ± 0.309
AsnTyr: 0.901 ± 0.618
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.955 ± 1.795
ProCys: 1.351 ± 0.7
ProAsp: 4.505 ± 0.974
ProGlu: 4.505 ± 1.319
ProPhe: 1.351 ± 0.926
ProGly: 5.856 ± 1.269
ProHis: 4.505 ± 2.452
ProIle: 2.252 ± 0.69
ProLys: 3.604 ± 0.852
ProLeu: 9.91 ± 2.641
ProMet: 2.703 ± 1.086
ProAsn: 1.351 ± 0.609
ProPro: 16.216 ± 3.758
ProGln: 4.955 ± 1.352
ProArg: 4.955 ± 1.451
ProSer: 4.955 ± 1.625
ProThr: 5.856 ± 1.368
ProVal: 3.604 ± 2.318
ProTrp: 0.901 ± 0.817
ProTyr: 0.901 ± 0.414
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.153 ± 0.665
GlnCys: 0.0 ± 0.0
GlnAsp: 2.252 ± 1.088
GlnGlu: 0.45 ± 0.309
GlnPhe: 2.703 ± 0.886
GlnGly: 2.703 ± 1.132
GlnHis: 1.802 ± 0.655
GlnIle: 4.505 ± 3.045
GlnLys: 2.703 ± 0.886
GlnLeu: 4.955 ± 2.238
GlnMet: 1.802 ± 0.837
GlnAsn: 0.45 ± 0.309
GlnPro: 4.054 ± 1.426
GlnGln: 3.153 ± 0.936
GlnArg: 3.153 ± 1.266
GlnSer: 1.351 ± 0.609
GlnThr: 2.703 ± 0.917
GlnVal: 3.604 ± 1.256
GlnTrp: 0.45 ± 0.309
GlnTyr: 1.351 ± 0.609
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.252 ± 1.008
ArgCys: 0.901 ± 0.716
ArgAsp: 2.703 ± 0.814
ArgGlu: 5.405 ± 2.0
ArgPhe: 1.351 ± 0.757
ArgGly: 1.802 ± 0.587
ArgHis: 1.802 ± 1.01
ArgIle: 2.252 ± 0.69
ArgLys: 2.252 ± 0.929
ArgLeu: 4.955 ± 1.998
ArgMet: 0.901 ± 0.414
ArgAsn: 3.604 ± 1.108
ArgPro: 3.153 ± 0.739
ArgGln: 3.153 ± 1.496
ArgArg: 5.405 ± 2.659
ArgSer: 5.856 ± 1.359
ArgThr: 1.351 ± 1.21
ArgVal: 1.802 ± 0.774
ArgTrp: 0.901 ± 0.817
ArgTyr: 1.351 ± 0.757
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.054 ± 1.041
SerCys: 2.252 ± 1.036
SerAsp: 1.802 ± 0.829
SerGlu: 2.703 ± 1.219
SerPhe: 4.054 ± 1.141
SerGly: 4.505 ± 1.002
SerHis: 0.901 ± 0.779
SerIle: 0.45 ± 0.403
SerLys: 4.505 ± 2.175
SerLeu: 15.315 ± 3.033
SerMet: 0.901 ± 0.488
SerAsn: 2.703 ± 0.966
SerPro: 7.207 ± 1.578
SerGln: 3.604 ± 1.38
SerArg: 4.054 ± 1.862
SerSer: 6.757 ± 2.142
SerThr: 4.054 ± 1.488
SerVal: 4.054 ± 1.471
SerTrp: 0.0 ± 0.0
SerTyr: 1.802 ± 0.587
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.153 ± 1.092
ThrCys: 0.901 ± 0.807
ThrAsp: 0.901 ± 0.817
ThrGlu: 1.802 ± 1.141
ThrPhe: 2.703 ± 1.361
ThrGly: 1.351 ± 0.817
ThrHis: 0.45 ± 0.309
ThrIle: 3.153 ± 1.309
ThrLys: 0.45 ± 0.309
ThrLeu: 3.604 ± 1.305
ThrMet: 2.252 ± 0.564
ThrAsn: 2.252 ± 0.69
ThrPro: 8.559 ± 2.427
ThrGln: 0.45 ± 0.403
ThrArg: 2.703 ± 0.945
ThrSer: 4.054 ± 1.259
ThrThr: 4.955 ± 2.147
ThrVal: 4.054 ± 1.141
ThrTrp: 0.901 ± 0.817
ThrTyr: 2.703 ± 0.966
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.153 ± 0.772
ValCys: 1.802 ± 0.655
ValAsp: 0.901 ± 0.392
ValGlu: 3.604 ± 1.368
ValPhe: 1.351 ± 0.926
ValGly: 2.703 ± 1.435
ValHis: 0.0 ± 0.0
ValIle: 2.252 ± 0.69
ValLys: 3.153 ± 1.092
ValLeu: 6.306 ± 1.257
ValMet: 0.901 ± 0.567
ValAsn: 4.054 ± 1.107
ValPro: 4.955 ± 2.631
ValGln: 2.252 ± 0.564
ValArg: 1.351 ± 0.749
ValSer: 5.405 ± 1.075
ValThr: 5.405 ± 1.225
ValVal: 6.306 ± 0.772
ValTrp: 1.802 ± 0.782
ValTyr: 0.901 ± 0.618
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.901 ± 0.817
TrpGlu: 1.351 ± 0.609
TrpPhe: 0.45 ± 0.484
TrpGly: 1.351 ± 0.909
TrpHis: 0.0 ± 0.0
TrpIle: 0.901 ± 0.817
TrpLys: 0.901 ± 0.538
TrpLeu: 0.901 ± 0.538
TrpMet: 1.802 ± 1.154
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.45 ± 0.309
TrpArg: 0.45 ± 0.484
TrpSer: 1.351 ± 0.749
TrpThr: 0.901 ± 0.817
TrpVal: 0.45 ± 0.309
TrpTrp: 1.351 ± 0.732
TrpTyr: 0.901 ± 0.817
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.351 ± 0.732
TyrAsp: 1.351 ± 0.734
TyrGlu: 1.351 ± 0.749
TyrPhe: 2.252 ± 0.69
TyrGly: 5.405 ± 0.903
TyrHis: 0.45 ± 0.403
TyrIle: 1.351 ± 0.734
TyrLys: 2.703 ± 0.814
TyrLeu: 1.802 ± 0.774
TyrMet: 0.901 ± 0.618
TyrAsn: 0.901 ± 0.538
TyrPro: 1.351 ± 1.21
TyrGln: 0.45 ± 0.309
TyrArg: 1.802 ± 0.743
TyrSer: 1.351 ± 0.757
TyrThr: 1.802 ± 0.873
TyrVal: 1.802 ± 0.774
TyrTrp: 1.351 ± 0.734
TyrTyr: 1.802 ± 0.982
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (2221 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
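
One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption-laden illustration rather than the database's actual procedure: the function names, the use of the sample standard deviation as the error measure, the fixed seed, and the toy sequences are all choices made here.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the real sequences.
toy = ["MKLLA", "LLPP", "AAAA"]
print(dipeptide_freq(toy, "LL"), bootstrap_error(toy, "LL"))
```

With only a handful of proteins, as in the 7-protein proteome here, each resample can differ substantially from the original set, which is consistent with the relatively large errors seen for some dipeptides.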

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski