Amino acid dipepetide frequency for Sus scrofa polyomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.022AlaAla: 3.022 ± 0.913
3.694AlaCys: 3.694 ± 0.791
2.015AlaAsp: 2.015 ± 0.676
2.686AlaGlu: 2.686 ± 1.149
3.358AlaPhe: 3.358 ± 0.702
8.059AlaGly: 8.059 ± 1.322
1.007AlaHis: 1.007 ± 0.7
1.343AlaIle: 1.343 ± 0.489
3.022AlaLys: 3.022 ± 1.271
9.402AlaLeu: 9.402 ± 1.922
1.343AlaMet: 1.343 ± 0.655
0.672AlaAsn: 0.672 ± 0.437
5.373AlaPro: 5.373 ± 1.463
1.679AlaGln: 1.679 ± 0.42
3.358AlaArg: 3.358 ± 0.742
5.037AlaSer: 5.037 ± 1.395
1.343AlaThr: 1.343 ± 0.657
3.694AlaVal: 3.694 ± 0.813
0.336AlaTrp: 0.336 ± 0.233
0.672AlaTyr: 0.672 ± 0.467
0.0AlaXaa: 0.0 ± 0.0
Cys
1.343CysAla: 1.343 ± 0.658
0.336CysCys: 0.336 ± 0.233
0.0CysAsp: 0.0 ± 0.0
0.336CysGlu: 0.336 ± 0.233
3.358CysPhe: 3.358 ± 0.922
1.343CysGly: 1.343 ± 0.864
0.0CysHis: 0.0 ± 0.0
0.672CysIle: 0.672 ± 0.463
1.343CysLys: 1.343 ± 0.683
8.059CysLeu: 8.059 ± 1.732
0.0CysMet: 0.0 ± 0.0
0.336CysAsn: 0.336 ± 0.338
1.679CysPro: 1.679 ± 0.647
0.336CysGln: 0.336 ± 0.233
0.672CysArg: 0.672 ± 0.383
1.343CysSer: 1.343 ± 0.524
0.672CysThr: 0.672 ± 0.383
0.672CysVal: 0.672 ± 0.329
0.0CysTrp: 0.0 ± 0.0
4.365CysTyr: 4.365 ± 0.995
0.0CysXaa: 0.0 ± 0.0
Asp
5.037AspAla: 5.037 ± 0.883
3.358AspCys: 3.358 ± 0.922
3.358AspAsp: 3.358 ± 1.111
1.679AspGlu: 1.679 ± 0.483
1.679AspPhe: 1.679 ± 0.912
3.358AspGly: 3.358 ± 0.931
1.679AspHis: 1.679 ± 0.731
5.037AspIle: 5.037 ± 1.118
4.365AspLys: 4.365 ± 0.705
7.388AspLeu: 7.388 ± 1.819
0.336AspMet: 0.336 ± 0.306
0.336AspAsn: 0.336 ± 0.233
4.365AspPro: 4.365 ± 0.96
0.0AspGln: 0.0 ± 0.0
0.0AspArg: 0.0 ± 0.0
5.373AspSer: 5.373 ± 0.785
0.336AspThr: 0.336 ± 0.343
1.679AspVal: 1.679 ± 0.75
1.007AspTrp: 1.007 ± 0.617
1.007AspTyr: 1.007 ± 0.379
0.0AspXaa: 0.0 ± 0.0
Glu
5.709GluAla: 5.709 ± 2.176
0.336GluCys: 0.336 ± 0.345
5.037GluAsp: 5.037 ± 1.308
4.701GluGlu: 4.701 ± 0.87
1.007GluPhe: 1.007 ± 0.7
3.358GluGly: 3.358 ± 1.187
1.007GluHis: 1.007 ± 0.459
1.679GluIle: 1.679 ± 0.525
2.351GluLys: 2.351 ± 1.634
12.089GluLeu: 12.089 ± 1.825
0.336GluMet: 0.336 ± 0.233
5.037GluAsn: 5.037 ± 0.834
3.022GluPro: 3.022 ± 1.075
2.351GluGln: 2.351 ± 0.761
4.701GluArg: 4.701 ± 1.092
1.679GluSer: 1.679 ± 0.579
1.007GluThr: 1.007 ± 0.391
3.358GluVal: 3.358 ± 1.368
0.336GluTrp: 0.336 ± 0.345
0.336GluTyr: 0.336 ± 0.233
0.0GluXaa: 0.0 ± 0.0
Phe
3.694PheAla: 3.694 ± 1.388
5.037PheCys: 5.037 ± 1.087
1.007PheAsp: 1.007 ± 0.482
2.015PheGlu: 2.015 ± 1.137
0.336PhePhe: 0.336 ± 0.233
5.373PheGly: 5.373 ± 0.827
1.679PheHis: 1.679 ± 0.534
0.672PheIle: 0.672 ± 0.313
1.343PheLys: 1.343 ± 0.694
4.365PheLeu: 4.365 ± 1.179
0.336PheMet: 0.336 ± 0.33
1.343PheAsn: 1.343 ± 0.475
6.044PhePro: 6.044 ± 1.768
6.38PheGln: 6.38 ± 1.61
2.015PheArg: 2.015 ± 0.655
2.351PheSer: 2.351 ± 0.662
2.686PheThr: 2.686 ± 0.49
0.336PheVal: 0.336 ± 0.338
0.0PheTrp: 0.0 ± 0.0
4.03PheTyr: 4.03 ± 0.638
0.0PheXaa: 0.0 ± 0.0
Gly
3.022GlyAla: 3.022 ± 0.76
1.343GlyCys: 1.343 ± 0.524
4.701GlyAsp: 4.701 ± 0.747
5.037GlyGlu: 5.037 ± 1.113
6.716GlyPhe: 6.716 ± 1.175
7.723GlyGly: 7.723 ± 1.015
0.0GlyHis: 0.0 ± 0.0
4.701GlyIle: 4.701 ± 1.55
6.38GlyLys: 6.38 ± 0.96
5.037GlyLeu: 5.037 ± 1.771
0.0GlyMet: 0.0 ± 0.0
3.358GlyAsn: 3.358 ± 0.745
2.686GlyPro: 2.686 ± 1.034
1.343GlyGln: 1.343 ± 0.426
3.022GlyArg: 3.022 ± 1.045
5.037GlySer: 5.037 ± 1.279
3.022GlyThr: 3.022 ± 1.257
1.679GlyVal: 1.679 ± 0.615
0.336GlyTrp: 0.336 ± 0.383
1.007GlyTyr: 1.007 ± 0.379
0.0GlyXaa: 0.0 ± 0.0
His
1.343HisAla: 1.343 ± 0.479
3.358HisCys: 3.358 ± 0.918
0.672HisAsp: 0.672 ± 0.389
0.672HisGlu: 0.672 ± 0.313
2.351HisPhe: 2.351 ± 1.123
1.007HisGly: 1.007 ± 0.391
0.672HisHis: 0.672 ± 0.677
0.0HisIle: 0.0 ± 0.0
0.672HisLys: 0.672 ± 0.467
1.343HisLeu: 1.343 ± 0.479
1.007HisMet: 1.007 ± 0.363
0.336HisAsn: 0.336 ± 0.233
3.358HisPro: 3.358 ± 0.745
0.672HisGln: 0.672 ± 0.518
1.343HisArg: 1.343 ± 0.637
3.694HisSer: 3.694 ± 0.987
0.0HisThr: 0.0 ± 0.0
1.343HisVal: 1.343 ± 0.546
1.343HisTrp: 1.343 ± 0.541
0.336HisTyr: 0.336 ± 0.233
0.0HisXaa: 0.0 ± 0.0
Ile
4.365IleAla: 4.365 ± 1.345
1.679IleCys: 1.679 ± 0.678
4.365IleAsp: 4.365 ± 0.617
1.679IleGlu: 1.679 ± 0.871
2.686IlePhe: 2.686 ± 0.7
2.351IleGly: 2.351 ± 0.77
1.007IleHis: 1.007 ± 0.435
1.679IleIle: 1.679 ± 0.871
3.022IleLys: 3.022 ± 0.797
3.022IleLeu: 3.022 ± 1.064
1.007IleMet: 1.007 ± 0.453
1.343IleAsn: 1.343 ± 0.752
4.701IlePro: 4.701 ± 0.783
1.343IleGln: 1.343 ± 0.562
1.007IleArg: 1.007 ± 0.437
3.358IleSer: 3.358 ± 0.648
2.015IleThr: 2.015 ± 1.235
2.686IleVal: 2.686 ± 0.883
0.336IleTrp: 0.336 ± 0.338
1.007IleTyr: 1.007 ± 0.379
0.0IleXaa: 0.0 ± 0.0
Lys
5.709LysAla: 5.709 ± 0.89
0.672LysCys: 0.672 ± 0.467
1.007LysAsp: 1.007 ± 0.573
6.044LysGlu: 6.044 ± 0.685
5.037LysPhe: 5.037 ± 0.815
4.701LysGly: 4.701 ± 0.664
2.686LysHis: 2.686 ± 0.81
2.015LysIle: 2.015 ± 1.199
4.701LysLys: 4.701 ± 1.388
4.365LysLeu: 4.365 ± 1.224
5.709LysMet: 5.709 ± 1.206
1.343LysAsn: 1.343 ± 0.658
2.351LysPro: 2.351 ± 0.828
1.343LysGln: 1.343 ± 0.747
2.686LysArg: 2.686 ± 0.847
1.343LysSer: 1.343 ± 0.934
3.022LysThr: 3.022 ± 0.932
0.336LysVal: 0.336 ± 0.306
0.0LysTrp: 0.0 ± 0.0
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
7.723LeuAla: 7.723 ± 1.739
1.007LeuCys: 1.007 ± 0.435
8.059LeuAsp: 8.059 ± 1.184
5.037LeuGlu: 5.037 ± 1.506
4.03LeuPhe: 4.03 ± 0.761
3.022LeuGly: 3.022 ± 1.121
5.373LeuHis: 5.373 ± 0.777
3.358LeuIle: 3.358 ± 0.924
2.686LeuLys: 2.686 ± 0.977
14.775LeuLeu: 14.775 ± 2.304
3.358LeuMet: 3.358 ± 1.139
7.052LeuAsn: 7.052 ± 0.899
8.395LeuPro: 8.395 ± 3.358
7.723LeuGln: 7.723 ± 2.546
8.731LeuArg: 8.731 ± 1.864
7.052LeuSer: 7.052 ± 2.479
10.074LeuThr: 10.074 ± 1.407
3.358LeuVal: 3.358 ± 1.143
3.694LeuTrp: 3.694 ± 0.864
2.686LeuTyr: 2.686 ± 0.813
0.0LeuXaa: 0.0 ± 0.0
Met
1.679MetAla: 1.679 ± 0.505
0.0MetCys: 0.0 ± 0.0
3.022MetAsp: 3.022 ± 0.796
3.022MetGlu: 3.022 ± 1.285
0.336MetPhe: 0.336 ± 0.306
1.679MetGly: 1.679 ± 0.526
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
3.358MetLys: 3.358 ± 0.745
2.351MetLeu: 2.351 ± 0.878
4.03MetMet: 4.03 ± 1.163
0.336MetAsn: 0.336 ± 0.345
0.336MetPro: 0.336 ± 0.306
3.358MetGln: 3.358 ± 0.745
0.336MetArg: 0.336 ± 0.306
0.672MetSer: 0.672 ± 0.5
1.343MetThr: 1.343 ± 0.828
0.0MetVal: 0.0 ± 0.0
0.336MetTrp: 0.336 ± 0.306
0.672MetTyr: 0.672 ± 0.518
0.0MetXaa: 0.0 ± 0.0
Asn
4.03AsnAla: 4.03 ± 0.906
0.672AsnCys: 0.672 ± 0.383
0.672AsnAsp: 0.672 ± 0.329
4.365AsnGlu: 4.365 ± 0.645
2.015AsnPhe: 2.015 ± 0.78
4.365AsnGly: 4.365 ± 0.983
0.0AsnHis: 0.0 ± 0.0
4.701AsnIle: 4.701 ± 0.931
2.015AsnLys: 2.015 ± 0.847
2.686AsnLeu: 2.686 ± 0.759
0.0AsnMet: 0.0 ± 0.0
1.679AsnAsn: 1.679 ± 0.973
0.336AsnPro: 0.336 ± 0.306
1.343AsnGln: 1.343 ± 0.576
1.343AsnArg: 1.343 ± 0.652
1.343AsnSer: 1.343 ± 0.464
1.007AsnThr: 1.007 ± 0.573
1.007AsnVal: 1.007 ± 0.573
0.336AsnTrp: 0.336 ± 0.345
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.015ProAla: 2.015 ± 0.989
1.343ProCys: 1.343 ± 0.606
6.716ProAsp: 6.716 ± 0.792
1.007ProGlu: 1.007 ± 0.387
1.679ProPhe: 1.679 ± 0.699
2.686ProGly: 2.686 ± 0.852
5.373ProHis: 5.373 ± 0.972
2.686ProIle: 2.686 ± 0.843
4.365ProLys: 4.365 ± 1.482
7.388ProLeu: 7.388 ± 1.997
4.701ProMet: 4.701 ± 1.143
0.672ProAsn: 0.672 ± 0.415
8.731ProPro: 8.731 ± 2.227
1.343ProGln: 1.343 ± 1.117
3.358ProArg: 3.358 ± 1.44
6.716ProSer: 6.716 ± 1.73
2.351ProThr: 2.351 ± 0.678
3.694ProVal: 3.694 ± 0.956
0.672ProTrp: 0.672 ± 0.455
0.336ProTyr: 0.336 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
1.343GlnAla: 1.343 ± 0.706
0.0GlnCys: 0.0 ± 0.0
2.686GlnAsp: 2.686 ± 0.982
3.694GlnGlu: 3.694 ± 1.689
0.672GlnPhe: 0.672 ± 0.437
2.015GlnGly: 2.015 ± 0.818
1.007GlnHis: 1.007 ± 0.506
1.343GlnIle: 1.343 ± 0.658
2.686GlnLys: 2.686 ± 0.869
1.679GlnLeu: 1.679 ± 0.518
1.007GlnMet: 1.007 ± 0.57
0.672GlnAsn: 0.672 ± 0.521
0.672GlnPro: 0.672 ± 0.313
4.701GlnGln: 4.701 ± 0.81
8.731GlnArg: 8.731 ± 1.823
6.044GlnSer: 6.044 ± 1.204
5.037GlnThr: 5.037 ± 0.561
2.351GlnVal: 2.351 ± 0.714
0.0GlnTrp: 0.0 ± 0.0
0.336GlnTyr: 0.336 ± 0.306
0.0GlnXaa: 0.0 ± 0.0
Arg
2.351ArgAla: 2.351 ± 0.643
1.007ArgCys: 1.007 ± 0.718
2.351ArgAsp: 2.351 ± 0.612
5.373ArgGlu: 5.373 ± 1.035
2.351ArgPhe: 2.351 ± 1.016
2.686ArgGly: 2.686 ± 0.974
1.343ArgHis: 1.343 ± 0.694
1.679ArgIle: 1.679 ± 0.806
6.716ArgLys: 6.716 ± 1.491
10.41ArgLeu: 10.41 ± 2.598
1.343ArgMet: 1.343 ± 0.745
4.03ArgAsn: 4.03 ± 0.638
1.343ArgPro: 1.343 ± 0.637
1.679ArgGln: 1.679 ± 0.81
6.38ArgArg: 6.38 ± 1.719
6.38ArgSer: 6.38 ± 1.497
1.343ArgThr: 1.343 ± 0.475
2.351ArgVal: 2.351 ± 0.688
1.007ArgTrp: 1.007 ± 0.564
1.343ArgTyr: 1.343 ± 0.606
0.0ArgXaa: 0.0 ± 0.0
Ser
5.373SerAla: 5.373 ± 0.942
0.336SerCys: 0.336 ± 0.233
1.679SerAsp: 1.679 ± 0.702
6.716SerGlu: 6.716 ± 1.101
5.373SerPhe: 5.373 ± 0.897
5.373SerGly: 5.373 ± 1.861
2.015SerHis: 2.015 ± 0.731
5.373SerIle: 5.373 ± 1.112
1.679SerLys: 1.679 ± 0.58
8.059SerLeu: 8.059 ± 1.852
0.0SerMet: 0.0 ± 0.0
2.015SerAsn: 2.015 ± 0.734
6.044SerPro: 6.044 ± 1.373
5.037SerGln: 5.037 ± 1.344
6.044SerArg: 6.044 ± 2.462
13.432SerSer: 13.432 ± 5.229
7.052SerThr: 7.052 ± 1.784
1.007SerVal: 1.007 ± 0.379
1.007SerTrp: 1.007 ± 0.49
1.343SerTyr: 1.343 ± 0.625
0.0SerXaa: 0.0 ± 0.0
Thr
2.351ThrAla: 2.351 ± 0.669
1.343ThrCys: 1.343 ± 0.423
1.679ThrAsp: 1.679 ± 0.589
1.679ThrGlu: 1.679 ± 0.647
4.701ThrPhe: 4.701 ± 1.189
3.358ThrGly: 3.358 ± 0.946
0.336ThrHis: 0.336 ± 0.357
3.358ThrIle: 3.358 ± 1.606
0.672ThrLys: 0.672 ± 0.313
6.716ThrLeu: 6.716 ± 1.121
0.336ThrMet: 0.336 ± 0.306
0.672ThrAsn: 0.672 ± 0.612
5.709ThrPro: 5.709 ± 1.27
1.343ThrGln: 1.343 ± 0.531
4.365ThrArg: 4.365 ± 0.663
4.03ThrSer: 4.03 ± 1.468
3.022ThrThr: 3.022 ± 1.155
2.351ThrVal: 2.351 ± 1.092
0.336ThrTrp: 0.336 ± 0.233
0.672ThrTyr: 0.672 ± 0.437
0.0ThrXaa: 0.0 ± 0.0
Val
1.007ValAla: 1.007 ± 0.7
1.343ValCys: 1.343 ± 0.482
1.343ValAsp: 1.343 ± 0.649
1.007ValGlu: 1.007 ± 0.54
1.679ValPhe: 1.679 ± 0.794
1.007ValGly: 1.007 ± 0.918
0.336ValHis: 0.336 ± 0.233
2.015ValIle: 2.015 ± 0.93
1.679ValLys: 1.679 ± 0.635
4.701ValLeu: 4.701 ± 1.8
1.343ValMet: 1.343 ± 0.627
2.351ValAsn: 2.351 ± 0.961
2.351ValPro: 2.351 ± 0.82
0.672ValGln: 0.672 ± 0.431
0.672ValArg: 0.672 ± 0.612
4.365ValSer: 4.365 ± 1.191
2.686ValThr: 2.686 ± 1.153
2.015ValVal: 2.015 ± 1.146
0.672ValTrp: 0.672 ± 0.383
1.007ValTyr: 1.007 ± 0.387
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.336TrpAsp: 0.336 ± 0.345
0.672TrpGlu: 0.672 ± 0.431
0.336TrpPhe: 0.336 ± 0.345
0.672TrpGly: 0.672 ± 0.431
0.0TrpHis: 0.0 ± 0.0
0.336TrpIle: 0.336 ± 0.343
1.679TrpLys: 1.679 ± 0.628
1.343TrpLeu: 1.343 ± 0.564
0.0TrpMet: 0.0 ± 0.0
0.336TrpAsn: 0.336 ± 0.233
0.0TrpPro: 0.0 ± 0.0
3.694TrpGln: 3.694 ± 0.791
0.336TrpArg: 0.336 ± 0.338
1.343TrpSer: 1.343 ± 0.541
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
1.007TrpTrp: 1.007 ± 0.411
0.672TrpTyr: 0.672 ± 0.313
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.0TyrCys: 0.0 ± 0.0
0.672TyrAsp: 0.672 ± 0.313
2.015TyrGlu: 2.015 ± 0.918
1.343TyrPhe: 1.343 ± 0.606
2.015TyrGly: 2.015 ± 0.439
0.0TyrHis: 0.0 ± 0.0
2.015TyrIle: 2.015 ± 0.633
0.336TyrLys: 0.336 ± 0.306
2.015TyrLeu: 2.015 ± 0.741
0.336TyrMet: 0.336 ± 0.233
0.0TyrAsn: 0.0 ± 0.0
0.336TyrPro: 0.336 ± 0.306
0.672TyrGln: 0.672 ± 0.612
4.365TyrArg: 4.365 ± 0.857
4.365TyrSer: 4.365 ± 0.737
1.007TyrThr: 1.007 ± 0.564
0.336TyrVal: 0.336 ± 0.233
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2979 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski