Amino acid dipepetide frequency for Cumuto virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.108AlaAla: 2.108 ± 1.611
1.204AlaCys: 1.204 ± 0.871
2.409AlaAsp: 2.409 ± 0.263
2.409AlaGlu: 2.409 ± 1.573
1.506AlaPhe: 1.506 ± 0.414
3.312AlaGly: 3.312 ± 0.896
0.602AlaHis: 0.602 ± 0.344
3.312AlaIle: 3.312 ± 1.414
5.42AlaLys: 5.42 ± 1.119
5.721AlaLeu: 5.721 ± 4.311
0.903AlaMet: 0.903 ± 0.516
1.807AlaAsn: 1.807 ± 0.319
1.506AlaPro: 1.506 ± 1.735
1.506AlaGln: 1.506 ± 0.414
2.71AlaArg: 2.71 ± 0.848
4.216AlaSer: 4.216 ± 0.771
3.011AlaThr: 3.011 ± 0.572
2.71AlaVal: 2.71 ± 1.867
0.602AlaTrp: 0.602 ± 0.344
1.807AlaTyr: 1.807 ± 1.081
0.0AlaXaa: 0.0 ± 0.0
Cys
0.903CysAla: 0.903 ± 0.159
0.301CysCys: 0.301 ± 0.172
1.506CysAsp: 1.506 ± 1.21
1.506CysGlu: 1.506 ± 0.731
0.602CysPhe: 0.602 ± 0.68
1.204CysGly: 1.204 ± 0.871
1.204CysHis: 1.204 ± 1.36
1.506CysIle: 1.506 ± 1.21
2.108CysLys: 2.108 ± 1.403
3.011CysLeu: 3.011 ± 0.572
0.602CysMet: 0.602 ± 0.202
1.204CysAsn: 1.204 ± 1.36
1.506CysPro: 1.506 ± 1.701
1.204CysGln: 1.204 ± 0.405
0.0CysArg: 0.0 ± 0.0
2.409CysSer: 2.409 ± 2.229
1.506CysThr: 1.506 ± 0.731
0.903CysVal: 0.903 ± 0.533
0.0CysTrp: 0.0 ± 0.0
0.301CysTyr: 0.301 ± 0.34
0.0CysXaa: 0.0 ± 0.0
Asp
3.011AspAla: 3.011 ± 1.553
1.807AspCys: 1.807 ± 2.041
5.42AspAsp: 5.42 ± 1.599
4.818AspGlu: 4.818 ± 0.93
2.108AspPhe: 2.108 ± 0.536
4.216AspGly: 4.216 ± 0.797
0.602AspHis: 0.602 ± 0.202
3.312AspIle: 3.312 ± 0.989
3.312AspLys: 3.312 ± 0.656
5.42AspLeu: 5.42 ± 1.285
1.204AspMet: 1.204 ± 0.687
2.71AspAsn: 2.71 ± 0.671
1.506AspPro: 1.506 ± 0.414
2.71AspGln: 2.71 ± 1.233
3.613AspArg: 3.613 ± 0.894
4.517AspSer: 4.517 ± 0.772
1.807AspThr: 1.807 ± 0.607
3.914AspVal: 3.914 ± 0.683
1.204AspTrp: 1.204 ± 0.262
0.903AspTyr: 0.903 ± 0.863
0.0AspXaa: 0.0 ± 0.0
Glu
2.409GluAla: 2.409 ± 0.525
0.903GluCys: 0.903 ± 0.533
3.312GluAsp: 3.312 ± 0.656
8.431GluGlu: 8.431 ± 2.555
4.517GluPhe: 4.517 ± 1.657
3.613GluGly: 3.613 ± 1.35
1.807GluHis: 1.807 ± 0.447
4.216GluIle: 4.216 ± 1.084
4.818GluLys: 4.818 ± 0.856
4.818GluLeu: 4.818 ± 0.943
2.71GluMet: 2.71 ± 0.287
3.011GluAsn: 3.011 ± 0.547
2.108GluPro: 2.108 ± 0.399
1.204GluGln: 1.204 ± 0.642
2.108GluArg: 2.108 ± 0.744
7.829GluSer: 7.829 ± 1.28
3.914GluThr: 3.914 ± 0.566
3.613GluVal: 3.613 ± 0.787
1.807GluTrp: 1.807 ± 0.319
1.807GluTyr: 1.807 ± 1.031
0.0GluXaa: 0.0 ± 0.0
Phe
1.204PheAla: 1.204 ± 0.45
0.903PheCys: 0.903 ± 0.533
3.914PheAsp: 3.914 ± 1.849
2.108PheGlu: 2.108 ± 0.399
2.71PhePhe: 2.71 ± 0.671
2.71PheGly: 2.71 ± 0.287
2.409PheHis: 2.409 ± 1.741
1.807PheIle: 1.807 ± 0.607
1.807PheLys: 1.807 ± 0.319
1.807PheLeu: 1.807 ± 0.447
1.506PheMet: 1.506 ± 0.414
1.204PheAsn: 1.204 ± 0.687
1.807PhePro: 1.807 ± 1.031
0.903PheGln: 0.903 ± 0.159
2.108PheArg: 2.108 ± 0.744
3.312PheSer: 3.312 ± 0.656
2.409PheThr: 2.409 ± 0.913
1.807PheVal: 1.807 ± 0.319
0.903PheTrp: 0.903 ± 0.159
0.602PheTyr: 0.602 ± 0.202
0.0PheXaa: 0.0 ± 0.0
Gly
3.011GlyAla: 3.011 ± 0.829
1.506GlyCys: 1.506 ± 1.701
3.914GlyAsp: 3.914 ± 1.565
3.011GlyGlu: 3.011 ± 0.572
3.613GlyPhe: 3.613 ± 0.729
2.409GlyGly: 2.409 ± 0.465
0.602GlyHis: 0.602 ± 0.344
3.914GlyIle: 3.914 ± 0.434
2.108GlyLys: 2.108 ± 0.931
4.818GlyLeu: 4.818 ± 0.202
1.204GlyMet: 1.204 ± 0.262
2.108GlyAsn: 2.108 ± 0.619
0.301GlyPro: 0.301 ± 0.34
1.204GlyGln: 1.204 ± 0.701
1.807GlyArg: 1.807 ± 1.081
5.721GlySer: 5.721 ± 1.137
2.108GlyThr: 2.108 ± 0.619
3.914GlyVal: 3.914 ± 0.783
0.301GlyTrp: 0.301 ± 0.172
2.71GlyTyr: 2.71 ± 0.334
0.0GlyXaa: 0.0 ± 0.0
His
0.602HisAla: 0.602 ± 0.68
0.0HisCys: 0.0 ± 0.0
0.301HisAsp: 0.301 ± 0.172
0.903HisGlu: 0.903 ± 0.159
0.903HisPhe: 0.903 ± 0.533
0.602HisGly: 0.602 ± 0.202
0.301HisHis: 0.301 ± 0.172
2.108HisIle: 2.108 ± 0.513
1.506HisLys: 1.506 ± 0.545
1.506HisLeu: 1.506 ± 0.321
0.903HisMet: 0.903 ± 0.533
1.807HisAsn: 1.807 ± 0.319
0.301HisPro: 0.301 ± 0.172
0.903HisGln: 0.903 ± 0.541
0.602HisArg: 0.602 ± 0.344
0.903HisSer: 0.903 ± 0.159
1.506HisThr: 1.506 ± 0.731
0.903HisVal: 0.903 ± 0.159
0.602HisTrp: 0.602 ± 0.68
1.204HisTyr: 1.204 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
2.409IleAla: 2.409 ± 2.217
1.807IleCys: 1.807 ± 1.065
3.914IleAsp: 3.914 ± 0.783
6.323IleGlu: 6.323 ± 1.068
3.011IlePhe: 3.011 ± 0.642
3.613IleGly: 3.613 ± 1.229
2.108IleHis: 2.108 ± 0.744
3.914IleIle: 3.914 ± 1.565
4.818IleLys: 4.818 ± 1.535
6.323IleLeu: 6.323 ± 1.196
1.204IleMet: 1.204 ± 0.405
3.613IleAsn: 3.613 ± 1.398
3.312IlePro: 3.312 ± 0.401
3.613IleGln: 3.613 ± 0.805
4.216IleArg: 4.216 ± 0.797
6.323IleSer: 6.323 ± 0.879
3.914IleThr: 3.914 ± 0.566
4.517IleVal: 4.517 ± 0.274
0.602IleTrp: 0.602 ± 0.202
2.108IleTyr: 2.108 ± 0.619
0.0IleXaa: 0.0 ± 0.0
Lys
3.914LysAla: 3.914 ± 1.32
0.903LysCys: 0.903 ± 1.02
4.517LysAsp: 4.517 ± 0.68
3.613LysGlu: 3.613 ± 1.154
3.312LysPhe: 3.312 ± 0.616
3.011LysGly: 3.011 ± 1.012
0.301LysHis: 0.301 ± 0.34
7.829LysIle: 7.829 ± 0.869
4.818LysLys: 4.818 ± 0.856
7.227LysLeu: 7.227 ± 0.98
3.312LysMet: 3.312 ± 1.064
6.323LysAsn: 6.323 ± 1.218
3.914LysPro: 3.914 ± 1.111
2.409LysGln: 2.409 ± 0.451
3.914LysArg: 3.914 ± 1.32
4.517LysSer: 4.517 ± 1.243
2.71LysThr: 2.71 ± 0.287
4.517LysVal: 4.517 ± 0.796
1.807LysTrp: 1.807 ± 0.447
2.71LysTyr: 2.71 ± 1.547
0.0LysXaa: 0.0 ± 0.0
Leu
5.721LeuAla: 5.721 ± 3.691
1.807LeuCys: 1.807 ± 0.577
3.011LeuAsp: 3.011 ± 1.253
5.721LeuGlu: 5.721 ± 3.265
2.409LeuPhe: 2.409 ± 0.659
3.914LeuGly: 3.914 ± 1.565
1.807LeuHis: 1.807 ± 1.065
8.13LeuIle: 8.13 ± 2.646
9.937LeuLys: 9.937 ± 1.099
8.13LeuLeu: 8.13 ± 0.542
3.312LeuMet: 3.312 ± 0.401
5.42LeuAsn: 5.42 ± 0.384
3.312LeuPro: 3.312 ± 0.91
4.216LeuGln: 4.216 ± 1.488
4.818LeuArg: 4.818 ± 1.792
7.227LeuSer: 7.227 ± 1.539
5.721LeuThr: 5.721 ± 1.498
5.42LeuVal: 5.42 ± 1.599
1.807LeuTrp: 1.807 ± 1.031
3.011LeuTyr: 3.011 ± 1.09
0.0LeuXaa: 0.0 ± 0.0
Met
1.807MetAla: 1.807 ± 0.607
0.602MetCys: 0.602 ± 0.57
1.506MetAsp: 1.506 ± 0.414
3.312MetGlu: 3.312 ± 1.424
0.903MetPhe: 0.903 ± 0.516
2.409MetGly: 2.409 ± 0.451
0.301MetHis: 0.301 ± 0.172
3.312MetIle: 3.312 ± 1.795
1.807MetLys: 1.807 ± 0.607
3.914MetLeu: 3.914 ± 1.766
1.807MetMet: 1.807 ± 0.858
0.301MetAsn: 0.301 ± 0.34
0.301MetPro: 0.301 ± 0.172
0.903MetGln: 0.903 ± 0.541
3.312MetArg: 3.312 ± 1.172
2.409MetSer: 2.409 ± 0.915
1.807MetThr: 1.807 ± 0.607
1.506MetVal: 1.506 ± 0.414
0.0MetTrp: 0.0 ± 0.0
1.807MetTyr: 1.807 ± 0.447
0.0MetXaa: 0.0 ± 0.0
Asn
1.506AsnAla: 1.506 ± 1.058
2.71AsnCys: 2.71 ± 1.131
2.108AsnAsp: 2.108 ± 0.994
2.71AsnGlu: 2.71 ± 0.478
1.506AsnPhe: 1.506 ± 0.859
3.011AsnGly: 3.011 ± 1.012
0.301AsnHis: 0.301 ± 0.608
2.71AsnIle: 2.71 ± 1.083
4.517AsnLys: 4.517 ± 0.796
4.517AsnLeu: 4.517 ± 1.07
0.602AsnMet: 0.602 ± 0.344
3.011AsnAsn: 3.011 ± 0.828
2.108AsnPro: 2.108 ± 0.399
3.011AsnGln: 3.011 ± 0.828
1.204AsnArg: 1.204 ± 0.405
2.71AsnSer: 2.71 ± 0.334
3.312AsnThr: 3.312 ± 0.656
4.216AsnVal: 4.216 ± 0.602
0.903AsnTrp: 0.903 ± 0.516
1.807AsnTyr: 1.807 ± 1.081
0.0AsnXaa: 0.0 ± 0.0
Pro
2.108ProAla: 2.108 ± 0.399
0.0ProCys: 0.0 ± 0.0
2.71ProAsp: 2.71 ± 1.547
2.409ProGlu: 2.409 ± 0.465
1.204ProPhe: 1.204 ± 0.687
1.506ProGly: 1.506 ± 0.414
1.204ProHis: 1.204 ± 0.262
2.71ProIle: 2.71 ± 0.478
2.108ProLys: 2.108 ± 0.619
0.903ProLeu: 0.903 ± 0.541
2.108ProMet: 2.108 ± 1.403
2.409ProAsn: 2.409 ± 0.451
0.301ProPro: 0.301 ± 0.172
0.903ProGln: 0.903 ± 0.159
0.301ProArg: 0.301 ± 0.608
3.312ProSer: 3.312 ± 0.401
3.011ProThr: 3.011 ± 1.09
3.312ProVal: 3.312 ± 2.392
0.903ProTrp: 0.903 ± 0.159
0.602ProTyr: 0.602 ± 0.344
0.0ProXaa: 0.0 ± 0.0
Gln
2.108GlnAla: 2.108 ± 0.98
0.602GlnCys: 0.602 ± 0.68
1.204GlnAsp: 1.204 ± 0.701
2.71GlnGlu: 2.71 ± 1.291
1.204GlnPhe: 1.204 ± 0.687
2.108GlnGly: 2.108 ± 0.536
0.903GlnHis: 0.903 ± 0.159
3.011GlnIle: 3.011 ± 0.142
2.409GlnLys: 2.409 ± 0.451
4.216GlnLeu: 4.216 ± 0.705
2.108GlnMet: 2.108 ± 0.293
1.506GlnAsn: 1.506 ± 0.321
0.903GlnPro: 0.903 ± 0.159
0.602GlnGln: 0.602 ± 0.57
1.204GlnArg: 1.204 ± 0.405
2.71GlnSer: 2.71 ± 0.71
2.71GlnThr: 2.71 ± 0.478
1.807GlnVal: 1.807 ± 1.081
0.903GlnTrp: 0.903 ± 0.516
1.506GlnTyr: 1.506 ± 0.414
0.0GlnXaa: 0.0 ± 0.0
Arg
3.011ArgAla: 3.011 ± 0.951
0.602ArgCys: 0.602 ± 0.68
3.613ArgAsp: 3.613 ± 0.688
3.312ArgGlu: 3.312 ± 0.611
1.807ArgPhe: 1.807 ± 0.858
0.903ArgGly: 0.903 ± 0.582
0.0ArgHis: 0.0 ± 0.0
4.216ArgIle: 4.216 ± 1.072
3.613ArgLys: 3.613 ± 1.595
6.926ArgLeu: 6.926 ± 1.755
2.108ArgMet: 2.108 ± 1.203
0.903ArgAsn: 0.903 ± 0.516
2.108ArgPro: 2.108 ± 0.293
2.71ArgGln: 2.71 ± 1.083
0.903ArgArg: 0.903 ± 0.516
3.613ArgSer: 3.613 ± 1.274
2.108ArgThr: 2.108 ± 0.984
5.119ArgVal: 5.119 ± 0.736
0.301ArgTrp: 0.301 ± 0.172
1.807ArgTyr: 1.807 ± 0.607
0.0ArgXaa: 0.0 ± 0.0
Ser
5.119SerAla: 5.119 ± 2.291
2.71SerCys: 2.71 ± 3.061
5.119SerAsp: 5.119 ± 0.942
5.42SerGlu: 5.42 ± 0.384
0.903SerPhe: 0.903 ± 0.516
3.914SerGly: 3.914 ± 1.605
0.903SerHis: 0.903 ± 0.159
5.721SerIle: 5.721 ± 1.415
5.119SerLys: 5.119 ± 0.635
9.335SerLeu: 9.335 ± 3.93
1.506SerMet: 1.506 ± 0.96
3.613SerAsn: 3.613 ± 0.688
3.312SerPro: 3.312 ± 0.611
3.011SerGln: 3.011 ± 0.828
6.323SerArg: 6.323 ± 2.777
6.323SerSer: 6.323 ± 2.232
3.613SerThr: 3.613 ± 0.894
5.721SerVal: 5.721 ± 1.013
0.602SerTrp: 0.602 ± 0.68
3.312SerTyr: 3.312 ± 0.656
0.0SerXaa: 0.0 ± 0.0
Thr
1.807ThrAla: 1.807 ± 0.402
1.506ThrCys: 1.506 ± 1.21
2.409ThrAsp: 2.409 ± 0.465
2.409ThrGlu: 2.409 ± 1.375
3.011ThrPhe: 3.011 ± 0.572
3.312ThrGly: 3.312 ± 0.91
0.602ThrHis: 0.602 ± 0.202
3.613ThrIle: 3.613 ± 1.149
3.613ThrLys: 3.613 ± 0.271
7.227ThrLeu: 7.227 ± 1.377
3.011ThrMet: 3.011 ± 0.142
2.409ThrAsn: 2.409 ± 0.913
2.71ThrPro: 2.71 ± 0.478
1.807ThrGln: 1.807 ± 0.607
2.409ThrArg: 2.409 ± 0.465
3.914ThrSer: 3.914 ± 0.933
4.517ThrThr: 4.517 ± 0.773
2.71ThrVal: 2.71 ± 2.226
0.301ThrTrp: 0.301 ± 0.172
0.903ThrTyr: 0.903 ± 0.533
0.0ThrXaa: 0.0 ± 0.0
Val
3.914ValAla: 3.914 ± 0.987
1.204ValCys: 1.204 ± 0.262
3.914ValAsp: 3.914 ± 0.072
3.613ValGlu: 3.613 ± 0.831
0.602ValPhe: 0.602 ± 0.202
2.71ValGly: 2.71 ± 0.478
2.108ValHis: 2.108 ± 0.513
4.216ValIle: 4.216 ± 2.175
7.227ValLys: 7.227 ± 0.542
6.022ValLeu: 6.022 ± 0.408
2.71ValMet: 2.71 ± 0.324
2.71ValAsn: 2.71 ± 0.287
1.807ValPro: 1.807 ± 1.744
2.108ValGln: 2.108 ± 0.513
4.216ValArg: 4.216 ± 0.127
3.613ValSer: 3.613 ± 0.231
1.506ValThr: 1.506 ± 0.545
5.119ValVal: 5.119 ± 2.86
1.807ValTrp: 1.807 ± 0.577
3.011ValTyr: 3.011 ± 0.142
0.0ValXaa: 0.0 ± 0.0
Trp
0.602TrpAla: 0.602 ± 0.202
0.602TrpCys: 0.602 ± 0.202
1.506TrpAsp: 1.506 ± 0.738
1.506TrpGlu: 1.506 ± 0.321
0.903TrpPhe: 0.903 ± 0.533
0.301TrpGly: 0.301 ± 0.172
0.301TrpHis: 0.301 ± 0.34
0.602TrpIle: 0.602 ± 0.344
1.506TrpLys: 1.506 ± 0.414
0.903TrpLeu: 0.903 ± 0.159
0.602TrpMet: 0.602 ± 0.344
1.204TrpAsn: 1.204 ± 0.687
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.903TrpArg: 0.903 ± 0.516
1.807TrpSer: 1.807 ± 1.031
0.602TrpThr: 0.602 ± 0.202
1.506TrpVal: 1.506 ± 0.321
0.301TrpTrp: 0.301 ± 0.34
0.301TrpTyr: 0.301 ± 0.34
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.506TyrAla: 1.506 ± 0.414
1.807TyrCys: 1.807 ± 1.549
1.807TyrAsp: 1.807 ± 0.607
2.71TyrGlu: 2.71 ± 0.71
1.204TyrPhe: 1.204 ± 0.405
1.506TyrGly: 1.506 ± 0.731
0.0TyrHis: 0.0 ± 0.0
1.506TyrIle: 1.506 ± 0.738
3.011TyrLys: 3.011 ± 0.461
2.108TyrLeu: 2.108 ± 0.536
0.301TyrMet: 0.301 ± 0.156
1.204TyrAsn: 1.204 ± 0.262
0.903TyrPro: 0.903 ± 0.582
1.506TyrGln: 1.506 ± 1.14
3.011TyrArg: 3.011 ± 0.828
4.216TyrSer: 4.216 ± 1.072
2.409TyrThr: 2.409 ± 0.81
0.903TyrVal: 0.903 ± 0.516
0.301TyrTrp: 0.301 ± 0.34
1.506TyrTyr: 1.506 ± 0.859
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3322 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski