Amino acid dipepetide frequency for Restan virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.387AlaAla: 0.387 ± 0.178
1.16AlaCys: 1.16 ± 0.535
1.933AlaAsp: 1.933 ± 1.986
4.252AlaGlu: 4.252 ± 1.011
3.479AlaPhe: 3.479 ± 0.775
0.773AlaGly: 0.773 ± 2.015
0.387AlaHis: 0.387 ± 0.178
3.092AlaIle: 3.092 ± 0.697
6.185AlaLys: 6.185 ± 4.517
5.798AlaLeu: 5.798 ± 3.028
0.773AlaMet: 0.773 ± 0.356
5.025AlaAsn: 5.025 ± 3.589
0.387AlaPro: 0.387 ± 0.178
2.319AlaGln: 2.319 ± 1.856
2.319AlaArg: 2.319 ± 1.614
2.706AlaSer: 2.706 ± 1.247
1.933AlaThr: 1.933 ± 0.722
1.933AlaVal: 1.933 ± 0.722
0.387AlaTrp: 0.387 ± 0.178
1.933AlaTyr: 1.933 ± 0.722
0.0AlaXaa: 0.0 ± 0.0
Cys
1.16CysAla: 1.16 ± 0.535
0.387CysCys: 0.387 ± 0.178
0.387CysAsp: 0.387 ± 0.178
0.387CysGlu: 0.387 ± 0.178
1.16CysPhe: 1.16 ± 0.535
1.546CysGly: 1.546 ± 0.713
0.0CysHis: 0.0 ± 0.0
1.546CysIle: 1.546 ± 0.713
0.773CysLys: 0.773 ± 0.356
1.546CysLeu: 1.546 ± 0.713
0.387CysMet: 0.387 ± 0.178
1.546CysAsn: 1.546 ± 0.713
0.773CysPro: 0.773 ± 0.356
0.387CysGln: 0.387 ± 0.178
0.0CysArg: 0.0 ± 0.0
2.319CysSer: 2.319 ± 1.069
0.773CysThr: 0.773 ± 0.356
0.0CysVal: 0.0 ± 0.0
0.387CysTrp: 0.387 ± 0.178
0.387CysTyr: 0.387 ± 0.178
0.0CysXaa: 0.0 ± 0.0
Asp
1.933AspAla: 1.933 ± 0.891
1.16AspCys: 1.16 ± 0.535
3.479AspAsp: 3.479 ± 1.604
4.252AspGlu: 4.252 ± 1.378
5.025AspPhe: 5.025 ± 1.303
2.706AspGly: 2.706 ± 0.659
0.0AspHis: 0.0 ± 0.0
2.706AspIle: 2.706 ± 1.247
3.092AspLys: 3.092 ± 1.706
5.025AspLeu: 5.025 ± 2.316
2.319AspMet: 2.319 ± 1.069
2.319AspAsn: 2.319 ± 1.069
1.546AspPro: 1.546 ± 0.812
3.092AspGln: 3.092 ± 1.425
3.865AspArg: 3.865 ± 1.782
1.16AspSer: 1.16 ± 0.535
5.412AspThr: 5.412 ± 1.629
4.252AspVal: 4.252 ± 1.011
0.387AspTrp: 0.387 ± 0.178
1.546AspTyr: 1.546 ± 0.713
0.0AspXaa: 0.0 ± 0.0
Glu
2.319GluAla: 2.319 ± 0.667
0.773GluCys: 0.773 ± 0.356
3.865GluAsp: 3.865 ± 1.782
3.865GluGlu: 3.865 ± 1.443
6.958GluPhe: 6.958 ± 2.002
0.773GluGly: 0.773 ± 0.356
1.16GluHis: 1.16 ± 2.266
10.437GluIle: 10.437 ± 0.713
5.412GluLys: 5.412 ± 1.317
6.571GluLeu: 6.571 ± 2.042
5.025GluMet: 5.025 ± 1.869
4.252GluAsn: 4.252 ± 1.011
1.933GluPro: 1.933 ± 0.891
2.319GluGln: 2.319 ± 1.069
4.252GluArg: 4.252 ± 1.011
3.479GluSer: 3.479 ± 1.552
3.092GluThr: 3.092 ± 1.425
2.706GluVal: 2.706 ± 1.247
0.387GluTrp: 0.387 ± 0.178
2.319GluTyr: 2.319 ± 1.069
0.0GluXaa: 0.0 ± 0.0
Phe
1.933PheAla: 1.933 ± 0.891
3.479PheCys: 3.479 ± 1.604
2.319PheAsp: 2.319 ± 0.667
4.639PheGlu: 4.639 ± 1.335
3.479PhePhe: 3.479 ± 1.526
1.933PheGly: 1.933 ± 1.986
1.16PheHis: 1.16 ± 0.535
4.252PheIle: 4.252 ± 1.378
5.025PheLys: 5.025 ± 2.316
6.185PheLeu: 6.185 ± 3.825
1.16PheMet: 1.16 ± 0.535
3.479PheAsn: 3.479 ± 1.604
1.16PhePro: 1.16 ± 0.928
1.933PheGln: 1.933 ± 0.891
3.479PheArg: 3.479 ± 1.477
4.639PheSer: 4.639 ± 1.152
1.933PheThr: 1.933 ± 0.891
2.706PheVal: 2.706 ± 1.247
0.387PheTrp: 0.387 ± 0.178
1.546PheTyr: 1.546 ± 0.713
0.0PheXaa: 0.0 ± 0.0
Gly
1.546GlyAla: 1.546 ± 0.812
0.773GlyCys: 0.773 ± 0.356
3.479GlyAsp: 3.479 ± 1.477
5.025GlyGlu: 5.025 ± 1.314
1.546GlyPhe: 1.546 ± 0.812
1.16GlyGly: 1.16 ± 0.535
0.0GlyHis: 0.0 ± 0.0
4.252GlyIle: 4.252 ± 1.011
1.933GlyLys: 1.933 ± 0.891
1.933GlyLeu: 1.933 ± 2.191
0.387GlyMet: 0.387 ± 0.178
2.706GlyAsn: 2.706 ± 1.247
0.773GlyPro: 0.773 ± 0.356
1.16GlyGln: 1.16 ± 0.928
2.706GlyArg: 2.706 ± 1.549
1.16GlySer: 1.16 ± 1.898
2.319GlyThr: 2.319 ± 3.185
1.16GlyVal: 1.16 ± 0.928
0.773GlyTrp: 0.773 ± 0.356
1.546GlyTyr: 1.546 ± 0.713
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.387HisCys: 0.387 ± 0.178
0.0HisAsp: 0.0 ± 0.0
0.773HisGlu: 0.773 ± 1.062
1.933HisPhe: 1.933 ± 0.891
1.546HisGly: 1.546 ± 0.713
0.387HisHis: 0.387 ± 0.178
1.546HisIle: 1.546 ± 0.713
0.773HisLys: 0.773 ± 0.356
1.933HisLeu: 1.933 ± 1.695
0.387HisMet: 0.387 ± 0.178
3.092HisAsn: 3.092 ± 1.425
0.773HisPro: 0.773 ± 0.356
0.0HisGln: 0.0 ± 0.0
1.546HisArg: 1.546 ± 3.472
1.16HisSer: 1.16 ± 0.535
0.387HisThr: 0.387 ± 0.178
0.773HisVal: 0.773 ± 0.356
0.773HisTrp: 0.773 ± 0.356
0.773HisTyr: 0.773 ± 1.062
0.0HisXaa: 0.0 ± 0.0
Ile
4.252IleAla: 4.252 ± 1.011
0.387IleCys: 0.387 ± 0.178
4.639IleAsp: 4.639 ± 1.984
6.958IleGlu: 6.958 ± 2.12
2.319IlePhe: 2.319 ± 1.069
4.639IleGly: 4.639 ± 2.138
1.933IleHis: 1.933 ± 0.891
4.639IleIle: 4.639 ± 1.152
9.664IleLys: 9.664 ± 4.455
10.437IleLeu: 10.437 ± 2.055
3.092IleMet: 3.092 ± 1.425
4.252IleAsn: 4.252 ± 1.011
2.706IlePro: 2.706 ± 1.247
2.319IleGln: 2.319 ± 2.027
6.571IleArg: 6.571 ± 0.822
3.479IleSer: 3.479 ± 1.526
5.025IleThr: 5.025 ± 3.273
2.706IleVal: 2.706 ± 1.247
0.773IleTrp: 0.773 ± 0.356
2.319IleTyr: 2.319 ± 1.856
0.0IleXaa: 0.0 ± 0.0
Lys
3.092LysAla: 3.092 ± 1.425
1.16LysCys: 1.16 ± 0.535
4.639LysAsp: 4.639 ± 1.152
7.731LysGlu: 7.731 ± 1.766
3.865LysPhe: 3.865 ± 1.443
5.025LysGly: 5.025 ± 1.314
2.319LysHis: 2.319 ± 1.069
3.865LysIle: 3.865 ± 1.404
5.798LysLys: 5.798 ± 1.344
8.118LysLeu: 8.118 ± 1.976
2.706LysMet: 2.706 ± 0.666
3.865LysAsn: 3.865 ± 0.883
2.706LysPro: 2.706 ± 0.659
2.706LysGln: 2.706 ± 1.735
2.319LysArg: 2.319 ± 1.614
3.865LysSer: 3.865 ± 0.883
4.252LysThr: 4.252 ± 1.96
5.025LysVal: 5.025 ± 2.335
1.16LysTrp: 1.16 ± 0.928
3.092LysTyr: 3.092 ± 1.425
0.0LysXaa: 0.0 ± 0.0
Leu
5.798LeuAla: 5.798 ± 5.568
0.773LeuCys: 0.773 ± 0.356
6.958LeuAsp: 6.958 ± 3.207
7.344LeuGlu: 7.344 ± 0.747
5.025LeuPhe: 5.025 ± 1.017
1.933LeuGly: 1.933 ± 3.911
2.319LeuHis: 2.319 ± 1.856
8.118LeuIle: 8.118 ± 2.489
5.798LeuLys: 5.798 ± 2.165
10.823LeuLeu: 10.823 ± 6.097
1.546LeuMet: 1.546 ± 1.791
8.891LeuAsn: 8.891 ± 4.549
3.092LeuPro: 3.092 ± 2.695
4.252LeuGln: 4.252 ± 1.488
5.798LeuArg: 5.798 ± 6.573
8.891LeuSer: 8.891 ± 2.354
5.025LeuThr: 5.025 ± 3.891
2.706LeuVal: 2.706 ± 1.247
0.387LeuTrp: 0.387 ± 0.178
4.252LeuTyr: 4.252 ± 1.011
0.0LeuXaa: 0.0 ± 0.0
Met
1.546MetAla: 1.546 ± 0.812
0.387MetCys: 0.387 ± 0.178
1.16MetAsp: 1.16 ± 0.535
1.546MetGlu: 1.546 ± 0.713
1.16MetPhe: 1.16 ± 0.928
0.773MetGly: 0.773 ± 0.356
0.773MetHis: 0.773 ± 0.356
2.319MetIle: 2.319 ± 0.667
4.252MetLys: 4.252 ± 1.378
2.706MetLeu: 2.706 ± 0.659
1.933MetMet: 1.933 ± 0.891
2.706MetAsn: 2.706 ± 1.247
1.933MetPro: 1.933 ± 0.891
0.0MetGln: 0.0 ± 0.0
1.933MetArg: 1.933 ± 1.695
3.479MetSer: 3.479 ± 5.694
1.933MetThr: 1.933 ± 0.891
2.319MetVal: 2.319 ± 1.614
0.0MetTrp: 0.0 ± 0.0
0.387MetTyr: 0.387 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
3.865AsnAla: 3.865 ± 1.472
0.387AsnCys: 0.387 ± 0.178
5.412AsnAsp: 5.412 ± 1.459
4.252AsnGlu: 4.252 ± 1.011
3.479AsnPhe: 3.479 ± 1.477
1.16AsnGly: 1.16 ± 1.898
3.092AsnHis: 3.092 ± 0.697
4.639AsnIle: 4.639 ± 1.152
3.479AsnLys: 3.479 ± 1.477
7.344AsnLeu: 7.344 ± 0.936
2.319AsnMet: 2.319 ± 0.983
3.092AsnAsn: 3.092 ± 0.697
2.706AsnPro: 2.706 ± 0.659
2.319AsnGln: 2.319 ± 1.069
0.773AsnArg: 0.773 ± 0.356
2.706AsnSer: 2.706 ± 0.659
5.412AsnThr: 5.412 ± 1.317
1.933AsnVal: 1.933 ± 0.891
0.773AsnTrp: 0.773 ± 0.356
2.706AsnTyr: 2.706 ± 1.247
0.0AsnXaa: 0.0 ± 0.0
Pro
3.092ProAla: 3.092 ± 0.697
0.0ProCys: 0.0 ± 0.0
1.546ProAsp: 1.546 ± 0.812
3.479ProGlu: 3.479 ± 0.775
0.773ProPhe: 0.773 ± 0.356
2.706ProGly: 2.706 ± 0.659
0.0ProHis: 0.0 ± 0.0
2.319ProIle: 2.319 ± 0.667
3.092ProLys: 3.092 ± 0.697
2.319ProLeu: 2.319 ± 3.185
0.387ProMet: 0.387 ± 0.178
1.933ProAsn: 1.933 ± 0.891
0.773ProPro: 0.773 ± 0.356
0.387ProGln: 0.387 ± 0.178
0.773ProArg: 0.773 ± 0.356
3.092ProSer: 3.092 ± 1.425
1.933ProThr: 1.933 ± 1.695
2.706ProVal: 2.706 ± 0.659
1.16ProTrp: 1.16 ± 0.928
0.773ProTyr: 0.773 ± 0.356
0.0ProXaa: 0.0 ± 0.0
Gln
3.092GlnAla: 3.092 ± 1.624
0.773GlnCys: 0.773 ± 0.356
1.933GlnAsp: 1.933 ± 0.891
0.773GlnGlu: 0.773 ± 0.356
1.546GlnPhe: 1.546 ± 0.713
1.546GlnGly: 1.546 ± 0.812
0.0GlnHis: 0.0 ± 0.0
2.706GlnIle: 2.706 ± 1.549
3.092GlnLys: 3.092 ± 1.624
2.706GlnLeu: 2.706 ± 1.549
1.546GlnMet: 1.546 ± 0.713
0.773GlnAsn: 0.773 ± 1.062
0.387GlnPro: 0.387 ± 0.178
1.933GlnGln: 1.933 ± 0.722
3.479GlnArg: 3.479 ± 0.775
3.865GlnSer: 3.865 ± 3.674
3.479GlnThr: 3.479 ± 1.604
1.16GlnVal: 1.16 ± 0.535
0.387GlnTrp: 0.387 ± 0.178
1.546GlnTyr: 1.546 ± 0.812
0.0GlnXaa: 0.0 ± 0.0
Arg
1.546ArgAla: 1.546 ± 2.123
1.546ArgCys: 1.546 ± 0.713
4.639ArgAsp: 4.639 ± 1.152
2.706ArgGlu: 2.706 ± 1.247
2.319ArgPhe: 2.319 ± 0.667
1.16ArgGly: 1.16 ± 0.928
1.16ArgHis: 1.16 ± 0.535
5.798ArgIle: 5.798 ± 1.452
3.092ArgLys: 3.092 ± 0.697
5.025ArgLeu: 5.025 ± 7.48
1.933ArgMet: 1.933 ± 1.695
3.479ArgAsn: 3.479 ± 1.477
0.387ArgPro: 0.387 ± 1.207
2.319ArgGln: 2.319 ± 2.027
1.16ArgArg: 1.16 ± 0.535
4.252ArgSer: 4.252 ± 3.306
3.092ArgThr: 3.092 ± 1.503
1.933ArgVal: 1.933 ± 2.191
0.0ArgTrp: 0.0 ± 0.0
2.706ArgTyr: 2.706 ± 1.247
0.0ArgXaa: 0.0 ± 0.0
Ser
3.865SerAla: 3.865 ± 2.339
0.773SerCys: 0.773 ± 0.356
3.865SerAsp: 3.865 ± 1.782
3.092SerGlu: 3.092 ± 1.425
2.319SerPhe: 2.319 ± 1.069
1.546SerGly: 1.546 ± 2.123
1.546SerHis: 1.546 ± 1.791
6.571SerIle: 6.571 ± 3.029
5.798SerLys: 5.798 ± 1.344
8.891SerLeu: 8.891 ± 6.532
2.319SerMet: 2.319 ± 1.614
1.933SerAsn: 1.933 ± 0.891
3.092SerPro: 3.092 ± 1.425
2.706SerGln: 2.706 ± 0.659
4.639SerArg: 4.639 ± 1.133
5.798SerSer: 5.798 ± 13.22
4.252SerThr: 4.252 ± 1.263
5.025SerVal: 5.025 ± 1.017
1.16SerTrp: 1.16 ± 1.898
1.16SerTyr: 1.16 ± 0.535
0.0SerXaa: 0.0 ± 0.0
Thr
4.252ThrAla: 4.252 ± 1.011
0.0ThrCys: 0.0 ± 0.0
3.092ThrAsp: 3.092 ± 1.425
4.639ThrGlu: 4.639 ± 4.053
3.479ThrPhe: 3.479 ± 0.775
1.16ThrGly: 1.16 ± 0.535
1.16ThrHis: 1.16 ± 0.535
5.412ThrIle: 5.412 ± 1.656
3.865ThrLys: 3.865 ± 1.782
5.025ThrLeu: 5.025 ± 8.361
1.546ThrMet: 1.546 ± 0.812
3.092ThrAsn: 3.092 ± 1.503
3.479ThrPro: 3.479 ± 1.526
1.933ThrGln: 1.933 ± 0.891
1.933ThrArg: 1.933 ± 0.891
5.798ThrSer: 5.798 ± 1.62
2.706ThrThr: 2.706 ± 1.247
3.092ThrVal: 3.092 ± 0.697
1.546ThrTrp: 1.546 ± 2.123
3.092ThrTyr: 3.092 ± 1.425
0.0ThrXaa: 0.0 ± 0.0
Val
1.933ValAla: 1.933 ± 1.986
1.546ValCys: 1.546 ± 0.713
0.0ValAsp: 0.0 ± 0.0
3.092ValGlu: 3.092 ± 0.697
4.252ValPhe: 4.252 ± 1.011
2.319ValGly: 2.319 ± 1.069
0.773ValHis: 0.773 ± 0.356
3.479ValIle: 3.479 ± 1.604
2.319ValLys: 2.319 ± 1.069
3.092ValLeu: 3.092 ± 1.425
1.16ValMet: 1.16 ± 0.928
3.092ValAsn: 3.092 ± 1.706
2.706ValPro: 2.706 ± 0.659
1.546ValGln: 1.546 ± 0.713
1.16ValArg: 1.16 ± 4.154
5.412ValSer: 5.412 ± 1.459
4.252ValThr: 4.252 ± 1.011
1.933ValVal: 1.933 ± 0.722
0.0ValTrp: 0.0 ± 0.0
0.773ValTyr: 0.773 ± 1.062
0.0ValXaa: 0.0 ± 0.0
Trp
0.773TrpAla: 0.773 ± 1.062
0.0TrpCys: 0.0 ± 0.0
0.773TrpAsp: 0.773 ± 0.356
1.16TrpGlu: 1.16 ± 0.535
0.387TrpPhe: 0.387 ± 0.178
0.773TrpGly: 0.773 ± 1.062
0.0TrpHis: 0.0 ± 0.0
0.773TrpIle: 0.773 ± 0.356
0.387TrpLys: 0.387 ± 1.207
1.16TrpLeu: 1.16 ± 0.535
0.387TrpMet: 0.387 ± 1.207
0.773TrpAsn: 0.773 ± 0.356
0.0TrpPro: 0.0 ± 0.0
1.546TrpGln: 1.546 ± 1.791
0.0TrpArg: 0.0 ± 0.0
1.546TrpSer: 1.546 ± 0.713
0.0TrpThr: 0.0 ± 0.0
0.387TrpVal: 0.387 ± 0.178
0.0TrpTrp: 0.0 ± 0.0
0.773TrpTyr: 0.773 ± 0.356
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.773TyrAla: 0.773 ± 1.062
0.0TyrCys: 0.0 ± 0.0
1.16TyrAsp: 1.16 ± 0.535
1.933TyrGlu: 1.933 ± 0.891
2.319TyrPhe: 2.319 ± 1.069
1.16TyrGly: 1.16 ± 0.535
0.773TyrHis: 0.773 ± 0.356
4.639TyrIle: 4.639 ± 1.152
3.479TyrLys: 3.479 ± 0.775
3.092TyrLeu: 3.092 ± 0.697
1.546TyrMet: 1.546 ± 0.713
1.933TyrAsn: 1.933 ± 0.891
1.933TyrPro: 1.933 ± 0.722
1.546TyrGln: 1.546 ± 0.812
1.546TyrArg: 1.546 ± 0.812
1.546TyrSer: 1.546 ± 0.713
3.092TyrThr: 3.092 ± 1.425
0.387TyrVal: 0.387 ± 0.178
0.773TyrTrp: 0.773 ± 0.356
1.546TyrTyr: 1.546 ± 0.713
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2588 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski