Amino acid dipepetide frequency for Witwatersrand virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.711AlaAla: 1.711 ± 0.567
0.489AlaCys: 0.489 ± 0.438
3.178AlaAsp: 3.178 ± 1.077
1.711AlaGlu: 1.711 ± 0.374
1.711AlaPhe: 1.711 ± 0.792
2.689AlaGly: 2.689 ± 0.841
1.222AlaHis: 1.222 ± 0.741
5.378AlaIle: 5.378 ± 1.151
4.4AlaLys: 4.4 ± 1.04
3.911AlaLeu: 3.911 ± 1.031
1.467AlaMet: 1.467 ± 0.549
3.178AlaAsn: 3.178 ± 0.542
0.978AlaPro: 0.978 ± 0.593
3.667AlaGln: 3.667 ± 0.356
2.444AlaArg: 2.444 ± 0.741
3.178AlaSer: 3.178 ± 1.39
2.933AlaThr: 2.933 ± 0.991
1.222AlaVal: 1.222 ± 0.948
0.733AlaTrp: 0.733 ± 0.43
2.933AlaTyr: 2.933 ± 0.676
0.0AlaXaa: 0.0 ± 0.0
Cys
1.711CysAla: 1.711 ± 0.719
0.0CysCys: 0.0 ± 0.0
1.711CysAsp: 1.711 ± 0.567
1.467CysGlu: 1.467 ± 0.976
1.711CysPhe: 1.711 ± 0.567
2.2CysGly: 2.2 ± 1.971
0.489CysHis: 0.489 ± 0.128
2.689CysIle: 2.689 ± 0.585
2.689CysLys: 2.689 ± 2.069
2.689CysLeu: 2.689 ± 1.123
0.489CysMet: 0.489 ± 0.128
0.733CysAsn: 0.733 ± 0.326
0.978CysPro: 0.978 ± 0.291
1.956CysGln: 1.956 ± 0.77
0.733CysArg: 0.733 ± 0.657
1.222CysSer: 1.222 ± 0.26
1.222CysThr: 1.222 ± 1.095
1.467CysVal: 1.467 ± 0.653
0.0CysTrp: 0.0 ± 0.0
0.733CysTyr: 0.733 ± 0.326
0.0CysXaa: 0.0 ± 0.0
Asp
2.444AspAla: 2.444 ± 0.886
1.467AspCys: 1.467 ± 0.338
2.2AspAsp: 2.2 ± 0.426
1.956AspGlu: 1.956 ± 0.582
3.667AspPhe: 3.667 ± 0.781
2.933AspGly: 2.933 ± 0.636
0.733AspHis: 0.733 ± 0.445
6.6AspIle: 6.6 ± 0.665
4.155AspLys: 4.155 ± 0.492
4.644AspLeu: 4.644 ± 0.874
1.222AspMet: 1.222 ± 0.462
4.644AspAsn: 4.644 ± 0.665
1.711AspPro: 1.711 ± 0.839
2.933AspGln: 2.933 ± 0.676
3.178AspArg: 3.178 ± 1.165
2.689AspSer: 2.689 ± 0.767
2.689AspThr: 2.689 ± 0.606
2.933AspVal: 2.933 ± 0.235
0.489AspTrp: 0.489 ± 0.128
2.2AspTyr: 2.2 ± 0.494
0.0AspXaa: 0.0 ± 0.0
Glu
3.911GluAla: 3.911 ± 0.733
1.467GluCys: 1.467 ± 0.383
3.178GluAsp: 3.178 ± 0.962
4.644GluGlu: 4.644 ± 1.004
2.933GluPhe: 2.933 ± 1.147
1.711GluGly: 1.711 ± 0.567
2.2GluHis: 2.2 ± 0.522
4.889GluIle: 4.889 ± 1.456
3.911GluLys: 3.911 ± 1.164
6.6GluLeu: 6.6 ± 0.516
2.933GluMet: 2.933 ± 1.214
2.933GluAsn: 2.933 ± 1.011
2.689GluPro: 2.689 ± 0.742
1.711GluGln: 1.711 ± 0.452
2.444GluArg: 2.444 ± 0.618
3.422GluSer: 3.422 ± 0.905
4.155GluThr: 4.155 ± 0.633
4.155GluVal: 4.155 ± 0.908
0.489GluTrp: 0.489 ± 0.796
3.178GluTyr: 3.178 ± 0.276
0.0GluXaa: 0.0 ± 0.0
Phe
2.2PheAla: 2.2 ± 0.979
0.733PheCys: 0.733 ± 0.445
2.444PheAsp: 2.444 ± 1.192
2.933PheGlu: 2.933 ± 0.766
2.444PhePhe: 2.444 ± 0.253
2.689PheGly: 2.689 ± 0.655
1.222PheHis: 1.222 ± 0.43
3.422PheIle: 3.422 ± 0.432
3.422PheLys: 3.422 ± 0.905
5.622PheLeu: 5.622 ± 2.387
1.222PheMet: 1.222 ± 0.43
3.178PheAsn: 3.178 ± 0.628
1.222PhePro: 1.222 ± 0.26
0.978PheGln: 0.978 ± 0.49
0.733PheArg: 0.733 ± 0.169
2.933PheSer: 2.933 ± 0.235
4.155PheThr: 4.155 ± 0.919
2.933PheVal: 2.933 ± 0.342
0.244PheTrp: 0.244 ± 0.148
1.711PheTyr: 1.711 ± 0.719
0.0PheXaa: 0.0 ± 0.0
Gly
1.711GlyAla: 1.711 ± 0.27
1.956GlyCys: 1.956 ± 0.511
2.444GlyAsp: 2.444 ± 0.575
2.444GlyGlu: 2.444 ± 0.253
1.711GlyPhe: 1.711 ± 1.194
1.711GlyGly: 1.711 ± 0.597
1.222GlyHis: 1.222 ± 0.37
3.178GlyIle: 3.178 ± 1.168
4.4GlyLys: 4.4 ± 1.086
4.4GlyLeu: 4.4 ± 1.014
0.978GlyMet: 0.978 ± 1.129
2.444GlyAsn: 2.444 ± 0.86
2.2GlyPro: 2.2 ± 0.862
1.956GlyGln: 1.956 ± 0.511
1.222GlyArg: 1.222 ± 0.43
3.178GlySer: 3.178 ± 0.711
1.956GlyThr: 1.956 ± 1.529
1.711GlyVal: 1.711 ± 1.376
0.489GlyTrp: 0.489 ± 0.128
2.444GlyTyr: 2.444 ± 0.272
0.0GlyXaa: 0.0 ± 0.0
His
0.978HisAla: 0.978 ± 0.49
0.489HisCys: 0.489 ± 0.128
1.467HisAsp: 1.467 ± 0.889
1.711HisGlu: 1.711 ± 0.567
1.222HisPhe: 1.222 ± 0.37
1.467HisGly: 1.467 ± 0.704
0.978HisHis: 0.978 ± 0.291
1.711HisIle: 1.711 ± 0.452
2.689HisLys: 2.689 ± 0.617
2.444HisLeu: 2.444 ± 1.511
0.244HisMet: 0.244 ± 0.219
0.978HisAsn: 0.978 ± 0.593
1.222HisPro: 1.222 ± 0.43
0.733HisGln: 0.733 ± 0.326
0.489HisArg: 0.489 ± 0.128
1.467HisSer: 1.467 ± 0.726
1.467HisThr: 1.467 ± 0.956
1.956HisVal: 1.956 ± 0.582
0.244HisTrp: 0.244 ± 0.148
0.978HisTyr: 0.978 ± 0.291
0.0HisXaa: 0.0 ± 0.0
Ile
4.155IleAla: 4.155 ± 0.752
2.933IleCys: 2.933 ± 0.991
6.6IleAsp: 6.6 ± 1.809
5.622IleGlu: 5.622 ± 1.614
2.933IlePhe: 2.933 ± 0.342
2.444IleGly: 2.444 ± 0.618
2.2IleHis: 2.2 ± 0.606
6.6IleIle: 6.6 ± 1.423
7.333IleLys: 7.333 ± 1.203
9.044IleLeu: 9.044 ± 1.468
2.2IleMet: 2.2 ± 0.507
5.378IleAsn: 5.378 ± 0.654
4.155IlePro: 4.155 ± 0.746
2.444IleGln: 2.444 ± 1.012
2.444IleArg: 2.444 ± 1.218
6.6IleSer: 6.6 ± 0.611
4.644IleThr: 4.644 ± 0.65
2.933IleVal: 2.933 ± 0.235
0.489IleTrp: 0.489 ± 0.296
2.689IleTyr: 2.689 ± 0.617
0.0IleXaa: 0.0 ± 0.0
Lys
4.4LysAla: 4.4 ± 0.666
1.956LysCys: 1.956 ± 1.413
5.133LysAsp: 5.133 ± 1.593
8.066LysGlu: 8.066 ± 2.464
4.155LysPhe: 4.155 ± 1.07
2.933LysGly: 2.933 ± 0.491
2.689LysHis: 2.689 ± 0.617
6.844LysIle: 6.844 ± 1.826
5.378LysLys: 5.378 ± 1.235
5.867LysLeu: 5.867 ± 0.973
1.467LysMet: 1.467 ± 0.423
4.644LysAsn: 4.644 ± 1.004
1.711LysPro: 1.711 ± 0.27
1.956LysGln: 1.956 ± 0.865
2.2LysArg: 2.2 ± 0.729
4.889LysSer: 4.889 ± 0.745
5.622LysThr: 5.622 ± 1.7
3.667LysVal: 3.667 ± 1.249
0.489LysTrp: 0.489 ± 0.128
3.911LysTyr: 3.911 ± 1.539
0.0LysXaa: 0.0 ± 0.0
Leu
6.111LeuAla: 6.111 ± 0.677
1.956LeuCys: 1.956 ± 1.082
6.111LeuAsp: 6.111 ± 1.407
5.622LeuGlu: 5.622 ± 1.138
2.689LeuPhe: 2.689 ± 0.617
3.667LeuGly: 3.667 ± 1.711
1.711LeuHis: 1.711 ± 0.656
6.6LeuIle: 6.6 ± 3.325
7.578LeuLys: 7.578 ± 0.732
8.555LeuLeu: 8.555 ± 3.353
2.2LeuMet: 2.2 ± 1.012
3.422LeuAsn: 3.422 ± 0.77
3.667LeuPro: 3.667 ± 0.578
4.644LeuGln: 4.644 ± 0.835
6.844LeuArg: 6.844 ± 1.914
4.644LeuSer: 4.644 ± 0.665
7.822LeuThr: 7.822 ± 2.229
3.911LeuVal: 3.911 ± 0.739
0.733LeuTrp: 0.733 ± 0.169
4.4LeuTyr: 4.4 ± 1.027
0.0LeuXaa: 0.0 ± 0.0
Met
0.978MetAla: 0.978 ± 1.606
1.222MetCys: 1.222 ± 0.43
0.978MetAsp: 0.978 ± 0.593
0.978MetGlu: 0.978 ± 0.593
1.222MetPhe: 1.222 ± 0.37
1.222MetGly: 1.222 ± 0.688
0.978MetHis: 0.978 ± 0.255
2.444MetIle: 2.444 ± 0.522
1.956MetLys: 1.956 ± 0.324
1.222MetLeu: 1.222 ± 0.43
0.733MetMet: 0.733 ± 0.303
1.711MetAsn: 1.711 ± 0.719
0.978MetPro: 0.978 ± 0.255
1.711MetGln: 1.711 ± 0.954
0.978MetArg: 0.978 ± 0.593
2.689MetSer: 2.689 ± 0.682
1.467MetThr: 1.467 ± 0.487
1.467MetVal: 1.467 ± 0.338
0.0MetTrp: 0.0 ± 0.0
0.978MetTyr: 0.978 ± 0.255
0.0MetXaa: 0.0 ± 0.0
Asn
2.933AsnAla: 2.933 ± 0.82
0.489AsnCys: 0.489 ± 0.438
3.667AsnAsp: 3.667 ± 1.29
3.667AsnGlu: 3.667 ± 0.559
4.4AsnPhe: 4.4 ± 0.729
1.467AsnGly: 1.467 ± 0.383
1.956AsnHis: 1.956 ± 0.42
4.155AsnIle: 4.155 ± 0.495
2.2AsnLys: 2.2 ± 0.426
5.133AsnLeu: 5.133 ± 1.121
1.467AsnMet: 1.467 ± 0.338
3.911AsnAsn: 3.911 ± 0.466
2.2AsnPro: 2.2 ± 0.893
1.956AsnGln: 1.956 ± 0.251
2.689AsnArg: 2.689 ± 1.003
3.422AsnSer: 3.422 ± 0.905
1.956AsnThr: 1.956 ± 0.608
2.444AsnVal: 2.444 ± 0.272
1.711AsnTrp: 1.711 ± 0.631
3.422AsnTyr: 3.422 ± 0.77
0.0AsnXaa: 0.0 ± 0.0
Pro
1.467ProAla: 1.467 ± 0.967
0.244ProCys: 0.244 ± 0.148
1.711ProAsp: 1.711 ± 1.007
3.422ProGlu: 3.422 ± 0.528
1.711ProPhe: 1.711 ± 0.374
2.933ProGly: 2.933 ± 0.636
0.733ProHis: 0.733 ± 0.169
4.4ProIle: 4.4 ± 1.04
2.2ProLys: 2.2 ± 0.692
2.444ProLeu: 2.444 ± 0.549
0.489ProMet: 0.489 ± 0.438
0.978ProAsn: 0.978 ± 0.291
0.489ProPro: 0.489 ± 0.524
0.489ProGln: 0.489 ± 0.476
1.222ProArg: 1.222 ± 0.68
2.444ProSer: 2.444 ± 0.522
2.444ProThr: 2.444 ± 0.272
0.978ProVal: 0.978 ± 0.255
0.489ProTrp: 0.489 ± 0.296
1.222ProTyr: 1.222 ± 0.26
0.0ProXaa: 0.0 ± 0.0
Gln
1.956GlnAla: 1.956 ± 0.852
0.0GlnCys: 0.0 ± 0.0
2.2GlnAsp: 2.2 ± 0.507
4.155GlnGlu: 4.155 ± 0.457
1.711GlnPhe: 1.711 ± 0.358
2.2GlnGly: 2.2 ± 0.507
0.489GlnHis: 0.489 ± 0.796
3.422GlnIle: 3.422 ± 0.577
3.422GlnLys: 3.422 ± 0.752
2.933GlnLeu: 2.933 ± 0.676
0.489GlnMet: 0.489 ± 0.476
1.711GlnAsn: 1.711 ± 0.374
0.733GlnPro: 0.733 ± 0.445
2.2GlnGln: 2.2 ± 0.719
2.444GlnArg: 2.444 ± 0.549
2.444GlnSer: 2.444 ± 0.639
2.444GlnThr: 2.444 ± 0.272
2.2GlnVal: 2.2 ± 0.862
0.489GlnTrp: 0.489 ± 0.524
1.956GlnTyr: 1.956 ± 0.746
0.0GlnXaa: 0.0 ± 0.0
Arg
1.467ArgAla: 1.467 ± 0.573
2.2ArgCys: 2.2 ± 1.299
2.689ArgAsp: 2.689 ± 0.629
1.467ArgGlu: 1.467 ± 0.704
1.956ArgPhe: 1.956 ± 1.197
1.467ArgGly: 1.467 ± 0.721
0.733ArgHis: 0.733 ± 0.445
3.422ArgIle: 3.422 ± 1.542
3.178ArgLys: 3.178 ± 0.742
4.644ArgLeu: 4.644 ± 1.578
1.467ArgMet: 1.467 ± 0.423
2.689ArgAsn: 2.689 ± 1.125
1.222ArgPro: 1.222 ± 0.948
1.956ArgGln: 1.956 ± 0.746
1.222ArgArg: 1.222 ± 0.43
3.422ArgSer: 3.422 ± 0.622
2.689ArgThr: 2.689 ± 0.819
2.444ArgVal: 2.444 ± 1.303
0.733ArgTrp: 0.733 ± 0.789
2.2ArgTyr: 2.2 ± 0.881
0.0ArgXaa: 0.0 ± 0.0
Ser
3.667SerAla: 3.667 ± 1.587
3.178SerCys: 3.178 ± 1.84
3.911SerAsp: 3.911 ± 1.164
3.911SerGlu: 3.911 ± 0.792
2.689SerPhe: 2.689 ± 1.437
2.933SerGly: 2.933 ± 0.722
1.467SerHis: 1.467 ± 0.573
6.6SerIle: 6.6 ± 1.424
4.4SerLys: 4.4 ± 0.505
6.355SerLeu: 6.355 ± 0.336
1.711SerMet: 1.711 ± 0.597
1.956SerAsn: 1.956 ± 0.711
1.711SerPro: 1.711 ± 0.631
2.933SerGln: 2.933 ± 0.235
3.667SerArg: 3.667 ± 0.845
3.911SerSer: 3.911 ± 3.138
4.889SerThr: 4.889 ± 0.341
3.178SerVal: 3.178 ± 1.009
0.244SerTrp: 0.244 ± 0.83
2.933SerTyr: 2.933 ± 0.653
0.0SerXaa: 0.0 ± 0.0
Thr
3.911ThrAla: 3.911 ± 0.739
2.2ThrCys: 2.2 ± 0.979
2.444ThrAsp: 2.444 ± 0.86
3.911ThrGlu: 3.911 ± 0.502
3.178ThrPhe: 3.178 ± 0.962
3.667ThrGly: 3.667 ± 1.636
0.978ThrHis: 0.978 ± 0.541
4.889ThrIle: 4.889 ± 1.378
5.622ThrLys: 5.622 ± 1.029
5.867ThrLeu: 5.867 ± 1.981
1.467ThrMet: 1.467 ± 0.423
3.911ThrAsn: 3.911 ± 1.258
1.711ThrPro: 1.711 ± 0.597
1.222ThrGln: 1.222 ± 0.741
3.911ThrArg: 3.911 ± 2.86
5.622ThrSer: 5.622 ± 1.328
6.355ThrThr: 6.355 ± 3.317
3.422ThrVal: 3.422 ± 1.422
1.467ThrTrp: 1.467 ± 1.142
2.933ThrTyr: 2.933 ± 0.342
0.0ThrXaa: 0.0 ± 0.0
Val
1.467ValAla: 1.467 ± 1.443
1.222ValCys: 1.222 ± 1.095
2.2ValAsp: 2.2 ± 0.729
2.444ValGlu: 2.444 ± 0.575
1.711ValPhe: 1.711 ± 0.452
1.467ValGly: 1.467 ± 0.291
1.467ValHis: 1.467 ± 0.922
3.178ValIle: 3.178 ± 0.864
3.422ValLys: 3.422 ± 0.282
3.911ValLeu: 3.911 ± 0.739
0.978ValMet: 0.978 ± 0.291
2.444ValAsn: 2.444 ± 1.234
1.711ValPro: 1.711 ± 0.358
1.711ValGln: 1.711 ± 0.836
1.467ValArg: 1.467 ± 0.704
4.155ValSer: 4.155 ± 0.919
4.889ValThr: 4.889 ± 1.111
2.689ValVal: 2.689 ± 0.207
0.244ValTrp: 0.244 ± 0.219
4.4ValTyr: 4.4 ± 0.505
0.0ValXaa: 0.0 ± 0.0
Trp
0.978TrpAla: 0.978 ± 0.291
0.244TrpCys: 0.244 ± 0.148
0.489TrpAsp: 0.489 ± 0.128
0.733TrpGlu: 0.733 ± 0.484
0.733TrpPhe: 0.733 ± 0.169
0.244TrpGly: 0.244 ± 0.219
0.489TrpHis: 0.489 ± 0.818
0.489TrpIle: 0.489 ± 0.128
0.0TrpLys: 0.0 ± 0.0
1.956TrpLeu: 1.956 ± 1.526
0.244TrpMet: 0.244 ± 0.512
0.733TrpAsn: 0.733 ± 0.789
0.244TrpPro: 0.244 ± 0.219
0.733TrpGln: 0.733 ± 0.748
0.244TrpArg: 0.244 ± 0.512
0.978TrpSer: 0.978 ± 0.593
0.244TrpThr: 0.244 ± 0.219
0.244TrpVal: 0.244 ± 0.148
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.222TyrAla: 1.222 ± 0.68
2.444TyrCys: 2.444 ± 1.192
0.733TyrAsp: 0.733 ± 0.169
2.689TyrGlu: 2.689 ± 1.003
1.956TyrPhe: 1.956 ± 0.511
1.711TyrGly: 1.711 ± 0.656
0.978TyrHis: 0.978 ± 0.255
2.933TyrIle: 2.933 ± 0.631
5.622TyrLys: 5.622 ± 1.256
4.644TyrLeu: 4.644 ± 0.904
2.2TyrMet: 2.2 ± 0.719
3.667TyrAsn: 3.667 ± 1.139
0.978TyrPro: 0.978 ± 0.49
1.711TyrGln: 1.711 ± 0.719
2.689TyrArg: 2.689 ± 0.781
2.933TyrSer: 2.933 ± 1.623
4.644TyrThr: 4.644 ± 0.861
1.222TyrVal: 1.222 ± 0.43
0.244TyrTrp: 0.244 ± 0.148
1.222TyrTyr: 1.222 ± 0.625
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4092 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski